2025-04-23 –, Main hall
Learn how to build resilient data pipelines with Django, Celery, and idempotent consumers. We'll dive into robust error-handling techniques and the role of idempotency in ensuring reliable and consistent data processing.
Building resilient data pipelines is a critical challenge in modern applications. In this talk, we’ll explore how to leverage Django, Celery, and idempotent consumers to create robust pipelines that handle errors gracefully and ensure data consistency.
The presentation will introduce a scenario where we must build a data processing pipeline for a real-world problem: Dealing with content moderation in an application such as Instagram. We will start with the simplest solution, why not do it all in a single function? This will introduce us to new issues such as duplicate tasks, partial updates, and unpredictable failures. From there, we’ll evolve our solution using:
Django as the foundation for pipeline orchestration.
Celery for task management to handle asynchronous workloads effectively.
Idempotency as a core design principle for ensuring reliable and repeatable task execution.
Through real-world examples and actionable patterns, this talk will demonstrate how to combine these tools to build pipelines that are not only efficient but also bulletproof. You’ll walk away with a toolkit of strategies for error handling, task retries, and data integrity, ready to tackle even the toughest pipeline challenges.
Intermediate
Topics –Architecture, General Python, Async
I'm a software engineer who loves learning and testing new things. Currently working as a development consultant at Vinta. My favorite thing is to break systems and then think in a way of making them fail-proof.