У нас вы можете посмотреть бесплатно Data Engineering Zoomcamp 2025 - Streaming - with Zach Wilson или скачать в максимальном доступном качестве, видео которое было загружено на ютуб. Для загрузки выберите вариант из формы ниже:
Если кнопки скачивания не
загрузились
НАЖМИТЕ ЗДЕСЬ или обновите страницу
Если возникают проблемы со скачиванием видео, пожалуйста напишите в поддержку по адресу внизу
страницы.
Спасибо за использование сервиса ClipSaver.ru
Links: Repo: https://github.com/EcZachly/flink-tra... Course: https://github.com/DataTalksClub/data... 0:00 - Introduction to the Workshop 0:33 - Overview of the Session and Goals 1:04 - Tools and Technologies Used: Docker, Flink, PostgreSQL, Kafka 2:04 - Understanding the Four Components in Docker 3:41 - Setting Up and Running Docker Containers 5:05 - Verifying Flink and PostgreSQL Setup 6:18 - Configuring PostgreSQL for Data Storage 7:48 - Creating the Processed Events Table 9:19 - Introduction to Red Panda as a Kafka Alternative 11:12 - Sending Data to Kafka Using a Producer 13:05 - Kafka Message Serialization Explained 15:24 - Running the Kafka Producer and Checking Data Flow 17:24 - Flink: Connecting Kafka to PostgreSQL 19:44 - Understanding Kafka Offsets and Data Consumption Strategies 23:03 - Configuring Flink Checkpointing for Fault Tolerance 25:19 - Flink Source and Sink: Reading from Kafka and Writing to PostgreSQL 30:25 - Deploying a Flink Job Using Docker 32:54 - Verifying Data Flow in PostgreSQL in Real Time 35:48 - Understanding Flink Parallelism and Scaling 38:54 - Introduction to Flink Windowing and Watermarking 42:48 - Handling Out-of-Order Events with Watermarks 45:56 - Live Demonstration of Data Flow Through the Pipeline 49:45 - Troubleshooting Flink Jobs and Avoiding Duplicates 52:54 - Changing Flink Offset Strategies: Earliest vs. Latest 57:13 - Aggregating Data in Flink Using Group By 1:01:45 - Understanding Flink Session Windows and Sliding Windows 1:05:24 - Live Debugging and Fixing Aggregation Jobs 1:09:50 - Use Cases for Streaming vs. Batch Processing 1:14:23 - Spark Streaming vs. Flink Streaming: Key Differences 1:19:12 - Scaling Kafka and Handling High-Throughput Events 1:24:44 - Choosing the Right Streaming Tool: Kafka vs. RabbitMQ 1:30:10 - Closing Remarks and Q&A Session 🔗 CONNECT WITH DataTalksClub Join the community - https://datatalks.club/slack.html Subscribe to our Google calendar to have all our events in your calendar - https://calendar.google.com/calendar/... Check other upcoming events - https://lu.ma/dtc-events LinkedIn - / datatalks-club Twitter - / datatalksclub Website - https://datatalks.club/ 🔗 CONNECT WITH ALEXEY Twitter - / al_grigor Linkedin - / agrigorev 📚Check our free online courses ML Engineering course - http://mlzoomcamp.com Data Engineering course - https://github.com/DataTalksClub/data... uj7jh1c6xg MLOps course - https://github.com/DataTalksClub/mlop... Analytics in Stock Markets - https://github.com/DataTalksClub/stoc... LLM course - https://github.com/DataTalksClub/llm-... Read about all our courses in one place - https://datatalks.club/blog/guide-to-... 👋🏼 GET IN TOUCH If you want to support our community, use this link - https://github.com/sponsors/alexeygri... If you’re a company, reach us at [email protected]