У нас вы можете посмотреть бесплатно Monitoring Half a Million ML Models, IoT Streaming Data, and Automated Quality Check on Delta Lake или скачать в максимальном доступном качестве, видео которое было загружено на ютуб. Для загрузки выберите вариант из формы ниже:
Если кнопки скачивания не
загрузились
НАЖМИТЕ ЗДЕСЬ или обновите страницу
Если возникают проблемы со скачиванием видео, пожалуйста напишите в поддержку по адресу внизу
страницы.
Спасибо за использование сервиса ClipSaver.ru
Quby, an Amsterdam-based technology company, offers solutions to empower homeowners to stay in control of their electricity, gas and water usage. Using Europe’s largest energy dataset, consisting of petabytes of IoT data, the company has developed AI powered products that are used by hundreds of thousands of users on a daily basis. Delta Lake ensures the quality of incoming records though schema enforcement and evolution. But it is the Data Engineers role to check whether the expected data is ingested in to the Delta Lake at the right time with expected metrics so that downstream processes will function their duties. Re-training models and serving on the fly might go wrong unless we put the right monitoring infrastructure too . Quality data without a good performing model or the best model without quality data, do not bring any value. Our use-cases need training of more than half a million models on a daily basis. These models will be automatically used in production environments without human interference. We also stream training data to our delta lake in a near real-time fashion. Before training the models we have to make sure that there is enough and quality data that pass the minimum threshold we set. Even-though we monitor our data quality the accuracy of our models varies depending on multiple variables observed on the daily collected data. Therefore, we need to monitor the performances of our models too. At last we also need to evaluate the result produced by our algorithms both in terms of quality and quantity before we serve it. In this presentation, we will demonstrate how we are using Databricks dashboards to monitor our raw and processed data quality metrics. We will also present using ML flow to keep track of the performances of models. At last we will show you how we have integrated Slack to receive alerts when there is a failure at any stage of our data crunching process. About: Databricks provides a unified data analytics platform, powered by Apache Spark™, that accelerates innovation by unifying data science, engineering and business. Read more here: https://databricks.com/product/unifie... See all the previous Summit sessions: Connect with us: Website: https://databricks.com Facebook: / databricksinc Twitter: / databricks LinkedIn: / databricks Instagram: / databricksinc Databricks is proud to announce that Gartner has named us a Leader in both the 2021 Magic Quadrant for Cloud Database Management Systems and the 2021 Magic Quadrant for Data Science and Machine Learning Platforms. Download the reports here. https://databricks.com/databricks-nam...