Π£ Π½Π°Ρ Π²Ρ ΠΌΠΎΠΆΠ΅ΡΠ΅ ΠΏΠΎΡΠΌΠΎΡΡΠ΅ΡΡ Π±Π΅ΡΠΏΠ»Π°ΡΠ½ΠΎ Intro to Foundry Pipeline Builder ΠΈΠ»ΠΈ ΡΠΊΠ°ΡΠ°ΡΡ Π² ΠΌΠ°ΠΊΡΠΈΠΌΠ°Π»ΡΠ½ΠΎΠΌ Π΄ΠΎΡΡΡΠΏΠ½ΠΎΠΌ ΠΊΠ°ΡΠ΅ΡΡΠ²Π΅, Π²ΠΈΠ΄Π΅ΠΎ ΠΊΠΎΡΠΎΡΠΎΠ΅ Π±ΡΠ»ΠΎ Π·Π°Π³ΡΡΠΆΠ΅Π½ΠΎ Π½Π° ΡΡΡΠ±. ΠΠ»Ρ Π·Π°Π³ΡΡΠ·ΠΊΠΈ Π²ΡΠ±Π΅ΡΠΈΡΠ΅ Π²Π°ΡΠΈΠ°Π½Ρ ΠΈΠ· ΡΠΎΡΠΌΡ Π½ΠΈΠΆΠ΅:
ΠΡΠ»ΠΈ ΠΊΠ½ΠΎΠΏΠΊΠΈ ΡΠΊΠ°ΡΠΈΠ²Π°Π½ΠΈΡ Π½Π΅
Π·Π°Π³ΡΡΠ·ΠΈΠ»ΠΈΡΡ
ΠΠΠΠΠΠ’Π ΠΠΠΠ‘Π¬ ΠΈΠ»ΠΈ ΠΎΠ±Π½ΠΎΠ²ΠΈΡΠ΅ ΡΡΡΠ°Π½ΠΈΡΡ
ΠΡΠ»ΠΈ Π²ΠΎΠ·Π½ΠΈΠΊΠ°ΡΡ ΠΏΡΠΎΠ±Π»Π΅ΠΌΡ ΡΠΎ ΡΠΊΠ°ΡΠΈΠ²Π°Π½ΠΈΠ΅ΠΌ Π²ΠΈΠ΄Π΅ΠΎ, ΠΏΠΎΠΆΠ°Π»ΡΠΉΡΡΠ° Π½Π°ΠΏΠΈΡΠΈΡΠ΅ Π² ΠΏΠΎΠ΄Π΄Π΅ΡΠΆΠΊΡ ΠΏΠΎ Π°Π΄ΡΠ΅ΡΡ Π²Π½ΠΈΠ·Ρ
ΡΡΡΠ°Π½ΠΈΡΡ.
Π‘ΠΏΠ°ΡΠΈΠ±ΠΎ Π·Π° ΠΈΡΠΏΠΎΠ»ΡΠ·ΠΎΠ²Π°Π½ΠΈΠ΅ ΡΠ΅ΡΠ²ΠΈΡΠ° ClipSaver.ru
Pipeline Builder is the most accessible way to build production-grade data pipelines in Palantir Foundry. This video is an intro tutorial that covers the basics of how to use Pipeline Builder and some essential data engineering concepts. It also showcases the first few AI features of Pipeline Builder. Ontologize Founded by Taylor Gregoire-Wright, a former Palantir implementation engineer, Ontologize offers courses & live trainings for Palantir Foundry. Visit https://ontologize.com or connect on LinkedIn: Β Β /Β tgregoirewrightΒ Β You can follow along with the same data I used for this tutorial The data used in this tutorial is notional data from a fictional set of grocery stores in the US. The datasets include: Transactions - A customer makes a transaction when they buy groceries Baskets - Baskets represents which items and how many of each were bought in a single transaction Customers - Each row is information about a single customer Products - Descriptions about the different products customers can buy, including product name, brand, store department, etc. Stores - The stores that the parent company owns Download the data from Ontologize's GitHub: https://github.com/ontologize/fake-gr... 0:00 Creating a pipeline 1:30 Reference pipeline 3:29 Adding input data 4:40 Cleaning transactions data 7:30 Aggregating data 9:00 Joining data 10:35 Nesting expressions 12:12 Completing the transactions pipeline 15:23 Changing column order 16:15 Editing upstream parts of the pipeline 17:34 AI Feature: auto-naming 17:56 Drag and drop pipeline nodes 18:39 Creating an output dataset 19:29 Saving, Builds, and Jobs 20:38 Saving vs. Deploying 23:20 Organizing with colors 24:16 Creating synthetic primary keys 26:46 Wrapping up the rest of the pipeline 28:59 Reusables: parameters aka variables 30:43 Reusables: functions 32:37 AI Features: auto-generating regex expressions 34:04 Version Control in Pipeline Builder 38:24 Pipeline organization 41:43 Course announcement