Snowflake & Snowpark Python
2026 Edition. A deep dive into Snowflake's core architecture and the Snowpark Python library. Discover how to utilize DataFrames, custom UDFs, Stored Procedures, pandas on Snowflake, and ML training natively in the Data Cloud.
Episodes
The Snowflake Architecture
3m 30sDiscover the foundational architecture of Snowflake. This episode explains the separation of compute and storage, and how Snowflake combines shared-disk and shared-nothing architectures.
Introducing Snowpark Python
3m 21sLearn how Snowpark brings Python to your data. We cover the DataFrame abstraction and how Snowpark translates Python code into distributed SQL.
Establishing a Session
3m 34sLearn how to securely connect to Snowflake using Snowpark. We discuss Session builder configurations, external browser SSO, and managing credentials.
Python Worksheets in Snowsight
2m 48sDiscover how to write and execute Snowpark Python directly within the Snowflake UI. Skip the local setup and utilize pre-installed Anaconda packages.
Constructing Snowpark DataFrames
3m 21sGet started with the core abstraction of Snowpark: the DataFrame. Learn how to reference tables, create DataFrames from literal values, and leverage lazy evaluation.
Transforming DataFrames
3m 18sMaster DataFrame transformations in Snowpark. We explore filtering, selecting columns, joining datasets, and triggering execution with action methods.
Calling System Functions
3m 14sLearn how to invoke Snowflake's powerful built-in SQL functions directly from your Python DataFrames without rewriting logic.
pandas on Snowflake: The Paradigm Shift
3m 38sDiscover how to run standard pandas code at massive scale. We introduce the Modin plugin that allows pandas execution natively within Snowflake.
pandas on Snowflake: Hybrid Execution
3m 44sDive into the mechanics of Hybrid Execution in pandas on Snowflake. Learn how the engine seamlessly switches between distributed cloud compute and local memory.
Ingesting External Data
3m 22sLearn how to ingest data from external systems using Snowpark DB-API and JDBC. Bring operational data directly into your analytical pipelines.
Creating Scalar UDFs
3m 40sBring custom Python logic to your SQL queries. This episode covers creating anonymous and named User-Defined Functions (UDFs) in Snowpark.
Dependency Management for UDFs
3m 44sLearn how to import third-party libraries and local modules into your Snowflake UDFs using the Anaconda channel and session imports.
Building User-Defined Table Functions
3m 26sMove beyond scalar values. Learn how to build User-Defined Table Functions (UDTFs) to return multiple rows and columns from a single input.
Authoring Stored Procedures
3m 42sAutomate your pipelines entirely within Snowflake. We explore creating Stored Procedures to execute complex business logic and control flow.
Training ML Models in Snowflake
3m 50sDiscover how to train heavy machine learning models securely on Snowpark-optimized warehouses using custom Python Stored Procedures.
Dynamic File Access with SnowflakeFile
3m 27sLearn how to stream large unstructured files dynamically from internal stages directly inside your UDFs and Stored Procedures.
Testing Snowpark Python
3m 42sEnsure your data pipelines are robust. We cover setting up PyTest, creating Session fixtures, and unit testing DataFrame transformations.