Mastering Modern Pandas
v3.0 — 2026 Edition. Master the core abstractions and modern capabilities of pandas 3.0 in 2026. Learn about data alignment, Copy-on-Write, PyArrow integration, time series mastery, and strategies for scaling out-of-core datasets.
Episodes
The Core Abstraction: DataFrames and Label Alignment
3m 24s
We explore the foundational mental models of pandas: the Series and the DataFrame. You will learn why intrinsic label alignment is the killer feature that prevents row-mismatch disasters.
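As a rough illustration of the label alignment described in this episode, the sketch below adds two Series whose rows are in different orders. The index labels and values are made up for the example; the point is only that pandas matches on labels rather than on position.

```python
import pandas as pd

# Two Series holding the same labels in a different row order (hypothetical data).
a = pd.Series([1, 2, 3], index=["x", "y", "z"])
b = pd.Series([10, 20, 30], index=["z", "y", "x"])

# Arithmetic aligns on index labels, not on row position,
# so "x" is added to "x" even though the rows are ordered differently.
total = a + b

print(total["x"], total["y"], total["z"])
```

A purely positional addition would have paired 1 with 10; alignment pairs 1 with 30 because both carry the label "x".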
The Copy-on-Write Revolution
3m 43s
Discover the most significant architectural change in modern pandas: Copy-on-Write. You will learn how CoW eliminates unpredictable mutations and optimizes memory usage.
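A minimal sketch of what Copy-on-Write means in practice, assuming CoW is active. The version check is an assumption added so the snippet also behaves the same way on pandas 2.x, where CoW has to be switched on explicitly; the column name is hypothetical.

```python
import pandas as pd

# Assumption: on pandas 2.x Copy-on-Write is opt-in, so enable it there.
# On pandas 3.0 it is the default behavior and no option is needed.
if int(pd.__version__.split(".")[0]) < 3:
    pd.set_option("mode.copy_on_write", True)

df = pd.DataFrame({"a": [1, 2, 3]})

# Selecting a column does not copy the data up front.
col = df["a"]

# Writing to the selection triggers a copy at that moment,
# so the parent DataFrame is left untouched.
col.iloc[0] = 100

print(df.loc[0, "a"])  # value in the original frame
print(col.iloc[0])     # value in the modified selection
```

Under CoW, any object derived from another behaves like a copy, while the actual copying is deferred until a write happens.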
The PyArrow Engine Room
3m 11s
Pandas isn't just powered by NumPy anymore. You will learn how to leverage the PyArrow backend for native missing data support and incredible memory savings on strings.
Modern Data Ingestion
3m 23s
We tackle efficient I/O strategies for large datasets. You will learn how to ingest massive files selectively and directly into highly optimized memory structures.
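One way selective ingestion might look, sketched with an in-memory CSV so the snippet is self-contained. The column names and dtypes are hypothetical; `usecols` and `dtype` are standard `read_csv` parameters.

```python
import io
import pandas as pd

# A small in-memory CSV standing in for a large file (hypothetical columns).
csv_text = "id,city,price,notes\n1,Oslo,9.5,a\n2,Lima,3.25,b\n"

df = pd.read_csv(
    io.StringIO(csv_text),
    usecols=["id", "price"],                   # parse only the columns needed
    dtype={"id": "int32", "price": "float32"}, # choose compact dtypes up front
)

print(df.dtypes)
```

Skipping unused columns and fixing dtypes at read time avoids loading data that would only be discarded or downcast afterwards.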
Relational Algebra: Merge and Join
3m 44s
We explore how to unify disparate datasets using relational algebra. You will learn to execute optimized SQL-style joins directly in pandas.
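A small sketch of an SQL-style left join, using two invented tables. The `validate` and `indicator` arguments are optional `merge` parameters, included here as one possible way to catch unexpected duplicate keys and to see which rows found a match.

```python
import pandas as pd

# Hypothetical tables: orders reference customers by customer_id.
orders = pd.DataFrame({"order_id": [1, 2, 3], "customer_id": [10, 20, 99]})
customers = pd.DataFrame({"customer_id": [10, 20, 30], "name": ["Ana", "Bo", "Cy"]})

joined = orders.merge(
    customers,
    on="customer_id",
    how="left",               # keep every order, like SQL LEFT JOIN
    validate="many_to_one",   # raise if customer_id repeats in customers
    indicator=True,           # add a _merge column describing each row's match
)

print(joined)
```

The order with customer_id 99 has no matching customer, so it is kept with a missing name and flagged as `left_only` in the `_merge` column.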
The Split-Apply-Combine Pattern
3m 32s
Unlock the true power of the GroupBy object. You will learn how to go beyond simple averages to perform complex group-specific transformations and filtering.
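To make the transformation and filtering ideas concrete, here is a minimal sketch on invented data: `transform` returns a result aligned to the original rows, and `filter` keeps or drops whole groups.

```python
import pandas as pd

# Hypothetical scores for two teams.
df = pd.DataFrame({
    "team": ["a", "a", "b", "b", "b"],
    "score": [1.0, 3.0, 2.0, 4.0, 6.0],
})

# Transform: broadcast each team's mean back to its rows,
# then subtract it to center scores within the team.
df["centered"] = df["score"] - df.groupby("team")["score"].transform("mean")

# Filter: keep only the rows of teams that have at least three members.
big_teams = df.groupby("team").filter(lambda g: len(g) >= 3)

print(df)
print(big_teams)
```

Unlike an aggregation, neither operation collapses the groups to one row each: `transform` preserves the original shape, and `filter` returns a subset of the original rows.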
Time Series Mastery
3m 51s
We dive into pandas' undisputed dominance in time series analysis. You will learn how to leverage DatetimeIndex and native resampling for high-frequency data.
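A brief sketch of resampling on a DatetimeIndex, using made-up minute-level values. The timestamps and the three-minute window are arbitrary choices for the example.

```python
import pandas as pd

# Six one-minute observations starting at 09:00 (hypothetical data).
idx = pd.date_range("2026-01-01 09:00", periods=6, freq="min")
ticks = pd.Series([1, 2, 3, 4, 5, 6], index=idx)

# Downsample to three-minute bins and sum the values falling in each bin.
bars = ticks.resample("3min").sum()

print(bars)
```

Each bin is labeled by its left edge, so the first three observations land in the 09:00 bin and the last three in the 09:03 bin.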
Scaling to Out-of-Core Datasets
3m 48s
We tackle the limits of your machine's RAM. You will learn how to process datasets significantly larger than memory using pure pandas chunking.
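One possible shape for chunked processing, sketched with a tiny in-memory CSV in place of a file that would not fit in RAM. The column names and chunk size are hypothetical; the pattern is to aggregate each chunk and then combine the partial results.

```python
import io
import pandas as pd

# Ten rows alternating between keys "a" and "b" (stand-in for a huge file).
csv_text = "key,value\n" + "".join(f"{'ab'[i % 2]},{i}\n" for i in range(10))

partials = []
# chunksize makes read_csv yield DataFrames of at most 4 rows at a time,
# so only one chunk needs to be held in memory.
for chunk in pd.read_csv(io.StringIO(csv_text), chunksize=4):
    partials.append(chunk.groupby("key")["value"].sum())

# Combine the per-chunk sums into one total per key.
totals = pd.concat(partials).groupby(level=0).sum()

print(totals)
```

This works because a sum can be built from partial sums; aggregations such as a median need a different strategy, since they cannot be combined chunk by chunk in the same way.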