LlamaIndex: Context-Augmented LLM Applications
v0.14 — 2026 Edition. A comprehensive guide to LlamaIndex, covering context augmentation, RAG pipelines, autonomous agents, and multi-agent workflows. Learn how to build production-ready LLM applications using version 0.14.
Episodes
The Context-Augmentation Imperative
3m 49sDiscover the foundational concepts of LlamaIndex and why LLMs need external context to be truly useful. This episode covers the philosophy behind Retrieval-Augmented Generation, workflows, and agentic applications.
Data Ingestion: Documents and Nodes
4m 11sExplore the first half of the RAG pipeline. You will learn about Connectors, Documents, Nodes, and the critical process of indexing unstructured data into vector embeddings.
The Query Pipeline: Retrievers and Routers
3m 20sDive into the second half of the RAG lifecycle. Learn how Retrievers find relevant chunks, how Routers select the best approach, and how Postprocessors refine the context for the LLM.
Interfacing with LLMs and Multi-Modal Inputs
3m 42sMaster the LlamaIndex LLM class for natural language generation. This episode breaks down chat interfaces, streaming responses, and feeding images to multi-modal models.
Structured Data Extraction with Pydantic
3m 24sLearn how to force unpredictable LLMs to return strict, typed JSON data. Discover how Pydantic BaseModels act as schemas to extract reliable structured information from raw text.
Building Autonomous Function Agents
3m 28sTake the leap from static code to autonomous agents. You will learn how to wrap Python functions into tools and deploy a FunctionAgent to execute tasks dynamically.
Extending Agents with LlamaHub Tools
3m 30sSupercharge your agents with pre-built integrations. This episode shows how to browse LlamaHub, install tool specs, and give your agent real-world capabilities instantly.
Multi-Agent Swarms with AgentWorkflow
2m 59sMove beyond single-agent setups. Learn how to configure a linear swarm of specialized agents that autonomously hand off tasks to one another using AgentWorkflow.
The Orchestrator Agent Pattern
3m 38sTake granular control of your agentic workflows. Discover how to build a master orchestrator agent that manages subordinate agents as callable tools.
Custom Multi-Agent Planners
3m 34sAchieve ultimate multi-agent flexibility. Learn how to roll your own orchestration loop using custom XML prompting, Pydantic, and imperative execution.
Human-in-the-Loop Workflows
3m 09sPrevent autonomous disasters by keeping a human in the loop. You will learn how to pause workflows with events to wait for human confirmation before executing dangerous tasks.
Observability and Tracing
3m 26sStop debugging AI with print statements. This episode explores LlamaIndex callbacks and one-click observability to trace inputs, durations, and outputs across complex pipelines.
RAG Evaluation Metrics
3m 50sMeasure the true effectiveness of your applications. Learn how to use FaithfulnessEvaluator and RetrieverEvaluator to objectively score retrieval and response quality.
Scaffold to Production
3m 35sTransform prototypes into full applications instantly. Discover how to use create-llama and the RAG CLI to scaffold full-stack web apps and terminal chats without writing boilerplate.