One idea, given the time and the interactive surface area it deserves. Drip pieces are essays you read, not chapters you skim — designed to leave you with a working mental model by the last paragraph.
Why language is hard for machines, what tokens are, and how words become vectors an attention mechanism can compare.
Generation as un-corruption. Drag the slider, see noise resolve into a picture, one denoising step at a time.
softmax(QKᵀ/√d)·V — one operation, repeated. Click, hover, and break a real attention matrix as you read.
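For readers who want the operation in code before clicking in, here is a minimal NumPy sketch of scaled dot-product attention; the head size and the random toy inputs are illustrative assumptions, not the piece's actual demo.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """softmax(Q K^T / sqrt(d)) V for a single attention head."""
    d = Q.shape[-1]                                  # query/key dimension
    scores = Q @ K.T / np.sqrt(d)                    # (seq, seq) similarity matrix
    scores -= scores.max(axis=-1, keepdims=True)     # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)   # softmax over the keys
    return weights @ V                               # weighted mix of value rows

# Illustrative toy inputs: 4 tokens, an 8-dimensional head.
rng = np.random.default_rng(0)
Q, K, V = (rng.normal(size=(4, 8)) for _ in range(3))
print(scaled_dot_product_attention(Q, K, V).shape)   # (4, 8)
```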
Fine-tune massive LLMs on consumer hardware. Learn about Low-Rank Adaptation and 4-bit Quantization.
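A minimal sketch of the low-rank adaptation idea, with quantization left out; the layer sizes, rank, and scaling factor below are assumed purely for illustration. The whole trick is a frozen base weight plus a small trainable B·A update.

```python
import numpy as np

d_out, d_in, rank, alpha = 512, 512, 8, 16        # assumed, illustrative sizes

W = np.random.randn(d_out, d_in)                  # frozen pretrained weight
A = np.random.randn(rank, d_in) * 0.01            # trainable low-rank factor
B = np.zeros((d_out, rank))                       # trainable, initialized to zero

def lora_forward(x):
    # Effective weight is W + (alpha / rank) * B @ A, but the full-size
    # matrix is never updated or even materialized during training.
    return W @ x + (alpha / rank) * (B @ (A @ x))

x = np.random.randn(d_in)
print(lora_forward(x).shape)                      # (512,)
print(A.size + B.size, "trainable vs", W.size, "frozen parameters")
```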
Before AI can read, it must chop. Learn how text is broken down into the fundamental atoms of meaning.
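To make the chopping concrete, here is a toy flavour of byte-pair-style merging on a single string; real tokenizers learn their merges from corpus statistics over bytes, so treat this as an illustration only.

```python
from collections import Counter

def toy_bpe(text, n_merges=3):
    """Repeatedly merge the most frequent adjacent pair of symbols."""
    symbols = list(text)
    for _ in range(n_merges):
        pairs = Counter(zip(symbols, symbols[1:]))
        if not pairs:
            break
        (a, b), _ = pairs.most_common(1)[0]       # most frequent adjacent pair
        merged, i = [], 0
        while i < len(symbols):
            if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == (a, b):
                merged.append(a + b)
                i += 2
            else:
                merged.append(symbols[i])
                i += 1
        symbols = merged
    return symbols

print(toy_bpe("lower lowest lowly"))
```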
How do LLMs decide what to say next? Explore greedy vs. probabilistic sampling and log probabilities.
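As a hedged illustration of the difference, the snippet below contrasts greedy decoding with temperature sampling; the five-word vocabulary and the logits are made up for the example.

```python
import numpy as np

vocab = ["the", "cat", "sat", "flew", "sang"]      # made-up vocabulary
logits = np.array([2.0, 1.5, 0.3, -1.0, -2.0])     # made-up model scores

def softmax(z):
    z = z - z.max()                                # numerical stability
    e = np.exp(z)
    return e / e.sum()

greedy = vocab[int(np.argmax(logits))]             # deterministic: always the top token

temperature = 0.8
probs = softmax(logits / temperature)              # higher temperature flattens the distribution
sampled = np.random.choice(vocab, p=probs)         # stochastic: varies run to run

log_probs = np.log(probs)                          # the "log probabilities" APIs expose
print(greedy, sampled)
print(dict(zip(vocab, log_probs.round(2))))
```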
Beyond the prompt: Curate the perfect information to feed your LLM's limited attention span.
Learn how Zero-Shot, Few-Shot, and Chain-of-Thought prompting steer LLM probabilities.
Why doesn't ChatGPT re-read your whole chat every time it types a word? Memory optimization explained.
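The short answer is the KV cache. The toy sketch below (random projection matrices and tiny dimensions, all assumed for illustration) shows how only the newest token's key and value are computed at each step while the older ones are reused.

```python
import numpy as np

d = 8
W_q, W_k, W_v = (np.random.randn(d, d) for _ in range(3))
cached_k, cached_v = [], []                       # the growing KV cache

def decode_step(x_new):
    q = W_q @ x_new
    cached_k.append(W_k @ x_new)                  # only the newest token is projected
    cached_v.append(W_v @ x_new)
    K = np.stack(cached_k)                        # (t, d), grows by one row per step
    V = np.stack(cached_v)
    scores = K @ q / np.sqrt(d)
    w = np.exp(scores - scores.max())
    w /= w.sum()                                  # softmax over the cached positions
    return w @ V                                  # attention output for the new token

for _ in range(5):                                # five decoding steps, O(t) work each
    out = decode_step(np.random.randn(d))
print(len(cached_k), "cached key/value pairs")    # 5
```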
Prediction by assuming simplicity: every feature treated as independent of the rest. Learn how this probabilistic algorithm uses Bayes' Theorem for classification.
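A compact, self-contained sketch of a multinomial Naive Bayes classifier; the four-message training set is invented, and add-one smoothing plus log-space arithmetic are the standard choices rather than anything specific to the piece.

```python
from collections import Counter, defaultdict
import math

# P(class | words) is proportional to P(class) * product of P(word | class).
train = [("spam", "win money now"), ("spam", "win a prize"),
         ("ham", "meeting at noon"), ("ham", "lunch at noon")]

class_counts = Counter(label for label, _ in train)
word_counts = defaultdict(Counter)
for label, text in train:
    word_counts[label].update(text.split())

vocab_size = len({w for c in word_counts.values() for w in c})

def predict(text):
    scores = {}
    for label in class_counts:
        total = sum(word_counts[label].values())
        score = math.log(class_counts[label] / len(train))        # log prior
        for w in text.split():
            # log likelihood with add-one (Laplace) smoothing
            score += math.log((word_counts[label][w] + 1) / (total + vocab_size))
        scores[label] = score
    return max(scores, key=scores.get)

print(predict("win a prize now"))   # -> "spam"
```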
Strength in numbers. See how an ensemble of diverse decision trees can vote to make robust predictions.
The classic algorithm that finds the widest possible street between two classes of data.
From collaborative filtering to matrix factorization: how Netflix knows what you want before you do.
The architecture that powered neural machine translation before Transformers. Learn about the Context Vector bottleneck.
The 2012 breakthrough that started the Deep Learning era. ReLU, Dropout, and GPUs.
Before Transformers, RNNs learned to focus. The mechanism that solved the bottleneck problem.
Understand how AI processes sequential data using hidden states and memory loops.
See how 'Masked Language Model' training enables deep context from both directions.
Learn how breaking images into 16x16 patches allowed pure Transformers to beat CNNs.
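The patch step itself is little more than a reshape. In the sketch below, an assumed 224x224 RGB image becomes a sequence of 196 flattened 16x16 patches, the "words" a Vision Transformer attends over.

```python
import numpy as np

image = np.random.rand(224, 224, 3)               # illustrative random image
p = 16                                            # patch side length

# Cut the image into a 14x14 grid of 16x16x3 patches, then flatten each patch.
patches = image.reshape(224 // p, p, 224 // p, p, 3)
patches = patches.transpose(0, 2, 1, 3, 4).reshape(-1, p * p * 3)
print(patches.shape)                              # (196, 768): 196 patch "tokens"
```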
Discover the architecture behind precise image segmentation and preserving fine details.
'You Only Look Once': Real-time object detection framed as a single regression problem.
New Research: A hybrid, parameter-adaptive RAG system designed specifically for high-stakes legal applications.
When RAG gets smart. Learn how adding an autonomous agent loop enables multi-hop reasoning and self-correction.
New Research: Combining GraphRAG and VectorRAG with an autonomous router for scientific literature review.
Google Cloud Architecture: From simple prompts to complex multi-agent systems.
Give AI an open-book test. Connect LLMs to external knowledge bases for accurate answers.
Go beyond basic vector search with Reranking, Hybrid Search, and Query Expansion for production-grade accuracy.
Research Deep Dive: Why Small Language Models (SLMs) are replacing monolithic LLMs.
New Research: When models think too much, they often talk themselves out of the correct answer.
New Research: What if LLMs didn't have to 'think' in words? Explore reasoning directly in continuous latent space.
New Research: A single model that can dynamically switch between fast responses and deep reasoning modes.
New Research: How a 7B model approached GPT-4 math performance by ditching the RL 'Critic' model.
New Research: An open-source thinking agent that interleaves reasoning with tool use (300+ steps).
New Research: Compressing long documents into highly efficient 2D visual tokens instead of text.
New Research: Can AI models learn to hide their dangerous thoughts from safety monitors?
New Research: Why are Transformers so robust? They naturally learn 'low sensitivity' functions.
New Research: An unsupervised method that uses 'sticky' keywords to find topic boundaries.
New Research: Does Supervised Fine-Tuning just memorize while RL actually learns rules?