← Back to home

Free · Self-paced · No signup

Learn how modern AI systems actually work

Not tutorials that wrap an API call. Close readings of the papers and production systems behind LLM inference, distributed systems, fine-tuning, and agents — from someone who's built them at scale.

16 lessons · 4 tracks · progress saved automatically · always free

Grounded in the papers

Every lesson is a close reading of the actual paper — the mechanism, the numbers that matter, and the claims that don't survive production.

Production-tested

Written from real systems work (Ashwani Jha — ex-Amazon: Just Walk Out, Pay, AWS CloudFormation). Every topic includes when not to use it.

Yours to pace

Read on the web or get it as a daily email. Your progress is tracked in the browser — no account, no paywall to start.

Choose a track

LLM Inference & Serving

4 lessons · ~1h

How large language models actually run in production — KV-cache management, batching, speculative decoding, quantization, and the systems work that turns a model into a serveable endpoint.

0 / 4 complete
Start learning →

Distributed Systems

2 lessons · ~1h

Foundational papers and production lessons on storage, consensus, replication, and large-scale data infrastructure — the substrate everything else runs on.

0 / 2 complete
Start learning →

Fine-Tuning & Adaptation

2 lessons · ~1h

Adapting pre-trained models to your task without burning a cluster — LoRA and its variants, quantized training, and the tradeoffs that decide quality.

0 / 2 complete
Start learning →

AI Agents & Reasoning

Coming soon

Getting LLMs to plan, use tools, remember, and reason — what the agent papers actually demonstrate and how the patterns hold up in production.

Paper Breakdowns

8 lessons · ~2h

The “What the Paper Actually Says” series — close readings of the systems and ML papers that matter, focused on the mechanism, the numbers, and when not to use the idea.

0 / 8 complete
Start learning →