20 posts in total
2026
XLA02 - shapes, layout & tiling
XLA01 - architecture & workflows
Knowledge Distillation 101
GPU mode - lecture 2 - CUDA 101
Pallas 101 - multi-backend kernel for JAX
5D parallelism in LLM training
Memory usage breakdown during Training
2025
JAX 101
Jeff Dean & Gemini team Q&A at NeurIPS ’25
PyTorch Conference & Ray Summit 2025 summary