24 posts in total
2026
Memory usage breakdown during Training
2025
JAX 101
Jeff Dean & Gemini team Q&A at NeurIPS ’25
PyTorch Conference & Ray Summit 2025 summary
Intro to PPO in RL
Truncated Importance Sampling (TIS) in RL
Speculative decoding 02
vLLM 05 - vLLM multi-modal support
Perplexity DeepSeek MoE
MoE history and OpenMoE