Guides
Guides
Technical deep-dives
Deployment guides, inference tuning, and production patterns, complementing the interactive topic curriculum.
Guides
Long-form technical guides: deployment patterns, tuning, and operations, separate from the interactive topic curriculum.
Gemma-4-26B-A4B-it-GGUF on Modal
Detailed deployment guide for Gemma-4-26B-A4B-it-GGUF on Modal with llama.cpp, GPU memory snapshots, and production operations.
Open guide →GLM-5.1 FP8 on Modal
Production-grade OpenAI-compatible inference for GLM-5.1 with SGLang, 8×B200 GPUs, and scale-to-zero on Modal.
Open guide →