Running on CPU Upgrade Featured 3.04k The Smol Training Playbook 📚 3.04k The secrets to building world-class LLMs
view article Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy +4 Sep 18, 2024 • 276
google/embeddinggemma-300m Sentence Similarity • 0.3B • Updated Sep 25, 2025 • 1.98M • • 1.52k
view article Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers +5 Sep 11, 2025 • 185
view article Article What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware Aug 8, 2025 • 33