Inference Optimization
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
models 84
inference-optimization/GLM-4.6-NVFP4
Text Generation • 199B • Updated
• 9
inference-optimization/Qwen3-Coder-Next.w4a16
Text Generation • 12B • Updated
• 111
inference-optimization/Qwen3-30B-A3B-Instruct-2507-6bits
23B • Updated
• 8
inference-optimization/Kimi-K2-Instruct-0905-BF16-NVFP4
Updated
• 11
inference-optimization/test_qwen3_next_mtp
Updated
• 6
inference-optimization/test_tencentbac_fastmtp
Updated
• 5
inference-optimization/Ministral-3-14B-Instruct-2512-NVFP4
Updated
• 60
inference-optimization/Ministral-3-14B-Instruct-2512.w8a8
Updated
• 28
inference-optimization/Ministral-3-14B-Instruct-2512.w4a16
Updated
• 61
inference-optimization/Meta-Llama-3-8B-Instruct-NVFP4-GPTQ-Quant
5B • Updated
• 9
datasets 0
None public yet