local-llm 6
- Club 3090 vs My Llama Setup
- Running NVIDIA Nemotron 30B with Vision on a 24 GB GPU
- Nemotron 30B on 24 GB: Benchmarks and a Quantization Quirk
- Qwen3.6-27B on an M1 Max: when the laptop config matches the 3090
- Taming Qwen Overthinking with GBNF Grammars
- RTX 3090 Power Limit: Finding the Sweet Spot for Local LLM Inference