Notes on software, systems, and side quests

HOME
CATEGORIES
TAGS
ARCHIVES
ABOUT

Home Tags llama-swap

Tag

llama-swap 2

Taming Qwen Overthinking with GBNF Grammars Apr 28, 2026
RTX 3090 Power Limit: Finding the Sweet Spot for Local LLM Inference Apr 28, 2026

Recently Updated

My Internet Drops Weren't the ISP: One Night of OPNsense Forensics
rocket-cli: a Rocket.Chat MCP server with a local FTS5 brain
Design Fluency Meets the Knowledge Graph
How a Cryptominer Spent Two Days on My Server — and How I Found It
When your VLM test flake is actually a VNC capture race

Trending Tags

local-llm llama.cpp llm sqlite automation CUDA llama-cpp qwen Qwen speculative-decoding

© 2026 Jean Brito. Some rights reserved.

Using the Chirpy theme for Jekyll.

Trending Tags

local-llm llama.cpp llm sqlite automation CUDA llama-cpp qwen Qwen speculative-decoding

New content available