The pipeline
Green nodes are human, coral are AI, ink are automation. The role is the node colour; the chain is the timeline — no zigzag because the diagram never asks the arrows to leave a column.The model-routing philosophy
Every model has a sweet spot:- Claude — best at multi-file refactors, deep reasoning, agentic loops. The default for non-trivial implementation work.
- Gemini — great for second opinions, code review, broad context understanding.
- GitHub Copilot — fastest for line-level completions inside the editor. Cheap for high-volume routine work.
- Local MLX — Apple Silicon native inference for typo fixes, quick edits, and “I don’t want to burn cloud tokens on this” tasks. The serving stack, tuning, and model strategy live under Local LLM.
~/CLAUDE.md and AGENTS.md say which model to use when.
Local AI gateway (Bifrost)
Bifrost is the OpenAI-compatible HTTP gateway that sits between every AI tool on the workstation and whichever provider eventually answers the call. It exposeshttp://localhost:30080/v1/chat/completions and fans out to OpenAI, Gemini, OpenRouter, and the local MLX server based on the task class.
Tools resolve task classes (Research, Coding, Review, Pre-commit) to a current model at call time — never hardcode model identifiers in committed config. When localOnlyMode is enabled, every request routes exclusively to the local MLX inference server on port 11434.
See Bifrost for routing conventions, local-only mode details, and provider capabilities.
Priority order
- Anthropic official — Claude Code plugins, skills, patterns
- Bifrost AI gateway — multi-provider routing at
localhost:30080 - Personal or custom — only when no alternative exists
Repos that power this pipeline
ai-assistant-instructions
Universal AI configuration layer — rules, permissions, workflows, agents.
claude-code-plugins
Commands, skills, hooks, agents for Claude Code.
nix-ai
Nix package and config layer for every AI coding tool.
claude-code-routines
Scheduled remote-agent routines on Claude.ai.
ai-workflows
Reusable GitHub Copilot agentic workflows.
raycast-smart-issue
Raycast extension for AI-drafted GitHub issues via local MLX.