Status: [UNTESTED] Endpoints found: http://127.0.0.1:4000/v1/chat/completions Source: https://github.com/CaptainMihawk/quantum-free-router # quantum-free-router [![License: MIT](https://img.shields.io/badge/license-MIT-green.svg)](LICENSE) [![Bifrost](https://img.shields.io/badge/Bifrost-v1.5.11-blue.svg)](https://github.com/Portkey-AI/bifrost) [![CI](https://github.com/spacepirate15/quantum-free-router/actions/workflows/ci.yml/badge.svg)](https://github.com/spacepirate15/quantum-free-router/actions/workflows/ci.yml) [![Free Tier](https://img.shields.io/badge/cost-free--tier-brightgreen.svg)](#supported-providers) Professional free-tier LLM router for long-running coding and research agents. `quantum-free-router` packages a Bifrost-based OpenAI-compatible router with a curated free-model catalog, install scripts, health checks, and certification tools. It is designed for agent workflows where one free model going down, timing out, or hitting quota should not stop the whole task. ```text OpenAI-compatible client | v http://127.0.0.1:4000/v1 | v Bifrost provider router | +-- OpenCode Zen +-- KiloCode +-- NVIDIA NIM +-- Gemini +-- Mistral / Cerebras / Groq / SambaNova / Cohere ``` ## What Problem This Solves Free LLM endpoints are useful, but they are operationally unstable when used one at a time: - quotas reset at different times - rate limits can appear mid-task - provider catalogs change without warning - some model IDs work directly but not through a router - long-context or large models can timeout under load - agent replies can fail if a single primary model is unavailable This repository gives you a practical free-tier routing layer with: - one local OpenAI-compatible base URL - a documented provider/model catalog - bounded timeout and retry settings - model certification scripts - systemd service setup - clear docs for agent integrations It does not hide provider limits. It helps you route around them cleanly. ## Quick Start One-command install on Linux or WSL2: ```bash curl -fsSL https://raw.githubuserco