Work-type → model routing
Header/tag-based routing to logical model routes (coding-default, bulk) with fallback — native Gateway API config, no plugin.
One self-hosted gateway for all your LLM traffic
Route work-type to the right model, cap tokens and spend per team, enforce model allow-lists and guardrails, and see every token — built on Higress, fully Infrastructure-as-Code, on-premise, no cloud lock-in.