Ultra-fast LLM inference router that picks the cheapest fast provider per request — without locking into Groq's quotas and waitlist. Pay per token at provider cost, sub-second Llama and Mixtral, 99% cheaper than retail Groq Cloud.
This module is your starting point. Describe what you want to layer on top — an interface, extra fields, a workflow, a whole app. Watch it build in real time. ⌘/Ctrl + Enter to run.
Your module's ready — tell us what you need
Use it, host it, give it a home, or keep building. You pick.