Why I built llm-queue: one local LLM (Ollama) behind an OpenAI-compatible HTTP API, so every process shares one serialized priority queue instead of thrashing the model.
Tag
Ollama
1 post tagged “Ollama”.
Tag
1 post tagged “Ollama”.
Why I built llm-queue: one local LLM (Ollama) behind an OpenAI-compatible HTTP API, so every process shares one serialized priority queue instead of thrashing the model.