test prompts against local Ollama models · no cloud · streaming · all on-device
notes
- All inference runs on your machine. No tokens, prompts, or responses leave your device.
- Ollama must be started with
OLLAMA_ORIGINS=*to allow browser access. - Works with any model you've pulled: llama3, mistral, gemma, phi3, etc.