Here’s a short list of Ollama models that track closest to gpt-oss:20b
in capability/feel, plus how to prompt them so you don’t lose quality when you swap away from OpenAI-tuned prompts.
-
Mistral NeMo 12B Instruct — very solid instruction following, large context (128k), good tool/JSON behavior for its size.
ollama pull mistral-nemo:12b
(ormistral-nemo:12b-instruct
where available) ([Ollama][1]) -
Llama 3.1 Instruct (8B or 70B) — stable, widely used baseline; 70B will beat 20B-class models on reasoning, 8B is a fast local workhorse.
ollama pull llama3.1:8b-instruct
(orllama3.1:70b-instruct
if you have VRAM) ([Ollama][2])