winget install --id Ollama.Ollama
About Ollama
Get up and running with large language models locally.
What's new in 0.31.1
What's Changed - mlx: tighten up gemma4 moe loading code by @pdevine in https://github.com/ollama/ollama/pull/16964 - mlx: bump to latest version to include new small batch matmul kernel @jessegross @dhiltgen - llama.cpp: bump to b9840 @dhiltgen - improved gemma4 MTP performance @jessegross Full Changelog: https://github.com/ollama/ollama/compare/v0.31.0...v0.31.1
Version history
| Version | Updated | Notes |
|---|---|---|
| 0.31.1 | Unknown | What's Changed - mlx: tighten up gemma4 moe loading code by @pdevine in https://github.com/ollama/ollama/pull/16964 - mlx: bump to latest version to include new small batch matmul kernel @jessegross @dhiltgen - llama.cpp... |
| 0.30.11 | Unknown | What's Changed - launch: add thinking capability detection to opencode by @hoyyeva in https://github.com/ollama/ollama/pull/15434 - launch: auto-install Claude Code by @hoyyeva in https://github.com/ollama/ollama/pull/16... |
| 0.30.9 | Unknown | What's Changed - Support for Cohere2Moe architecture - Fixed LFM2 parser/render for cases where thinking was not emitted - Fixed issue where ollama launch claude and other coding agent or assistant use cases would only o... |
| 0.30.8 | Unknown | What's Changed - Fixed ollama launch selecting the wrong provider in some cases - Improved prompt caching by decoupling it from context shift for better KV cache reuse - More stable MLX inference with hardened linear and... |
| 0.30.7 | Unknown | Ollama Launch now supports Hermes Desktop, a native desktop interface for the Hermes agent. Run it alongside your Hermes agent to get a visual interface for managing conversations, integrations, and messaging apps. ollam... |
| 0.30.6 | Unknown | What's Changed - ollama launch omp now integrates with Oh My Pi, an AI coding agent with IDE integration - MLX embedding layers now use NVFP4 global scale for improved quantization on Apple Silicon Full Changelog: https:... |
| 0.30.4 | Unknown | What's Changed - llama.cpp version update by @dhiltgen in https://github.com/ollama/ollama/pull/16463 - Kill llama-server during Windows cleanup by @dhiltgen in https://github.com/ollama/ollama/pull/16458 Known Issues -... |
| 0.30.3 | Unknown | What's Changed - models: add support for gemma4-12b by @pdevine in https://github.com/ollama/ollama/pull/16457 Full Changelog: https://github.com/ollama/ollama/compare/v0.30.2...v0.30.3 |
| 0.30.2 | Unknown | What's Changed - feat(launch): show and auto-install Cline CLI by @hoyyeva in https://github.com/ollama/ollama/pull/16402 - log template details to aid troubleshooting by @dhiltgen in https://github.com/ollama/ollama/pul... |
| 0.30.0 | Unknown | Ollama 0.30 is now available, with improved compatibility and performance using llama.cpp. This augments the MLX engine on Apple Silicon, bringing support to a wider range of hardware. This release brings support for a w... |
| 0.24.0 | Unknown | Codex App The OpenAI Codex App is now available on Ollama. Use any Ollama model — local or cloud — inside the desktop app to code, browse, and review. ollama launch codex-app Built-in browser Codex can spin up local serv... |
| 0.23.4 | Unknown | What's Changed - ollama launch opencode now supports vision models with image inputs - Fixed formatting of Claude tool results when using local image paths Full Changelog: https://github.com/ollama/ollama/compare/v0.23.3... |
| 0.23.3 | Unknown | What's Changed - mlx: refined model push behavior by @dhiltgen in https://github.com/ollama/ollama/pull/15431 - test: integration test hardening by @dhiltgen in https://github.com/ollama/ollama/pull/13532 - app: harden u... |
| 0.23.2 | Unknown | What's Changed - ollama launch no longer includes Claude Desktop due to the third-party integration being limited to Anthropic models. - Use ollama launch claude-desktop --restore to restore Claude Desktop to its normal... |
| 0.23.1 | Unknown | Gemma 4 MTP (Multi-token Processing) for the MLX runner Gemma 4 MTP speculative decoding is now supported on Macs. This can give over a 2x speed increase for the Gemma 4 31B model on coding tasks. ollama run gemma4:31b-c... |
| 0.23.0 | Unknown | Claude Desktop Claude Desktop is now supported with Ollama Launch. Claude Cowork and Claude Code are supported within the Claude Desktop App. ollama launch claude-desktop Claude Cowork Claude Code Claude Code on the term... |
| 0.22.1 | Unknown | What's Changed - Updated the Gemma 4 renderer for thinking and tool calling improvements - Model recommendations are now updated without updating Ollama - Aligned the desktop app's launch page with ollama launch integrat... |
| 0.22.0 | Unknown | New models - NVIDIA's Nemotron 3 Omni - Poolside's first open-weight coding model - Laguna XS.2 Full Changelog: https://github.com/ollama/ollama/compare/v0.21.2...v0.22.0 |
| 0.21.2 | Unknown | What's Changed - Improved reliability of the OpenClaw onboarding flow in ollama launch - Recommended models in ollama launch now appear in a fixed, canonical order - OpenClaw integration now bundles Ollama's web search p... |
| 0.21.1 | Unknown | What's Changed Kimi CLI You can now install and run the Kimi CLI through Ollama. ollama launch kimi --model kimi-k2.6:cloud Kimi CLI with Kimi K2.6 excels at long horizon agentic execution tasks through a multi-agent sys... |