About Ollama

Get up and running with large language models locally.

What's new in 0.31.1

What's Changed
- mlx: tighten up gemma4 moe loading code by @pdevine in https://github.com/ollama/ollama/pull/16964
- mlx: bump to latest version to include new small batch matmul kernel @jessegross @dhiltgen
- llama.cpp: bump to b9840 @dhiltgen
- improved gemma4 MTP performance @jessegross

Full Changelog: https://github.com/ollama/ollama/compare/v0.31.0...v0.31.1

Version history

Version	Updated	Notes
0.31.1	Unknown	What's Changed - mlx: tighten up gemma4 moe loading code by @pdevine in https://github.com/ollama/ollama/pull/16964 - mlx: bump to latest version to include new small batch matmul kernel @jessegross @dhiltgen - llama.cpp...
0.30.11	Unknown	What's Changed - launch: add thinking capability detection to opencode by @hoyyeva in https://github.com/ollama/ollama/pull/15434 - launch: auto-install Claude Code by @hoyyeva in https://github.com/ollama/ollama/pull/16...
0.30.9	Unknown	What's Changed - Support for Cohere2Moe architecture - Fixed LFM2 parser/render for cases where thinking was not emitted - Fixed issue where ollama launch claude and other coding agent or assistant use cases would only o...
0.30.8	Unknown	What's Changed - Fixed ollama launch selecting the wrong provider in some cases - Improved prompt caching by decoupling it from context shift for better KV cache reuse - More stable MLX inference with hardened linear and...
0.30.7	Unknown	Ollama Launch now supports Hermes Desktop, a native desktop interface for the Hermes agent. Run it alongside your Hermes agent to get a visual interface for managing conversations, integrations, and messaging apps. ollam...
0.30.6	Unknown	What's Changed - ollama launch omp now integrates with Oh My Pi, an AI coding agent with IDE integration - MLX embedding layers now use NVFP4 global scale for improved quantization on Apple Silicon Full Changelog: https:...
0.30.4	Unknown	What's Changed - llama.cpp version update by @dhiltgen in https://github.com/ollama/ollama/pull/16463 - Kill llama-server during Windows cleanup by @dhiltgen in https://github.com/ollama/ollama/pull/16458 Known Issues -...
0.30.3	Unknown	What's Changed - models: add support for gemma4-12b by @pdevine in https://github.com/ollama/ollama/pull/16457 Full Changelog: https://github.com/ollama/ollama/compare/v0.30.2...v0.30.3
0.30.2	Unknown	What's Changed - feat(launch): show and auto-install Cline CLI by @hoyyeva in https://github.com/ollama/ollama/pull/16402 - log template details to aid troubleshooting by @dhiltgen in https://github.com/ollama/ollama/pul...
0.30.0	Unknown	Ollama 0.30 is now available, with improved compatibility and performance using llama.cpp. This augments the MLX engine on Apple Silicon, bringing support to a wider range of hardware. This release brings support for a w...
0.24.0	Unknown	Codex App The OpenAI Codex App is now available on Ollama. Use any Ollama model — local or cloud — inside the desktop app to code, browse, and review. ollama launch codex-app Built-in browser Codex can spin up local serv...
0.23.4	Unknown	What's Changed - ollama launch opencode now supports vision models with image inputs - Fixed formatting of Claude tool results when using local image paths Full Changelog: https://github.com/ollama/ollama/compare/v0.23.3...
0.23.3	Unknown	What's Changed - mlx: refined model push behavior by @dhiltgen in https://github.com/ollama/ollama/pull/15431 - test: integration test hardening by @dhiltgen in https://github.com/ollama/ollama/pull/13532 - app: harden u...
0.23.2	Unknown	What's Changed - ollama launch no longer includes Claude Desktop due to the third-party integration being limited to Anthropic models. - Use ollama launch claude-desktop --restore to restore Claude Desktop to its normal...
0.23.1	Unknown	Gemma 4 MTP (Multi-token Prediction) for the MLX runner Gemma 4 MTP speculative decoding is now supported on Macs. This can give over a 2x speed increase for the Gemma 4 31B model on coding tasks. ollama run gemma4:31b-c...
0.23.0	Unknown	Claude Desktop Claude Desktop is now supported with Ollama Launch. Claude Cowork and Claude Code are supported within the Claude Desktop App. ollama launch claude-desktop Claude Cowork Claude Code Claude Code on the term...
0.22.1	Unknown	What's Changed - Updated the Gemma 4 renderer for thinking and tool calling improvements - Model recommendations are now updated without updating Ollama - Aligned the desktop app's launch page with ollama launch integrat...
0.22.0	Unknown	New models - NVIDIA's Nemotron 3 Omni - Poolside's first open-weight coding model - Laguna XS.2 Full Changelog: https://github.com/ollama/ollama/compare/v0.21.2...v0.22.0
0.21.2	Unknown	What's Changed - Improved reliability of the OpenClaw onboarding flow in ollama launch - Recommended models in ollama launch now appear in a fixed, canonical order - OpenClaw integration now bundles Ollama's web search p...
0.21.1	Unknown	What's Changed Kimi CLI You can now install and run the Kimi CLI through Ollama. ollama launch kimi --model kimi-k2.6:cloud Kimi CLI with Kimi K2.6 excels at long horizon agentic execution tasks through a multi-agent sys...
