Available on winget

Install llamafile

Distribute and run LLMs with a single file.

Install with winget

winget install --id Mozilla.llamafile

Upgrade

winget upgrade --id Mozilla.llamafile

Uninstall

winget uninstall --id Mozilla.llamafile

About llamafile

llamafile lets you distribute and run LLMs with a single file. Our goal is to make open LLMs much more accessible to both developers and end users. We're doing that by combining llama.cpp with Cosmopolitan Libc into one framework that collapses all the complexity of LLMs down to a single-file executable (called a "llamafile") that runs locally on most computers, with no installation.

What's new in 0.10.3

What's Changed This release is mainly a bugfix to a regression introduced in 0.10.2 (many many thanks to @niutech for opening the issue! 🙏) As releases become more frequent, we are working to better automate the process and making this more open. This PR brings a new llamafile/release.sh script that's the one used to create all the new release files (we'll be testing it until the next release before updating the RELEASE.md doc). - Update release scripts by @aittalam in https://github.com/mozilla-ai/llamafile/pull/990 - Fix uncaught SIGSEGV when GPU init fails, restore CPU fallback (#988) by @aittalam in https://github.com/mozilla-ai/llamafile/pull/989 Full Changelog: https://github.com/mozilla-ai/llamafile/compare/0.10.2...0.10.3

Read release notes

Version history

Version	Updated	Notes
0.10.3	Unknown	What's Changed This release is mainly a bugfix to a regression introduced in 0.10.2 (many many thanks to @niutech for opening the issue! 🙏) As releases become more frequent, we are working to better automate the process...
0.10.2	Unknown	What's Changed - Add gpu accel probe by @aittalam in https://github.com/mozilla-ai/llamafile/pull/953 - Added executable dir to list of dylib valid paths by @aittalam in https://github.com/mozilla-ai/llamafile/pull/959 -...
0.10.1	Unknown	What's Changed Summary: - Added support for vulkan dylibs - Added windows build scripts → we now have cuda, rocm, vulkan both as .so and as .dll libraries - Updated llama.cpp submodule to 5e9c63546 → we now have llama.cp...
0.10.0	Unknown	llamafile versions starting from 0.10.0 use a new build system, aimed at keeping our code more easily aligned with the latest versions of llama.cpp. This means they support more recent models and functionalities, but at...
0.9.3	Unknown	Release notes
0.9.2	Unknown	Release notes
0.9.1	Unknown	Release notes
0.9.0	Unknown	Release notes
0.8.17	Unknown	Release notes
0.8.16	Unknown	Release notes
0.8.15	Unknown	Release notes
0.8.14	Unknown	This release introduces our new CLI chatbot interface. It supports multi-line input using triple quotes. It will syntax highlight Python, C, C++, Java, and JavaScript code. This chatbot is now the default mode of operati...