← Package directory
Available on winget

Install Lemonade Server

Refreshingly fast LLMs on GPUs and NPUs

Install with winget
winget install --id AMD.LemonadeServer
Upgrade
winget upgrade --id AMD.LemonadeServer
Uninstall
winget uninstall --id AMD.LemonadeServer

About Lemonade Server

🍋 Lemonade Server is a server interface that uses the standard Open AI API, allowing applications to integrate with local LLMs. This means that you can easily replace cloud-based LLMs with private and free LLMs that run locally on your own PC's NPU and GPU.

What's new in 10.5.1

Lemonade Server Operating System Downloads Windows lemonade.msi Ubuntu 24.04+ Launchpad PPA Fedora 43 lemonade-server-10.5.1-fc43.x86_64.rpm Fedora 44 lemonade-server-10.5.1-fc44.x86_64.rpm macOS Lemonade-10.5.1-Darwin.pkg Other platforms? See our Installation Options for Docker, Snap, Arch, Debian, and more. Embeddable Lemonade Portable binaries for bundling into your own installer. Run lemond ./ as a subprocess. Platform Download Ubuntu x64 lemonade-embeddable-10.5.1-ubuntu-x64.tar.gz Windows x64 lemonade-embeddable-10.5.1-windows-x64.zip macOS arm64 lemonade-embeddable-10.5.1-macos-arm64.tar.gz What's Changed Thanks @bitgamma, @dmitrii-galantsev, @fl0rianr, @github-actions, @jeremyfowers, @kpoineal, @lucifer-vali, @seii, @sofiageo, @superm1 for your awesome contributions to this release! Click to expand changelog - feat(app): add full custom sd.cpp model options by @fl0rianr in https://github.com/lemonade-sdk/lemonade/pull/1909 - fix: update bug report template with current CLI commands by @kpoineal in https://github.com/lemonade-sdk/lemonade/pull/1914 - keep quotes during arg merging by @bitgamma in https://github.com/lemonade-sdk/lemonade/pull/1920 - Separate releases for Fedora 43 and 44 by @lucifer-vali in https://github.com/lemonade-sdk/lemonade/pull/1917 - Update Fedora maintainer in contribution guide by @lucifer-vali in https://github.com/lemonade-sdk/lemonade/pull/1928 - fix: add missing rdna1, rdna2, and gfx1150 targets by @sofiageo in https://github.com/lemonade-sdk/lemonade/pull/1923 - Introduce per-backend args by @bitgamma in https://github.com/lemonade-sdk...

Read release notes

Version history

Version Updated Notes
10.5.1 Unknown Lemonade Server Operating System Downloads Windows lemonade.msi Ubuntu 24.04+ Launchpad PPA Fedora 43 lemonade-server-10.5.1-fc43.x86_64.rpm Fedora 44 lemonade-server-10.5.1-fc44.x86_64.rpm macOS Lemonade-10.5.1-Darwin.p...
10.5.0 Unknown Headline - Upgraded to ROCm 7.13 for llama.cpp and stable-diffusion.cpp! - macOS has graduated from beta with all major features fully supported. - Overhauled management of custom/imported models and recipes. Breaking ch...
10.4.0 Unknown Lemonade Server Operating System Downloads Windows lemonade.msi (Server + App) Ubuntu 24.04+ Launchpad PPA (Server + App) Fedora 43 lemonade-server-10.4.0.x86_64.rpm (Server + App) macOS (beta) Lemonade-10.4.0-Darwin.pkg...
10.3.0 Unknown Headline - OmniRouter unifies the best backend engines to give you true omni-modal LLM chat. Make and edit images, generate speech, and more with natural language. Open AI-compliant. - The desktop app is now based on Tau...
10.2.0 Unknown Lemonade Server Operating System Downloads Windows lemonade.msi (Server + App) · lemonade-server-minimal.msi (Server Only) Ubuntu 24.04+ Launchpad PPA (Server) · lemonade-app-10.2.0-x86_64.AppImage (Companion Desktop App...
10.1.0 Unknown Headline - The new lemonade CLI is a much nicer way to interact with the service. - lemonade config replaces most uses of the conf file on Linux. - Coding is supercharged with improved lemonade launch codex|claude. - Def...
10.0.1 Unknown Headline - Debian packages are now available via PPA instead of .deb - Streamlined experience for searching and adding GGUFs from Hugging Face in the app - qwen3.5-4b now available on NPU via FastFlowLM v0.9.36 - llama.c...
10.0.0 Unknown Headline - Linux NPU support is available for LLMs and Whisper via FastFlowLM - Native integration with Claude Code: lemonade-server launch claude - A Fedora .rpm installer is now published in the release Quick Install O...
9.4.1 Unknown Headline - Support for the Qwen3.5 family of models on ROCm and Vulkan! - Redesigned the app for easier navigation with a full Backend Manager available in the app, CLI, and endpoints. - Image generation support is great...
9.4.0 Unknown Headline - Support for the Qwen3.5 family and LFM2-24B-A2B on llama.cpp + ROCm - The app's layout has been redesigned, with a new Backend Manager - Image editing endpoint added Breaking Change .deb installation has chang...
9.3.4 Unknown Headline - macOS (beta) installers are now a part of the release! - Text-to-speech is now available in the Lemonade App with the Kokoro recipe. - XDNA 2 NPU detection has been overhauled for better compatibility. Quick I...
9.3.3 Unknown Headline - FastFlowLM upgraded to 0.9.33 with Qwen2.5-VL-3B-Instruct support. - Streaming transcription enabled in the app and with /realtime endpoint. - Added Ollama API support for better native integration with local...
9.3.2 Unknown Headline - Image generation with ROCm is now support on Windows and Linux - --max-loaded-models N has been simplified and now enables N models of each type - Lots of under-the-hood improvements to polish the changes from...
9.3.1 Unknown Headline - Windows MSI installers are now code signed. Thanks, SignPath Foundation! - Ryzen AI SW 1.7 is integrated and fully replaces the prior Ryzen AI SW support. - Lemonade Marketplace is now included in the app. Bre...
9.3.0 Unknown Headline - NPU enabled for Whisper models on the /audio/transcription endpoint (Windows only). - Kokoro TTS is available on the /audio/speech endpoint. - Desktop and web apps have backend selection, providing a ROCm opti...
9.2.0 Unknown Headline - Introduces the images/generations endpoint with stablediffusion.cpp as the first image generation engine. - Compatibility improvements: Debian now supported; Debian, Arch, and Fedora builds tested in CI. - ryz...
9.1.4 Unknown Headline - Support GLM-4.7-Flash-GGUF on ROCm and Vulkan - Support installing for All Users on Windows and Linux - Allow an absolute path to go into the CLI pull command - Rev to FastFlowLM 0.9.27 and support new LFM2.5...
9.1.3 Unknown Headline - Lemonade App can be used remotely over the network - Lemonade CLI and /load endpoint can save custom load options for each model - LFM2.5 model enabled via FastFlowLM v0.9.25 - Initial support for publishing D...
9.1.2 Unknown Headline - Create, save, and share custom model recipes: https://lemonade-server.ai/docs/server/server_spec/#per-model-options - Full NPU and ROCm support for the ROG Ally X handheld and RAI Z2 Extreme chip - Full compat...
9.1.1 Unknown Headline - Add support for transcriptions, reranking, and embeddings in the App - ROCm support for Strix Point (gfx1150) - Solve a bunch of community-reported feedback What's Changed - Revamp README file by @danielholand...