← Package directory
Available on winget

Install Lemonade Server

Refreshingly fast LLMs on GPUs and NPUs

Install with winget
winget install --id AMD.LemonadeServer
Upgrade
winget upgrade --id AMD.LemonadeServer
Uninstall
winget uninstall --id AMD.LemonadeServer

About Lemonade Server

🍋 Lemonade Server is a server interface that uses the standard Open AI API, allowing applications to integrate with local LLMs. This means that you can easily replace cloud-based LLMs with private and free LLMs that run locally on your own PC's NPU and GPU.

What's new in 10.9.0

Headline - ARM64 (aarch64) Linux packages now ship for Debian, Fedora, the portable embeddable archive, and Docker. - Users can set custom system prompts for Lemonade omni models, as demonstrated in the new RPG-HaloTales-V1 narrative experience model. - A new OTLP telemetry subsystem exports traces using OpenInference and OpenTelemetry GenAI conventions, driven by a lemonade telemetry command. - The vLLM backend now works with Claude Code through the Anthropic Messages API, and adds Qwen3.6 and GLM-4.7-Flash models. Breaking Changes - The Linux system tray is now disabled by default (REQUIRE_LINUX_TRAY=OFF), so Linux server packages ship headless; rebuild with the tray option enabled to restore it. Lemonade Server Operating System Downloads Windows lemonade.msi Ubuntu 24.04+ Launchpad PPA Debian 13 (x86_64) lemonade-server_10.9.0-debian13_amd64.deb Debian 13 (ARM64) lemonade-server_10.9.0-debian13_arm64.deb Fedora 43 (x86_64) lemonade-server-10.9.0-fc43.x86_64.rpm Fedora 43 (ARM64) lemonade-server-10.9.0-fc43.aarch64.rpm Fedora 44 (x86_64) lemonade-server-10.9.0-fc44.x86_64.rpm Fedora 44 (ARM64) lemonade-server-10.9.0-fc44.aarch64.rpm macOS Lemonade-10.9.0-Darwin.pkg Other platforms? See our Installation Options for Docker, Snap, Arch, Debian, and more. Embeddable Lemonade Portable binaries for bundling into your own installer. Run lemond ./ as a subprocess. Platform Download Ubuntu x64 lemonade-embeddable-10.9.0-ubuntu-x64.tar.gz Ubuntu arm64 lemonade-embeddable-10.9.0-ubuntu-arm64.tar.gz Windows x64 lemonade-embeddable-10.9.0-windows-x64.zip macOS arm64 lemonade-embeddabl...

Read release notes

Version history

Version Updated Notes
10.9.0 Unknown Headline - ARM64 (aarch64) Linux packages now ship for Debian, Fedora, the portable embeddable archive, and Docker. - Users can set custom system prompts for Lemonade omni models, as demonstrated in the new RPG-HaloTales...
10.8.1 Unknown Headline - Speculative decoding now accepts draft/MTP/EAGLE3 checkpoints, with new Gemma-4 MTP models ready to pull. - ROCm installation and GPU detection are restored for Radeon RX RDNA2/3/4 dGPUs on Windows and Linux....
10.8.0 Unknown Release notes
10.7.0 Unknown Release notes
10.6.0 Unknown Headline - Lemonade's true omni-modal features have gotten a glow-up! - LMX-Omni-52B-Halo and LMX-Omni-5.5B-Lite replace the old "collections". - Add your own custom omni models in the GUI or CLI. - Downloading custom mo...
10.5.1 Unknown Lemonade Server Operating System Downloads Windows lemonade.msi Ubuntu 24.04+ Launchpad PPA Fedora 43 lemonade-server-10.5.1-fc43.x86_64.rpm Fedora 44 lemonade-server-10.5.1-fc44.x86_64.rpm macOS Lemonade-10.5.1-Darwin.p...
10.5.0 Unknown Headline - Upgraded to ROCm 7.13 for llama.cpp and stable-diffusion.cpp! - macOS has graduated from beta with all major features fully supported. - Overhauled management of custom/imported models and recipes. Breaking ch...
10.4.0 Unknown Lemonade Server Operating System Downloads Windows lemonade.msi (Server + App) Ubuntu 24.04+ Launchpad PPA (Server + App) Fedora 43 lemonade-server-10.4.0.x86_64.rpm (Server + App) macOS (beta) Lemonade-10.4.0-Darwin.pkg...
10.3.0 Unknown Headline - OmniRouter unifies the best backend engines to give you true omni-modal LLM chat. Make and edit images, generate speech, and more with natural language. Open AI-compliant. - The desktop app is now based on Tau...
10.2.0 Unknown Lemonade Server Operating System Downloads Windows lemonade.msi (Server + App) · lemonade-server-minimal.msi (Server Only) Ubuntu 24.04+ Launchpad PPA (Server) · lemonade-app-10.2.0-x86_64.AppImage (Companion Desktop App...
10.1.0 Unknown Headline - The new lemonade CLI is a much nicer way to interact with the service. - lemonade config replaces most uses of the conf file on Linux. - Coding is supercharged with improved lemonade launch codex|claude. - Def...
10.0.1 Unknown Headline - Debian packages are now available via PPA instead of .deb - Streamlined experience for searching and adding GGUFs from Hugging Face in the app - qwen3.5-4b now available on NPU via FastFlowLM v0.9.36 - llama.c...
10.0.0 Unknown Headline - Linux NPU support is available for LLMs and Whisper via FastFlowLM - Native integration with Claude Code: lemonade-server launch claude - A Fedora .rpm installer is now published in the release Quick Install O...
9.4.1 Unknown Headline - Support for the Qwen3.5 family of models on ROCm and Vulkan! - Redesigned the app for easier navigation with a full Backend Manager available in the app, CLI, and endpoints. - Image generation support is great...
9.4.0 Unknown Headline - Support for the Qwen3.5 family and LFM2-24B-A2B on llama.cpp + ROCm - The app's layout has been redesigned, with a new Backend Manager - Image editing endpoint added Breaking Change .deb installation has chang...
9.3.4 Unknown Headline - macOS (beta) installers are now a part of the release! - Text-to-speech is now available in the Lemonade App with the Kokoro recipe. - XDNA 2 NPU detection has been overhauled for better compatibility. Quick I...
9.3.3 Unknown Headline - FastFlowLM upgraded to 0.9.33 with Qwen2.5-VL-3B-Instruct support. - Streaming transcription enabled in the app and with /realtime endpoint. - Added Ollama API support for better native integration with local...
9.3.2 Unknown Headline - Image generation with ROCm is now support on Windows and Linux - --max-loaded-models N has been simplified and now enables N models of each type - Lots of under-the-hood improvements to polish the changes from...
9.3.1 Unknown Headline - Windows MSI installers are now code signed. Thanks, SignPath Foundation! - Ryzen AI SW 1.7 is integrated and fully replaces the prior Ryzen AI SW support. - Lemonade Marketplace is now included in the app. Bre...
9.3.0 Unknown Headline - NPU enabled for Whisper models on the /audio/transcription endpoint (Windows only). - Kokoro TTS is available on the /audio/speech endpoint. - Desktop and web apps have backend selection, providing a ROCm opti...