
Changes
- Add attachments support (text files, PDF documents) (#7005).
- This is not RAG. The attachment gets fully added to the prompt!
- Add a web search feature (#7023). The search query is generated by the LLM based on your input, and the search is performed using DuckDuckGo.
- Add date/time to chat messages (#7003)
- Add message version navigation (#6947). Thanks @Th-Underscore.
- This is equivalent to the "swipes" in SillyTavern. Press left/right to navigate versions, press right while at the latest reply version to generate a new version.
- Add footer buttons for editing messages (#7019). Thanks @Th-Underscore.
- Add a "Branch here" footer button to chat messages (#6967). Thanks @Madrawn
- Add a token counter to the chat tab (counts input + history, including attachments)
- Make the dark theme darker
- Improve the light theme
- Improve the style of thinking blocks
- Add back
max_updates_second
to resolve a UI performance issue when streaming very fast (~200 tokens/second)
Bug fixes
- Close response generator when stopping API generation (#7014). Thanks @djholtby
- Fix the chat area height when "Show controls" is unchecked
- Remove unnecessary js that was causing scrolling issues during streaming
- Fix loading
Llama-3_3-Nemotron-Super-49B-v1
and similar models - Fix Dockerfile for AMD and Intel (#6995). Thanks @TheGameratorT
- Fix 'Start reply with' (new in v3.4.1)
- Fix exllamav3_hf models failing to unload (new in v3.4.1)
Backend updates
- Bump llama.cpp to ggml-org/llama.cpp@b7a1746
Portable builds
Below you can find portable builds: self-contained packages that work with GGUF models (llama.cpp) and require no installation! Just download the right version for your system, unzip, and run.
Choosing the right build:
-
Windows/Linux:
- NVIDIA GPU: Use
cuda12.4
for newer GPUs orcuda11.7
for older GPUs and systems with older drivers. - AMD/Intel GPU: Use
vulkan
builds. - CPU only: Use
cpu
builds.
- NVIDIA GPU: Use
-
Mac:
- Apple Silicon: Use
macos-arm64
. - Intel CPU: Use
macos-x86_64
.
- Apple Silicon: Use
Updating a portable install:
- Download and unzip the latest version.
- Replace the
user_data
folder with the one in your existing install. All your settings and models will be moved.