Skip to content

v3.4.1

Latest
Compare
Choose a tag to compare
@oobabooga oobabooga released this 31 May 02:12
ae61c1a

Changes

  • Add attachments support (text files, PDF documents) (#7005).
    • This is not RAG. The attachment gets fully added to the prompt!
  • Add a web search feature (#7023). The search query is generated by the LLM based on your input, and the search is performed using DuckDuckGo.
  • Add date/time to chat messages (#7003)
  • Add message version navigation (#6947). Thanks @Th-Underscore.
    • This is equivalent to the "swipes" in SillyTavern. Press left/right to navigate versions, press right while at the latest reply version to generate a new version.
  • Add footer buttons for editing messages (#7019). Thanks @Th-Underscore.
  • Add a "Branch here" footer button to chat messages (#6967). Thanks @Madrawn
  • Add a token counter to the chat tab (counts input + history, including attachments)
  • Make the dark theme darker
  • Improve the light theme
  • Improve the style of thinking blocks
  • Add back max_updates_second to resolve a UI performance issue when streaming very fast (~200 tokens/second)

Bug fixes

  • Close response generator when stopping API generation (#7014). Thanks @djholtby
  • Fix the chat area height when "Show controls" is unchecked
  • Remove unnecessary js that was causing scrolling issues during streaming
  • Fix loading Llama-3_3-Nemotron-Super-49B-v1 and similar models
  • Fix Dockerfile for AMD and Intel (#6995). Thanks @TheGameratorT
  • Fix 'Start reply with' (new in v3.4.1)
  • Fix exllamav3_hf models failing to unload (new in v3.4.1)

Backend updates


Portable builds

Below you can find portable builds: self-contained packages that work with GGUF models (llama.cpp) and require no installation! Just download the right version for your system, unzip, and run.

Choosing the right build:

  • Windows/Linux:

    • NVIDIA GPU: Use cuda12.4 for newer GPUs or cuda11.7 for older GPUs and systems with older drivers.
    • AMD/Intel GPU: Use vulkan builds.
    • CPU only: Use cpu builds.
  • Mac:

    • Apple Silicon: Use macos-arm64.
    • Intel CPU: Use macos-x86_64.

Updating a portable install:

  1. Download and unzip the latest version.
  2. Replace the user_data folder with the one in your existing install. All your settings and models will be moved.