Skip to content

Conversation

BrewTestBot
Copy link
Member

@BrewTestBot BrewTestBot commented Dec 4, 2024

Created by brew bump


Created with brew bump-formula-pr.

  • resource blocks have been checked for updates.
release notes
# LocalAI release v2.24.0!

b642257566578

🚀 Highlights

  • Backend deprecation: We’ve removed rwkv.cpp and bert.cpp, replacing them with enhanced functionalities in llama.cpp for simpler installation and better performance.
  • New Backends Added: Introducing bark.cpp for text-to-audio and stablediffusion.cpp for image generation, both powered by the ggml framework.
  • Voice Activity Detection (VAD): Added support for silero-vad to detect speech in audio streams.
  • WebUI Improvements: Now supports API key authentication for enhanced security.
  • Real-Time Token Usage: Monitor token consumption during streamed outputs.
  • Expanded P2P Settings: Greater flexibility with new configuration options like listen_maddrs, dht_announce_maddrs, and bootstrap_peers.

📤 Backends Deprecation

As part of our cleanup efforts, the rwkv.cpp and bert.cpp backends have been deprecated. Their functionalities are now integrated into llama.cpp, offering a more streamlined and efficient experience.

:new: New Backends Introduced

  • bark.cpp Backend: Transform text into realistic audio using Bark, a transformer-based text-to-audio model. Install it easily with:

    local-ai models install bark-cpp-small

    Or start it directly:

    local-ai run bark-cpp-small
  • stablediffusion.cpp Backend: Create high-quality images from textual descriptions using the Stable Diffusion backend, now leveraging the ggml framework.

  • Voice Activity Detection with silero-vad: Introducing support for accurate speech segment detection in audio streams. Install via:

    local-ai models install silero-vad

Or configure it through the WebUI.

:lock: WebUI Access with API Keys

The WebUI now supports API key authentication. If one or more API Keys are configured, the WebUI will automatically display a page to authenticate with.

🏆 Enhancements and Features

  • Real-Time Token Usage: Monitor token consumption dynamically during streamed outputs. This feature helps optimize performance and manage costs effectively.
  • P2P Configuration: New settings for advanced peer-to-peer mode:
    • listen_maddrs: Define specific multiaddresses for your node.
    • dht_announce_maddrs: Specify addresses to announce to the DHT network.
    • bootstrap_peers: Set custom bootstrap peers for initial connectivity.
      These options offer more control, especially in constrained networks or custom P2P environments.

:framed_picture: New Models in the Gallery

We've significantly expanded our model gallery with a variety of new models to cater to diverse AI applications. Among these:

  • Calme-3 Qwen2.5 Series: Enhanced language models offering improved understanding and generation capabilities.
  • Mistral-Nemo-Prism-12b: A powerful model designed for complex language tasks.
  • Llama 3.1 and 3.2 Series: Upgraded versions of the Llama models with better performance and accuracy.
  • Qwen2.5-Coder Series: Specialized models optimized for code generation and programming language understanding.
  • Rombos-Coder Series: Advanced coder models for sophisticated code-related tasks.
  • Silero-VAD: High-quality voice activity detection model for audio processing applications.
  • Bark-Cpp-Small: Lightweight audio generation model suitable for quick and efficient audio synthesis.

Explore these models and more in our updated model gallery to find the perfect fit for your project needs.

🐞 Bug Fixes and Improvements

  • Performance Enhancements: Resolved issues with AVX flags and optimized binaries for accelerated performance, especially on macOS systems.
  • Dependency Updates: Upgraded various dependencies to ensure compatibility, security, and performance improvements across the board.
  • Parsing Corrections: Fixed parsing issues related to maddr and ExtraLLamaCPPArgs in P2P configurations.

📚 Documentation and Examples

  • Updated Guides: Refreshed documentation with new configuration examples, making it easier to get started and integrate the latest features.

📥 How to Upgrade

To upgrade to LocalAI v2.24.0:

  • Download the Latest Release: Get the binaries from our GitHub Releases page.
  • Update Docker Image: Pull the latest Docker image using:
docker pull localai/localai:latest

See also the Documentation at: https://localai.io/basics/container/#standard-container-images

Happy hacking!

What's Changed

Breaking Changes 🛠

Bug fixes :bug:

Exciting New Features 🎉

🧠 Models

📖 Documentation and examples

👒 Dependencies

Other Changes

New Contributors

Full Changelog: mudler/LocalAI@v2.23.0...v2.24.0

@github-actions github-actions bot added go Go use is a significant feature of the PR or issue python Python use is a significant feature of the PR or issue bump-formula-pr PR was created using `brew bump-formula-pr` labels Dec 4, 2024
localai: grpcio-tools 1.68.1

Signed-off-by: Rui Chen <rui@chenrui.dev>
Copy link
Contributor

github-actions bot commented Dec 5, 2024

@github-actions github-actions bot added the CI-published-bottle-commits The commits for the built bottles have been pushed to the PR branch. label Dec 5, 2024
@BrewTestBot BrewTestBot enabled auto-merge December 5, 2024 01:52
@BrewTestBot BrewTestBot added this pull request to the merge queue Dec 5, 2024
Merged via the queue into master with commit b507346 Dec 5, 2024
15 checks passed
@BrewTestBot BrewTestBot deleted the bump-localai-2.24.0 branch December 5, 2024 01:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bump-formula-pr PR was created using `brew bump-formula-pr` CI-published-bottle-commits The commits for the built bottles have been pushed to the PR branch. go Go use is a significant feature of the PR or issue python Python use is a significant feature of the PR or issue
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants