Nab theme, more professional navigation theme
Ctrl + D Favorites
Current Position:fig. beginning " AI Tool Library

Weebo: a real-time voice chatbot that provides a natural language conversational experience

2025-01-17 875

General Introduction

Weebo is an open source real-time voice chatbot that utilizes the Whisper Small for speech recognition, Llama 3.2 for natural language generation, and Kokoro-82M for speech synthesis. Developed by Amanvir Parhar, the project aims to provide an efficient voice dialog solution that runs on native devices.Weebo supports multiple voices and smoothly generates real-time responses for a wide range of application scenarios that require voice interaction.

 

Function List

  • Real-time speech recognition: Efficient speech-to-text processing using the Whisper Small model.
  • Natural Language Generation: Generate natural language responses with the Llama 3.2 model.
  • Speech Synthesis: Converts text to speech using the Kokoro-82M model.
  • Multi-sound support: Provides multiple sound options to enhance the user experience.
  • Runs locally: No need to rely on cloud services, all processing is done on the local device.
  • Open source code: the code is open, allowing users to freely modify and extend the functionality.

 

Using Help

Installation process

  1. Download the required model:
    • Download Kokoro-82M model file kokoro-v0_19.onnx and placed in the project folder.
    • utilization Ollama The tool pulls the Llama 3.2 model.
  2. Clone Weebo project code:
   git clone https://github.com/amanvirparhar/weebo.git
cd weebo
  1. Install the dependencies:
   pip install -r requirements.txt
  1. Run the chatbot:
   python main.py

Instructions for use

  1. After starting the program, Weebo will start listening for voice input.
  2. Users can speak naturally and Weebo will generate a voice response after a short pause.
  3. check or refer to Ctrl+C The program can be stopped.

Main function operation flow

  • speech recognition: Weebo uses the Whisper Small model for speech recognition and is able to accurately convert a user's speech into text.
  • natural language generation: Using the Llama 3.2 model, Weebo understands the user's speech input and generates a natural language response.
  • speech synthesis: Using the Kokoro-82M model, Weebo converts the generated text response into speech and plays it back over the loudspeaker.
  • Multi-Voice Support: Users can select different sound models in the configuration file to meet different application requirements.

Detailed steps

  1. Launch Weebo: Run python main.pyThe program will start listening to the user's voice input.
  2. voice input: Users can speak directly into the microphone and Weebo will automatically recognize and process the voice.
  3. Generating a Response: After recognizing the speech, Weebo generates a natural language response using the Llama 3.2 model and converts it to speech using the Kokoro-82M model.
  4. Playback Response: The generated voice response is played through the speaker and the user can hear Weebo's answer.
  5. stop program: Press Ctrl+C Weebo can be stopped at any time.

With the above steps, users can easily start using Weebo to have real-time voice conversations and experience natural and smooth voice interaction.

Recommended

Can't find AI tools? Try here!

Just type in the keyword Accessibility Bing SearchYou can quickly find all the AI tools on this site.

Scan the code to follow

qrcode

Contact Us

Top

en_USEnglish