Modern local voice assistant with React frontend and Python backend using Ollama, Whisper, and XTTS
📺 Watch the tutorial • 💬 Get free consultation • 📞 AI phone solutions
Live Demo Features: Chat Interface • Voice Mode • Real-time Processing • Modern UI • Completely Local
- 💬 Chat Interface: Text conversation with local AI models
- 🎤 Voice Mode: Speech-to-text with natural voice responses
- 🔄 Real-time Status: Live feedback during processing
- 🎨 Modern UI: Beautiful React interface with smooth animations
- 🌐 Completely Local: No cloud services, full privacy
- 🔧 Open Source: Extend and customize as needed
- Python 3.8+ (3.11+ recommended)
- Node.js 16+
- Ollama (Install here)
# 1. Clone the repository
git clone https://github.com/everlastconsulting/gpt-oss-local-voice-agent-demo.git
cd gpt-oss-local-voice-agent-demo
# 2. Backend setup
python3 -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
pip install -r requirements.txt
# 3. Frontend setup
npm install
# 4. Configure Ollama
ollama pull gpt-oss:20b
# 5. Start the application
./start.sh # Starts both backend and frontendAccess the app at: http://localhost:3000
- Type your message and press Enter
- Get AI responses in real-time
- Click "Voice Mode" button
- Click microphone and speak your question
- Watch real-time transcription
- Listen to AI voice response
Create .env file for custom settings:
# Ollama Model
OLLAMA_MODEL=gpt-oss:20b
# Audio Settings
RECORD_DURATION=4
TTS_LANGUAGE=de
# Server
FLASK_RUN_PORT=8080- Frontend: React, Tailwind CSS, Framer Motion
- Backend: Flask, Python
- AI/ML: Ollama (LLM), Whisper (STT), XTTS (TTS)
- Audio: SoundDevice, NumPy, SciPy
This is a demo project - feel free to:
- 🌟 Star if you find it useful
- 🐛 Report issues you encounter
- 💡 Suggest features in discussions
- 🔧 Submit pull requests for improvements
- 🌍 Add more languages
- 🎨 Improve UI/UX
- 📱 Mobile responsiveness
- ⚡ Performance optimizations
This project was created as part of our AI development series. Check out:
- 📺 YouTube Channel: EverLast AI - AI tutorials and demos
- 💬 Free Consultation: kiberatung.de - Get expert AI advice
- 📞 AI Phone Assistants: kitelefonagent.de - Professional AI phone solutions
MIT License - see LICENSE file
- Ollama for local LLM hosting
- OpenAI Whisper for speech recognition
- Coqui TTS for text-to-speech
⭐ Star this repo if you like it! ⭐
Built with ❤️ for the open source community