Commit a363e14

feat: add some blogs for release notes from 1.2 to 1.0
1 parent efae7af commit a363e14

8 files changed

+864
-1
lines changed

blog/2025-02-04-v1-0-1-release.md

Lines changed: 142 additions & 0 deletions
@@ -0,0 +1,142 @@
1+
---
2+
title: 1.0.1 Release
3+
description: Version 1.0.1 Release
4+
slug: v1.0.1-release
5+
authors: [tim, ethan]
6+
tags: [release]
7+
image: https://i.imgur.com/mErPwqL.png
8+
hide_table_of_contents: false
9+
---
10+
11+
12+
# Open-LLM-VTuber v1.0.1 Release 💥
13+
14+
This release marks a significant milestone for Open-LLM-VTuber, featuring a complete rewrite of the backend and frontend with more than 240 new commits, along with numerous enhancements and new features. If you used an earlier version, `v1.0.0` is essentially a new app.
15+
16+
⚠️ Direct upgrades from older versions are impossible due to architectural changes. Please refer to our **[new documentation site](https://open-llm-vtuber.github.io/docs/intro)** for installation.
17+
18+
(v1.0.0 shipped with a bug that surfaced right after the release, so please skip it and use v1.0.1 instead.)
19+
20+
| ![i4_pet_desktop](https://github.com/user-attachments/assets/06eff9dc-e141-4401-90ac-823b08662aae) | ![i1](https://github.com/user-attachments/assets/e0175aa3-62c8-4cde-9c6f-5d010727c04f) |
21+
|:---:|:---:|
22+
| ![i3](https://github.com/user-attachments/assets/082d8f29-9b48-4dbb-87f6-0f12d89a92f2) | ![i2](https://github.com/user-attachments/assets/f6b50eda-8187-4d37-b39b-a34e33683328) |
23+
| ![i4](https://github.com/user-attachments/assets/fa4a5884-0ec7-4377-8a3b-204aafaf8ede) | ![i3_browser_world_fun](https://github.com/user-attachments/assets/8e0819d2-75dd-4ebf-97ab-399bf2d01795) |
24+
25+
<!-- truncate -->
26+
27+
## ✨ Highlights
28+
* **Vision Capability:** Video chat with the AI.
29+
* **Desktop Pet Mode:** A new Desktop Pet Mode lets you have your VTuber companion directly on your desktop.
30+
* **Brand New Frontend:** A completely redesigned frontend built with React, Chakra UI, and Vite offers a modern user experience. It is available as web and desktop apps and lives in the [Open-LLM-VTuber-Web](https://github.com/Open-LLM-VTuber/Open-LLM-VTuber-Web) repository.
31+
* **Chat History Management:** Implemented a system to store and retrieve conversation history, enabling persistent interactions with your AI.
32+
* **New LLM support:** Many new (stateless) LLM providers are now supported (and refactored), including Ollama, OpenAI, Gemini, Claude, Mistral, DeepSeek, Zhipu, and llama.cpp.
33+
* **DeepSeek R1 Reasoning model support**: The reasoning chain will be displayed but not spoken. See your waifu's inner thoughts!
34+
* **Major Backend Rewrite:** The core of Open-LLM-VTuber has been rebuilt from the ground up, focusing on asynchronous operations, improved memory management, and a more modular architecture.
35+
* **Refactored Configuration:** The `conf.yaml` file was restructured, and `config_alts` has been renamed to `characters`.
36+
* **TTS Preprocessor**: Text inside `asterisks`, `brackets`, `parentheses`, and `angle brackets` will no longer be spoken by the TTS.
37+
* **Dependency management:** Switched to `uv` for dependency management and removed unused dependencies such as `rich`, `playsound3`, and `sounddevice`.
38+
* **Documentation Site:** A comprehensive documentation site is now live at [https://open-llm-vtuber.github.io/](https://open-llm-vtuber.github.io/).
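
The TTS preprocessor highlight above can be pictured with a small filter. This is only a sketch under assumptions: the function name `strip_unspoken`, the filter keys, and the regular expressions are hypothetical and are not taken from the project's code. It also ignores nested delimiters.

```python
import re

# Hypothetical patterns, one per filter named in the release notes.
# Each matches an opening delimiter, any non-delimiter text, and the closer.
_PATTERNS = {
    "asterisks": re.compile(r"\*[^*]*\*"),
    "brackets": re.compile(r"\[[^\[\]]*\]"),
    "parentheses": re.compile(r"\([^()]*\)"),
    "angle_brackets": re.compile(r"<[^<>]*>"),
}


def strip_unspoken(text: str, filters=tuple(_PATTERNS)) -> str:
    """Remove delimited spans that should be displayed but not spoken."""
    for name in filters:
        text = _PATTERNS[name].sub("", text)
    # Collapse the whitespace left behind by the removed spans.
    return re.sub(r"\s+", " ", text).strip()


print(strip_unspoken("*waves* Hello [smile] there (quietly) <sigh> friend"))
```

Passing a shorter `filters` tuple, for example `("asterisks",)`, leaves the other delimiters untouched, which mirrors the idea of toggling each filter separately.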
39+
40+
## 📋 Detailed Changes
41+
42+
### 🧮 Backend
43+
44+
* **Architecture:**
45+
* The project structure has been reorganized to use the `src/` directory.
46+
* The backend is now fully asynchronous, improving responsiveness.
47+
* CLI mode (`main.py`) has been removed.
48+
* The "exit word" has been removed.
49+
* Models are initialized and managed using `ServiceContext`, offering better memory management, particularly when switching characters.
50+
* Refactored LLMs into `agent` and `stateless_llm`, supporting a wider range of LLMs with a new agent interface: `basic_memory_agent` and `hume_ai_agent`.
51+
* **LLM (Language Model) Enhancements:**
52+
* New (and old but refactored) providers: Ollama, OpenAI (and any OpenAI Compatible API), Gemini, Claude, Mistral, DeepSeek, Zhipu, llama.cpp.
53+
* `temperature` parameter added.
54+
* No more tokens will be generated after interruption, improving the responsiveness of voice interruption.
55+
* Ollama models are preloaded at startup, kept in memory for the server's duration, and unloaded at exit.
56+
* Added a `hf_mirror` flag to specify whether to use the Hugging Face mirror source.
57+
* **TTS (Text-to-Speech) Enhancements:**
58+
* TTS now generates multiple audio segments concurrently and sends them sequentially, reducing latency.
59+
* New interruption logic for smoother transitions.
60+
* Added filters (`asterisks`, `brackets`, `parentheses`) to prevent unwanted text from being spoken.
61+
* Implemented `faster_first_response` feature to prioritize the synthesis and playback of the first sentence fragment, minimizing latency.
62+
* **ASR (Automatic Speech Recognition) Enhancements:**
63+
* Made Sherpa-onnx ASR with the **SenseVoiceSmall int8** model the default for both English and Chinese presets, with automatic model download.
64+
* Added a `provider` option for sherpa-onnx-asr.
65+
* **Other Improvements:**
66+
* Chat log persistence is used to maintain conversation history.
67+
* All `print` statements are replaced with `loguru` for structured logging.
68+
* Added a Chinese configuration preset: `conf.CN.yaml`.
69+
* Basic AI proactive speaking (experimental).
70+
* Added some checks in the CI/CD process
71+
* Added an input/output type system to agents
72+
* Added **Tencent Translate** in https://github.com/Open-LLM-VTuber/Open-LLM-VTuber/pull/107
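
The "generate concurrently, send sequentially" behaviour described in the TTS enhancements above can be sketched with `asyncio`. This is an illustration under assumptions, not the project's implementation: `synthesize` is a stand-in for a real TTS engine call, and `speak_in_order` and `send` are hypothetical names.

```python
import asyncio


async def synthesize(sentence: str) -> bytes:
    """Stand-in for a TTS engine call (hypothetical); the delay varies per sentence."""
    await asyncio.sleep(0.01 * (5 - len(sentence) % 5))
    return sentence.encode("utf-8")


async def speak_in_order(sentences, send):
    # Start synthesis for every sentence immediately so the work overlaps.
    tasks = [asyncio.create_task(synthesize(s)) for s in sentences]
    # Await the tasks in their original order so audio is delivered
    # sentence by sentence, even if a later sentence finishes first.
    for task in tasks:
        await send(await task)


async def demo():
    delivered = []

    async def send(audio: bytes) -> None:
        delivered.append(audio)

    await speak_in_order(["Hi.", "How are you?", "Bye"], send)
    return delivered


print(asyncio.run(demo()))
```

The first segment can be sent as soon as it is ready while the remaining ones are still being synthesized, which is where the latency reduction would come from in a design like this.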
73+
74+
### 🖥️ Frontend
75+
76+
* **New frontend built with Electron, React, Chakra UI, and Vite.**
77+
* **Multi-Mode in Single Codebase:**
78+
* Web Mode: Browser interface
79+
* Window Mode: Desktop window
80+
* Pet Mode: Transparent desktop companion
81+
* Seamless context sharing between Window and Pet modes, allowing for the preservation of settings, history, connections, and model states.
82+
* **Enhanced UI Features**
83+
* Responsive layout with collapsible sidebar and footer
84+
* Customizable Live2D model interactions: mouse tracking for eye movement, click-triggered animations, and drag and resize capabilities.
85+
* Persistent local storage for user preference settings, including background, VAD configuration, Live2D size and interactions, and agent behavior.
86+
* Supports viewing, loading, and deleting conversation history with streaming subtitles.
87+
* (Electron pet mode) A transparent, always-on-top desktop companion with click-through on non-interactive areas, a draggable and hideable Live2D model and UI, and right-click menu controls.
88+
* Camera and screen capturing panel
89+
* Switch characters easily
90+
91+
### 📖 Documentation
92+
93+
* Rewritten README file.
94+
* New comprehensive documentation with a dedicated website.
95+
96+
### 🧹 Cleanup
97+
98+
* Removed unused and legacy code, including `TaskQueue.py`, `scripts/install_piper_tts.py`, `model_manager_old.py`, `service_context_old.py`, `main.py`, `asr_with_vad`, `vad`, `start_cli`, `fake_llm`, `MemGPT`, the `pywhispercpp` submodule, and CoreML script.
99+
* Removed unused dependencies: `rich`, `playsound3`, `sounddevice`, among others.
100+
* Removed configuration options that are no longer relevant: `VOICE_INPUT_ON`, `MIC_IN_BROWSER`, `LIVE2D`, `EXTRA_SYSTEM_PROMPT_RAG`, `AI_NAME`, `USER_NAME`, `SAVE_CHAT_HISTORY`, `CHAT_HISTORY_DIR`, `RAG_ON`, `LLMASSIST_RAG_ON`, `SAY_SENTENCE_SEPARATELY`, `MEMORY_SNAPSHOT`, `PRELOAD_MODELS`, `tts_on`.
101+
102+
103+
## ⚠️⚠️⚠️ Critical Upgrade Notice
104+
105+
106+
1. No Direct Upgrades - Previous installations are incompatible
107+
108+
2. Fresh Install Required - Follow new documentation
109+
110+
3. Config Changes - Back up existing configurations before migration
111+
112+
### Why the Hassle? 💡
113+
114+
1. UV dependency manager replaces legacy systems
115+
2. Complete configuration schema overhaul
116+
117+
118+
119+
Please check out the [new documentation](https://open-llm-vtuber.github.io/docs/quick-start/) to install Open-LLM-VTuber again. Fortunately, thanks to `uv`, there should be fewer headaches during installation.
120+
121+
122+
## 🎉 Contributors
123+
- @t41372, that's me
124+
- @ylxmf2005, the creator of the new frontend, who implemented LLM vision capability, chat history management, TTS concurrency, the Hume AI agent, better sentence division, a better Live2D configuration, countless bug fixes, and more. He also wrote the majority of the documentation and provided countless insights. The version `v1.0.0` was a close collaboration with him and wouldn't have existed without his tremendous contribution.
125+
- @Stewitch, who added the hf_mirror option and is currently working on a launcher for this project to streamline the installation and configuration process. It's still a work in progress but will be completed very soon. https://github.com/Stewitch/LiZhen
126+
- @Fluchw, who added the Tencent translator and helped us fix the translator bug.
127+
128+
And all the other contributors who worked on this project in previous versions.
129+
130+
131+
**Full Changelog**: https://github.com/Open-LLM-VTuber/Open-LLM-VTuber/compare/v0.5.2...v1.0.0
132+
133+
134+
## (Relatively) faster download links for users in mainland China
135+
Open-LLM-VTuber-v1.0.3.zip (includes the SenseVoice model for sherpa-onnx ASR, so it does not have to be pulled from GitHub)
136+
- https://pub-17317087be374bc68161ac63de2022a5.r2.dev/v1.0.3/Open-LLM-VTuber-v1.0.3.zip
137+
138+
open-llm-vtuber-electron-1.0.0-frontend.exe (desktop frontend, Windows)
139+
- https://pub-17317087be374bc68161ac63de2022a5.r2.dev/v1.0.3/open-llm-vtuber-electron-1.0.0-setup.exe
140+
141+
open-llm-vtuber-electron-1.0.0-frontend.dmg (desktop frontend, macOS)
142+
- https://pub-17317087be374bc68161ac63de2022a5.r2.dev/v1.0.3/open-llm-vtuber-electron-1.0.0.dmg

blog/2025-02-20-v1-1-0-release.md

Lines changed: 80 additions & 0 deletions
@@ -0,0 +1,80 @@
1+
---
2+
title: 1.1.0 Release
3+
description: Version 1.1.0 Release
4+
slug: v1.1.0-release
5+
authors: [tim, ethan]
6+
tags: [release]
7+
image: https://i.imgur.com/mErPwqL.png
8+
hide_table_of_contents: false
9+
---
10+
11+
12+
## What's Changed
13+
14+
### Major Features
15+
* Implemented group chat functionality (@ylxmf2005)
16+
* Added Silero-VAD voice activity detection (@AnyaCoder)
17+
* Added CosyVoice2 text-to-speech support (@Warma10032)
18+
* Added frontend ASR/TTS tools accessible at `http://localhost:web-tool`
19+
- Users can now directly use the project's speech recognition and text-to-speech engines
20+
* Introduced one-click CUDA-ready setup using pixi (@mokurin000)
21+
* Improved configuration management and update mechanism:
22+
- `conf.yaml` is no longer tracked in git
23+
- New config template system for generating and updating `conf.yaml` during upgrades
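
One way to picture updating `conf.yaml` from a template during an upgrade is a recursive merge that keeps the user's values and fills in keys the template has added. This is a sketch under assumptions: `merge_template` is a hypothetical helper, the sample keys are made up, and the project's actual upgrade logic may differ.

```python
from copy import deepcopy


def merge_template(template: dict, user: dict) -> dict:
    """Keep user values and add any keys that exist only in the template."""
    merged = deepcopy(user)
    for key, default in template.items():
        if key not in merged:
            # New option introduced by the template: take its default.
            merged[key] = deepcopy(default)
        elif isinstance(default, dict) and isinstance(merged[key], dict):
            # Both sides are nested sections: merge them recursively.
            merged[key] = merge_template(default, merged[key])
    return merged


# Hypothetical config fragments, already parsed into dictionaries.
template = {"tts": {"model": "a", "rate": 1.0}, "vad": {"on": True}}
user = {"tts": {"model": "b"}}
print(merge_template(template, user))
```

In this sketch the user's choice of `model` survives, while the new `rate` and `vad` options arrive with their template defaults.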
24+
25+
<!-- truncate -->
26+
27+
### Bug Fixes & Improvements
28+
* Fixed sentence divider issues
29+
* Fixed system prompt override bug for certain LLMs
30+
* Removed deprecated `prompts/persona` directory (unused since v1.0.0)
31+
* Major codebase refactoring of conversation and handler components (@ylxmf2005)
32+
33+
### New Contributors
34+
* @mokurin000
35+
* @AnyaCoder
36+
* @Warma10032
37+
38+
**Full Changelog**: https://github.com/Open-LLM-VTuber/Open-LLM-VTuber/compare/v1.0.0...v1.1.0
39+
40+
41+
## Which files should I get?
42+
43+
### For Existing Open-LLM-VTuber Users (v1.0.0 or newer)
44+
1. Run `uv run upgrade.py` to update to the latest version
45+
2. Download the new Electron app from the releases section (below)
46+
47+
### For New Users or Users on Versions Below v1.0.0
48+
Please refer to the [new deployment documentation](https://docs.llmvtuber.com/docs/quick-start) for installation instructions.
50+
51+
### Download Files 下载文件
52+
If you are here because you read the documentation, download the zip file and the electron app below.
53+
Download both of these files:
54+
1. The electron app
55+
2. The language-specific ZIP file:
56+
- English: `Open-LLM-VTuber-v1.1.0-en.zip`
57+
- Chinese: `Open-LLM-VTuber-v1.1.0-zh.zip`
58+
59+
Note: The ZIP files are identical except for the language of the configuration file. Both packages include the SenseVoiceSmall model file to ensure accessibility for Chinese users.
60+
69+
70+
71+
## (Relatively) faster download links for users in mainland China
72+
Open-LLM-VTuber-v1.1.0-zh.zip (includes the SenseVoice model for sherpa-onnx ASR, so it does not have to be pulled from GitHub)
73+
- [Open-LLM-VTuber-v1.1.0-en.zip](https://pub-17317087be374bc68161ac63de2022a5.r2.dev/v1.1.0/Open-LLM-VTuber-v1.1.0-en.zip)
74+
- [Open-LLM-VTuber-v1.1.0-zh.zip](https://pub-17317087be374bc68161ac63de2022a5.r2.dev/v1.1.0/Open-LLM-VTuber-v1.1.0-zh.zip)
75+
76+
open-llm-vtuber-electron-1.1.0-frontend.exe (desktop frontend, Windows)
77+
- https://pub-17317087be374bc68161ac63de2022a5.r2.dev/v1.1.0/open-llm-vtuber-electron-1.1.0-setup.exe
78+
79+
open-llm-vtuber-electron-1.1.0-frontend.dmg (desktop frontend, macOS)
80+
- https://pub-17317087be374bc68161ac63de2022a5.r2.dev/v1.1.0/open-llm-vtuber-electron-1.1.0.dmg
