This is Terrence's first hardware project.
👉 Build your AI chat companion with ESP32+SenseVoice+Qwen72B! [bilibili]
👉 DIY Your AI Companion - Beginner's Tutorial [bilibili]
This project is developed based on Espressif's ESP-IDF.
This is an open-source project primarily for educational purposes. Through this project, we aim to help more people get started with AI hardware development and understand how to integrate rapidly evolving large language models into actual hardware devices. Whether you're a student interested in AI or a developer looking to explore new technologies, this project offers valuable learning experiences.
Everyone is welcome to participate in the project's development and improvement. If you have any ideas or suggestions, please feel free to raise an Issue or join our chat group.
Learning & Discussion QQ Group: 946599635
- Wi-Fi / ML307 Cat.1 4G
- BOOT button wake-up and interrupt, supporting both click and long-press triggers
- Offline voice wake-up ESP-SR
- Streaming voice dialogue (WebSocket or UDP protocol)
- Support for 5 languages: Mandarin, Cantonese, English, Japanese, Korean SenseVoice
- Voice print recognition to identify who's calling AI's name 3D Speaker
- Large model TTS (Volcengine or CosyVoice)
- Large Language Model (Qwen2.5 72B or Doubao API)
- Configurable prompts and voice tones (custom characters)
- Short-term memory with self-summary after each conversation round
- OLED / LCD display showing signal strength or conversation content
For detailed tutorial, see the Feishu document:
👉 XiaoZhi AI Chatbot Encyclopedia
Breadboard setup shown below:
- LiChuang ESP32-S3 Development Board
- Espressif ESP32-S3-BOX3
- M5Stack CoreS3
- AtomS3R + Echo Base
- MagiClick 2.4
- Xmini C3
- Waveshare ESP32-S3-Touch-AMOLED-1.8
For beginners, it's recommended to first try flashing the firmware without setting up a development environment. The firmware uses a test server provided by the author, currently available for free use (not for commercial purposes).
👉 Flash Firmware Guide (No IDF Environment Required)
- Cursor or VSCode
- Install ESP-IDF plugin, select SDK version 5.3 or above
- Linux is preferred over Windows for faster compilation and fewer driver issues
If you already have a XiaoZhi AI chatbot, please refer to 👉 Backend Operation Video Tutorial
For detailed usage instructions and test server notes, please refer to 👉 XiaoZhi Test Server Help Guide.