GuijiAI / HeyGem.ai
- воскресенье, 13 апреля 2025 г. в 00:00:02
Heygem digital human cloning intelligent agent and plugins have been successfully launched on the Coze platform. No complex deployment is required, even novice users can easily get started and use it directly.
Click here to instantly access the Coze store experience👉Silicon-based Intelligent Digital Human Cloning Agent | Silicon-based Intelligent Digital Human Cloning Plugin
Scan the code to watch the operation video
Ubuntu Version Officially Released
Dear Heygem Open Source Community Members:
We sincerely thank you for your enthusiastic attention and active participation in the Heygem digital human open source project! We have noticed that some developers face challenges during local deployment. To better meet the needs of different scenarios, we are now announcing two parallel service solutions:
Project | HeyGem Open Source Local Deployment | Digital Human/Clone Voice API Service |
---|---|---|
Usage | Open Source Local Deployment | Rapid Clone API Service |
Recommended | Technical Users | Business Users |
Technical Threshold | Developers with deep learning framework experience/pursuing deep customization/wishing to participate in community co-construction | Quick business integration/focus on upper-level application development/need enterprise-level SLA assurance for commercial scenarios |
Hardware Requirements | Need to purchase GPU server | No need to purchase GPU server |
Customization | Can modify and extend the code according to your needs, fully controlling the software's functions and behavior | Cannot directly modify the source code, can only extend functions through API-provided interfaces, less flexible than open source projects |
Technical Support | Community Support | Dynamic expansion support + professional technical response team |
Maintenance Cost | High maintenance cost | Simple maintenance |
Lip Sync Effect | Usable effect | Stunning and higher definition effect |
Commercial Authorization | Supports global free commercial use (enterprises with more than 100,000 users or annual revenue exceeding 10 million USD need to sign a commercial license agreement) | Commercial use allowed |
Iteration Speed | Slow updates, bug fixes depend on the community | Latest models/algorithms are prioritized, fast problem resolution |
We always adhere to the open source spirit, and the launch of the API service aims to provide a more complete solution matrix for developers with different needs. No matter which method you choose, you can always obtain technical support documents through James@toolwiz.com. We look forward to working with you to promote the inclusive development of digital human technology!
Silicon-based Intelligent Developer Team
From scratch, hand-in-hand to teach you how to create your own HeyGem open source AI digital human!
Rapid Clone API | API Documentation Center
Real-time Interaction SDK | SDK Documentation Center
Local Real-time Interaction (realtime) duix.ai Open Source Address | Android Version | IOS Version
Since we open-sourced Heygem, global geeks have illuminated the digital avatar matrix in the code universe, with each commit reconstructing the future! But joy is better shared than enjoyed alone—now we invite all experts to join the "Open Source Co-Creation Plan," empowering everyone with AI creativity and propelling the Chinese AI fleet towards the stars!
Co-Creation Content Direction
Share high-quality videos or articles on Heygem deployment tutorials, optimization guides, and practical cases (Bilibili, Douyin, Xiaohongshu, WeChat Official Accounts, Zhihu, etc.)
Open Source Co-Creation Special Reward Pool (Real Cash Rewards!)
(1) Basic Rewards
Content receiving 20-100 likes will be awarded the [Heygem.ai Master Award] and a 20 RMB cash red envelope.
Content receiving 100+ likes will be awarded the [Heygem.ai God Award] and a 50 RMB cash red envelope.
(2) Special Achievements:
The monthly MVP will unlock the Open Source Hall of Fame digital badge (permanently on-chain).
Participation Method
Send your creativity to the customer service lady, add a friend with the note "Name+999".
Heygem Open Source Witnessed History! Cyber Worker Revolution!
Digital Human Project Heygem Local Deployment Tutorial
So Tempting! From Paid to Open Source, AI Digital Humans Will Open a New Era
Open Source Free Digital Humans Are Here, Unlimited Times, Fast Cloning
AI Digital Humans Are Free! GitHub's Hot Project Can Run on Your Computer
HeyGem One-Click Package Windows Direct Run Without Docker Silicon-Based Open Source Digital Human
Heygem is a fully offline video synthesis tool designed for Windows systems that can precisely clone your appearance and voice, digitalizing your image. You can create videos by driving virtual avatars through text and voice. No internet connection is required, protecting your privacy while enjoying convenient and efficient digital experiences.
Must have D Drive: Mainly used for storing digital human and project data
C Drive: Used for storing service image files
System Requirements:
Recommended Configuration:
Ensure you have an NVIDIA graphics card with properly installed drivers
NVIDIA driver download link: https://www.nvidia.cn/drivers/lookup/
Use the command wsl --list --verbose
to check if WSL is installed. If it shows as below, it's already installed and no further installation is needed.
- WSL installation command:
wsl --install
- May fail due to network issues, try multiple times
- During installation, you'll need to set and remember a new username and password
Update WSL using wsl --update
.
Download Docker for Windows, choose the appropriate installation package based on your CPU architecture.
When you see this interface, installation is successful.
Run Docker
Accept the agreement and skip login on first run
Installation using Docker, docker-compose as follows:
The docker-compose.yml
file is in the /deploy
directory.
Execute docker-compose up -d
in the /deploy
directory, if you want to use the lite version, execute docker-compose -f docker-compose-lite.yml up -d
Wait patiently (about half an hour, speed depends on network), download will consume about 70GB of traffic, make sure to use WiFi
When you see three services in Docker, it indicates success
HeyGem-x.x.x-setup.exe
to installFirst, check if Docker is installed using
docker --version
. If it is installed, skip the following steps.
HeyGem-x.x.x.AppImage
to launch, no installation requiredReminder: On Ubuntu systems, if you are using the
root
user to access the desktop, double-clickingHeyGem-x.x.x.AppImage
may not work. You need to execute./HeyGem-x.x.x.AppImage --no-sandbox
in the terminal, adding the--no-sandbox
parameter.
We have opened APIs for model training and video synthesis. After Docker starts, several ports will be exposed locally, accessible through http://127.0.0.1
.
For specific code, refer to:
D:\heygem_data\voice\data
D:\heygem_data\voice\data
is agreed with theguiji2025/fish-speech-ziming
service, can be modified in docker-compose
http://127.0.0.1:18180/v1/preprocess_and_tran
interface
Parameter example:
{ "format": ".wav", "reference_audio": "xxxxxx/xxxxx.wav", "lang": "zh" }Response example:
{ "asr_format_audio_url": "xxxx/x/xxx/xxx.wav", "reference_audio_text": "xxxxxxxxxxxx" }Record the response results as they will be needed for subsequent audio synthesis
Interface: http://127.0.0.1:18180/v1/invoke
// Request parameters
{
"speaker": "{uuid}", // A unique UUID
"text": "xxxxxxxxxx", // Text content to synthesize
"format": "wav", // Fixed parameter
"topP": 0.7, // Fixed parameter
"max_new_tokens": 1024, // Fixed parameter
"chunk_length": 100, // Fixed parameter
"repetition_penalty": 1.2, // Fixed parameter
"temperature": 0.7, // Fixed parameter
"need_asr": false, // Fixed parameter
"streaming": false, // Fixed parameter
"is_fixed_seed": 0, // Fixed parameter
"is_norm": 0, // Fixed parameter
"reference_audio": "{voice.asr_format_audio_url}", // Return value from previous "Model Training" step
"reference_text": "{voice.reference_audio_text}" // Return value from previous "Model Training" step
}
Synthesis interface: http://127.0.0.1:8383/easy/submit
// Request parameters
{
"audio_url": "{audioPath}", // Audio path
"video_url": "{videoPath}", // Video path
"code": "{uuid}", // Unique key
"chaofen": 0, // Fixed value
"watermark_switch": 0, // Fixed value
"pn": 1 // Fixed value
}
Progress query: http://127.0.0.1:8383/easy/query?code=${taskCode}
GET request, the parameter
taskCode
is thecode
from the synthesis interface input above
Check if all three services are in Running status
Confirm that your machine has an NVIDIA graphics card and drivers are correctly installed.
All computing power for this project is local. The three services won't start without an NVIDIA graphics card or proper drivers.
Ensure both server and client are updated to the latest version. The project is newly open-sourced, the community is very active, and updates are frequent. Your issue might have been resolved in a new version.
/deploy
directory and re-execute docker-compose up -d
pull
code and re-build
GitHub Issues are continuously updated, issues are being resolved and closed daily. Check frequently, your issue might already be resolved.
Problem Description
Describe the reproduction steps in detail, with screenshots if possible.
Provide Error Logs
James@toolwiz.com