Music & Speech Documentation
MuAPI provides professional-grade audio synthesis and lip-synchronization tools to bring your creative projects to life with realistic sound and speech.
1. Music Generation (Suno)
Create, remix, and extend professional music tracks using the Suno model family.
- Endpoints:
POST /api/v1/suno-create-music: Generate new tracks from prompts.POST /api/v1/suno-remix-music: Create variations of existing audio.POST /api/v1/suno-extend-music: Continue an existing track with new sections.
- Features: High-fidelity audio, genre/mood control, and seamless extensions.
2. Lip-Synchronization
Synchronize character lip movements with audio tracks using state-of-the-art sync models.
- Models Supported:
- Sync-Lipsync: Optimized for high-fidelity facial alignment.
- LatentSync: Faster inference with smooth temporal consistency.
- Creatify/Veed: Specialized models for different video formats.
- Endpoints:
POST /api/v1/sync-lipsyncPOST /api/v1/latentsync-videoPOST /api/v1/creatify-lipsyncPOST /api/v1/veed-lipsync
3. Audio & Music Utilities (MMAudio)
- Text-to-Audio: Generate Foley, sound effects, or speech.
- Video-to-Video Audio: Synchronize audio with pre-existing video motion.
- Endpoints:
POST /api/v1/mmaudio-v2/text-to-audio,POST /api/v1/mmaudio-v2/video-to-video