Avatar & Voice Service
MeetLoyd provides its own built-in avatar and voice service. There are no third-party providers involved -- MeetLoyd is the sole provider for all avatar and voice capabilities on the platform.
Avatar Types
| Type | Description | Best For |
|---|---|---|
| Image-based | Created from a single photograph or AI-generated portrait | Quick setup, consistent branding, lower resource usage |
| Video-based | Created from video footage with natural expressions and movements | Most realistic results, premium client-facing experiences |
Both types support AI lip-sync, multiple languages, and custom backgrounds.
Voice Profiles
Every agent can have a voice profile that gives it a distinctive speaking voice. Voice profiles come in two flavors:
- Cloned -- Upload a WAV sample through the voice onboarding flow to create a custom voice that matches a specific speaker.
- Library -- Select from MeetLoyd's built-in voice library with a range of tones, accents, and languages.
Voice profiles are stored per tenant and linked to individual agents. Each agent can have at most one voice profile assigned.
How It Works
Agent generates text response --> Avatar service selects avatar + voice settings --> AI lip-sync + voice synthesis --> Video or audio returned to client
The agent writes its response as text. MeetLoyd's avatar service handles everything else -- voice synthesis, lip syncing, and video rendering.
Video Status Lifecycle
| Status | Description |
|---|---|
| Pending | Job queued, waiting to start |
| Processing | Video is being generated |
| Completed | Video is ready for playback |
| Failed | Generation failed (check logs) |
Best Practices
Shorter messages generate faster and cost less. Aim for under 30 seconds of speech.
Set the language to match your users. Avatars and voices support multiple languages.
For the best user experience, display the text response immediately and load the avatar video asynchronously when it finishes rendering.
Credential Security
- All avatar and voice data is encrypted at rest using AES-256
- Voice samples uploaded for cloning are stored securely per tenant
- All data is transmitted over HTTPS only
Pricing
Avatar video generation is billed per second of generated video. Voice and avatar products are available as addon subscriptions through the Store, billed monthly or yearly.
See Also
- Team Manifests -- Use avatars in manifests
- Avatars & Voices Overview -- General avatar concepts
- Streaming -- Real-time audio streaming