Avatar & Voice Service

MeetLoyd provides its own built-in avatar and voice service. There are no third-party providers involved -- MeetLoyd is the sole provider for all avatar and voice capabilities on the platform.

Avatar Types

Type	Description	Best For
Image-based	Created from a single photograph or AI-generated portrait	Quick setup, consistent branding, lower resource usage
Video-based	Created from video footage with natural expressions and movements	Most realistic results, premium client-facing experiences

Both types support AI lip-sync, multiple languages, and custom backgrounds.

Voice Profiles

Every agent can have a voice profile that gives it a distinctive speaking voice. Voice profiles come in two flavors:

Cloned -- Upload a WAV sample through the voice onboarding flow to create a custom voice that matches a specific speaker.
Library -- Select from MeetLoyd's built-in voice library with a range of tones, accents, and languages.

Voice profiles are stored per tenant and linked to individual agents. Each agent can have at most one voice profile assigned.

How It Works

Agent generates text response --> Avatar service selects avatar + voice settings --> AI lip-sync + voice synthesis --> Video or audio returned to client

The agent writes its response as text. MeetLoyd's avatar service handles everything else -- voice synthesis, lip syncing, and video rendering.

Video Status Lifecycle

Status	Description
Pending	Job queued, waiting to start
Processing	Video is being generated
Completed	Video is ready for playback
Failed	Generation failed (check logs)

Best Practices

Keep Video Messages Concise

Shorter messages generate faster and cost less. Aim for under 30 seconds of speech.

Match Language to Audience

Set the language to match your users. Avatars and voices support multiple languages.

Show Text First, Then Video

For the best user experience, display the text response immediately and load the avatar video asynchronously when it finishes rendering.

Credential Security

All avatar and voice data is encrypted at rest using AES-256
Voice samples uploaded for cloning are stored securely per tenant
All data is transmitted over HTTPS only

Pricing

Avatar video generation is billed per second of generated video. Voice and avatar products are available as addon subscriptions through the Store, billed monthly or yearly.

Avatar Types​

Voice Profiles​

How It Works​

Video Status Lifecycle​

Best Practices​

Credential Security​

Pricing​

See Also​