Skip to main content

Avatar & Voice Service

MeetLoyd provides its own built-in avatar and voice service. There are no third-party providers involved -- MeetLoyd is the sole provider for all avatar and voice capabilities on the platform.

Avatar Types

TypeDescriptionBest For
Image-basedCreated from a single photograph or AI-generated portraitQuick setup, consistent branding, lower resource usage
Video-basedCreated from video footage with natural expressions and movementsMost realistic results, premium client-facing experiences

Both types support AI lip-sync, multiple languages, and custom backgrounds.

Voice Profiles

Every agent can have a voice profile that gives it a distinctive speaking voice. Voice profiles come in two flavors:

  • Cloned -- Upload a WAV sample through the voice onboarding flow to create a custom voice that matches a specific speaker.
  • Library -- Select from MeetLoyd's built-in voice library with a range of tones, accents, and languages.

Voice profiles are stored per tenant and linked to individual agents. Each agent can have at most one voice profile assigned.

How It Works

Agent generates text response --> Avatar service selects avatar + voice settings --> AI lip-sync + voice synthesis --> Video or audio returned to client

The agent writes its response as text. MeetLoyd's avatar service handles everything else -- voice synthesis, lip syncing, and video rendering.

Video Status Lifecycle

StatusDescription
PendingJob queued, waiting to start
ProcessingVideo is being generated
CompletedVideo is ready for playback
FailedGeneration failed (check logs)

Best Practices

Keep Video Messages Concise

Shorter messages generate faster and cost less. Aim for under 30 seconds of speech.

Match Language to Audience

Set the language to match your users. Avatars and voices support multiple languages.

Show Text First, Then Video

For the best user experience, display the text response immediately and load the avatar video asynchronously when it finishes rendering.

Credential Security

  • All avatar and voice data is encrypted at rest using AES-256
  • Voice samples uploaded for cloning are stored securely per tenant
  • All data is transmitted over HTTPS only

Pricing

Avatar video generation is billed per second of generated video. Voice and avatar products are available as addon subscriptions through the Store, billed monthly or yearly.


See Also