Skip to main content

Voice Setup

Enable your AI persona to speak with your voice, creating an even more authentic connection with your audience.

Voice Features Overview

FeatureDescription
Voice CloningAI learns your unique voice
Audio ResponsesPersona speaks responses aloud
Voice ConversationsReal-time voice chat
Voice MessagesAsync audio replies

Voice Cloning

How It Works

Your persona learns your voice from:

  • Video audio tracks
  • Podcast episodes
  • Voice recordings
  • Any audio content you provide

The AI captures:

  • Your tone and pitch
  • Speech patterns
  • Pronunciation
  • Pacing and rhythm
  • Emotional expression
Minimum 30 Minutes of Audio Required

Voice cloning quality depends heavily on the amount and variety of audio you provide. Uploading less than 30 minutes of audio will produce a noticeably lower-quality voice clone that may sound robotic or unlike you.

Audio Requirements

RequirementSpecification
Total DurationMinimum 30 minutes recommended
QualityClear audio, minimal background noise
VarietyDifferent topics and emotions
FormatMP3, WAV, M4A, or video with audio

Recording Best Practices

For Best Voice Quality:

  • Use a quality microphone
  • Record in a quiet space
  • Speak naturally (don't read robotically)
  • Include emotional variety
  • Cover your typical topics

Voice Training Process

  1. Upload Audio

    • Go to Voice > Voice Training
    • Upload audio files or connect content
    • Processing begins automatically
  2. Review Samples

    • Listen to AI-generated samples
    • Compare to your actual voice
    • Note any issues
  3. Fine-Tune

    • Adjust voice settings
    • Add more audio if needed
    • Iterate until satisfied
  4. Approve

    • Confirm voice quality
    • Enable for your persona
    • Set availability options

Voice Quality Scores

ScoreQualityAction
90-100ExcellentReady to use
75-89GoodMinor differences
60-74FairConsider more training
Below 60Needs workAdd more/better audio

Voice Response Settings

When to Use Voice

SettingDescription
AlwaysEvery response includes audio
On RequestOnly when fan asks
Auto-DetectBased on conversation context
NeverText only

Response Modes

ModeHow It Works
Text + AudioBoth text and voice provided
Audio OnlyVoice response, no text shown
Text FirstText immediate, audio follows
Audio FirstVoice plays, text appears

Audio Length Limits

SettingMax DurationBest For
Brief30 secondsQuick answers
Standard2 minutesMost responses
Extended5 minutesDetailed explanations
UnlimitedNo limitSpecial content

Voice Conversation Mode

Real-Time Voice Chat

Enable fans to have voice conversations:

How It Works:

  1. Fan initiates voice chat
  2. They speak their message
  3. Persona responds with voice
  4. Natural back-and-forth conversation

Voice Chat Settings

SettingOptions
AvailabilityAlways, Scheduled, Subscriber Only
Turn DetectionAutomatic, Manual
Response DelayInstant, Natural pause
Interruption HandlingAllow, Queue, Disable

Voice Chat Quality

Optimize for best experience:

FactorRecommendation
LatencyAim for under 1 second
Audio QualityHigh bitrate when possible
FallbackText backup if audio fails

Voice Messages

Asynchronous Voice

Fans can send voice messages:

  1. Fan records voice message
  2. AI transcribes and understands
  3. Persona responds with voice
  4. Fan listens when convenient

Voice Message Settings

SettingDescription
Max LengthHow long fans can record
Auto-PlayResponse plays automatically
TranscriptionShow text of fan's message

Voice Customization

Voice Parameters

Fine-tune your cloned voice:

ParameterRangeEffect
Speed0.5x - 2.0xSpeaking pace
Pitch-10 to +10Voice pitch adjustment
ExpressivenessLow to HighEmotional range
ClarityStandard to EnhancedArticulation

Emotion Settings

Configure emotional expression:

EmotionWhen Used
NeutralStandard responses
ExcitedPositive topics, enthusiasm
ThoughtfulComplex questions
EmpatheticSupportive moments
PlayfulHumorous content

Pronunciation Guides

Add custom pronunciations:

  1. Go to Voice > Pronunciation
  2. Click Add Word
  3. Enter word and phonetic guide
  4. Test in preview
  5. Save

Common Uses:

  • Your name
  • Brand names
  • Technical terms
  • Inside jokes

Multilingual Voice

Multiple Languages

If you speak multiple languages:

  1. Upload audio in each language
  2. Train voice separately
  3. Set language detection
  4. Persona switches automatically

Language Settings

SettingDescription
PrimaryDefault language
SecondaryAdditional languages
DetectionAuto-detect fan's language
FallbackLanguage if detection fails

Voice Analytics

Voice Usage Metrics

MetricDescription
Voice RequestsHow often voice is used
Completion RateAudio listened to completion
PreferenceVoice vs. text preference
Quality FeedbackFan ratings

Viewing Analytics

  1. Go to Analytics > Voice
  2. See usage trends
  3. Review quality metrics
  4. Identify improvement areas

Voice Privacy and Safety

Content Filtering

Voice responses filtered for:

  • Inappropriate content
  • Personal information
  • Boundary violations

Voice Watermarking

Optional invisible watermark:

  • Identifies AI-generated audio
  • Helps prevent misuse
  • Transparent to listeners
AI Transparency Required

Fans must always be informed that they are interacting with an AI-generated voice, not the real creator. This is both a platform requirement and a legal best practice in many jurisdictions.

Configure consent requirements:

SettingDescription
NoticeInform fans it's AI voice
Opt-InFans must enable voice
TermsLink to voice usage policy

Troubleshooting

Common Issues

IssueCauseSolution
Voice sounds roboticNot enough trainingAdd more diverse audio
Wrong pronunciationMissing pronunciation guideAdd custom pronunciations
Audio cuts offLength limitsAdjust response length
Quality variesInconsistent trainingUse consistent audio quality

Voice Quality Tips

  • Upload at least 30 minutes of audio
  • Use varied content (different emotions, topics)
  • Ensure clean audio without background noise
  • Include natural speech, not just reading
  • Update periodically with new content

Best Practices

Creating Natural Voice

  • Train with conversational audio
  • Include emotional variety
  • Use authentic content
  • Test with real questions

Fan Experience

  • Set appropriate expectations
  • Provide text alternative
  • Monitor feedback
  • Iterate based on usage

Next Steps

  1. Publish Your Persona - Launch with voice
  2. Fan Engagement - Engage with voice
  3. Monetization - Premium voice features