Documentation Index
Fetch the complete documentation index at: https://mintlify.com/konhi/elevenlabs-speech-to-text-api-ui/llms.txt
Use this file to discover all available pages before exploring further.
Overview
Advanced settings provide fine-grained control over the transcription process, including domain-specific terminology, entity detection, and output randomness control. These options are part of theTranscriptOptions interface:
Randomness Control
Temperature
Controls the randomness of the transcription output (0.0-2.0).
- 0.0: Deterministic, always produces the same output for the same input
- 1.0: Balanced creativity and consistency
- 2.0: Maximum randomness and creativity
Seed
Random seed for reproducible transcriptions.When a seed is provided, the model will produce the same transcription output for the same input audio, even with non-zero temperature values.Useful for:
- Debugging and testing
- Ensuring consistent results across multiple runs
- Comparing different configurations with the same randomness
Domain-Specific Configuration
Keyterms
Array of domain-specific terms, technical jargon, product names, or proper nouns that should be recognized accurately.Examples:
- Technical terms:
["API", "OAuth", "REST", "GraphQL"] - Product names:
["ElevenLabs", "Scribe API", "GPT-4"] - Company names:
["Anthropic", "OpenAI", "Google Cloud"] - Medical terms:
["hypertension", "acetaminophen"]
parseKeytermsInput utility function converts the comma-separated string to an array:
Keyterms are case-sensitive. Include variations if needed (e.g., both “API” and “api” if the speaker might say it differently).
Entity Detection
Enable detection and redaction of sensitive information in the transcription.Common values:
"pii": Personally Identifiable Information (names, addresses, phone numbers, email addresses)"phi": Protected Health Information (medical record numbers, diagnoses, treatment information)"all": All supported entity types- Custom entity types as supported by the API
Healthcare Transcription
Healthcare Transcription
For medical consultations or healthcare-related audio:This will detect and tag Protected Health Information while accurately transcribing medical terminology.
Customer Support Calls
Customer Support Calls
For customer service or support call transcription:This will identify customer information while preserving product-specific terms.
Legal Proceedings
Legal Proceedings
For legal or compliance-related transcription:Ensures reproducible, accurate transcriptions with entity detection and legal terminology.
Complete API Call Example
Here’s how all advanced settings are used in the actual API call fromspeech-to-text-playground.tsx:
Advanced Configuration Examples
Reproducible Research Transcription
Reproducible Research Transcription
For academic or research purposes requiring reproducible results:
Multi-Language Technical Content
Multi-Language Technical Content
For technical content that might contain multiple languages:
Privacy-Focused Meeting Transcription
Privacy-Focused Meeting Transcription
For meetings with sensitive information:
Creative Content with Variation
Creative Content with Variation
For content where you want some variation in transcription:
Best Practices
Keyterms Tips
- Include acronyms and their expanded forms
- Add product names, company names, and proper nouns
- Include technical jargon specific to your domain
- Keep the list focused (10-50 terms typically)
- Update the list as you discover commonly misheard terms
Temperature & Seed
- Use
temperature: 0for maximum consistency - Set a
seedvalue when you need reproducible results - Avoid temperatures above 1.0 for production use
- Test different temperature values to find the right balance
Entity Detection
- Always review auto-detected entities for accuracy
- Combine with manual review for compliance-critical use cases
- Test with sample data before processing sensitive information
- Understand your regulatory requirements (HIPAA, GDPR, etc.)
Next Steps
API Setup
Learn how to configure your ElevenLabs API key
Transcription Options
Configure basic transcription options