Skip to main content

Documentation Index

Fetch the complete documentation index at: https://mintlify.com/konhi/elevenlabs-speech-to-text-api-ui/llms.txt

Use this file to discover all available pages before exploring further.

ElevenLabs Speech-to-Text API UI

A fast, modern web interface for transcribing audio and video files using the ElevenLabs Scribe API. Built with Bun, React, and Tailwind CSS, this application provides a complete playground for testing and exploring ElevenLabs’ powerful speech-to-text capabilities.

Quickstart

Get up and running in minutes with your first transcription

Installation

Detailed setup instructions for Bun and project dependencies

Key features

This application provides a comprehensive interface for the ElevenLabs Speech-to-Text API with the following capabilities:

Advanced transcription options

  • Multiple models - Choose between Scribe V1 and Scribe V2
  • Speaker diarization - Automatically detect and label different speakers in your audio
  • Timestamp granularity - Get word-level or character-level timestamps for precise alignment
  • Multi-channel support - Process audio files with multiple channels
  • Audio event tagging - Detect and tag non-speech audio events

Customization and control

  • Language detection - Specify language codes for better accuracy
  • Custom keyterms - Provide domain-specific terms for improved recognition
  • Entity detection - Identify PII, PHI, or other entities in transcripts
  • Temperature control - Fine-tune the randomness of transcription output
  • Deterministic output - Use seed values for reproducible results

Interactive transcript viewer

  • Synchronized playback - Click words to jump to specific timestamps in the audio
  • Speaker labeling - Rename speakers for better readability
  • Visual timeline - See the full transcript with speaker segments
  • Audio scrubbing - Navigate through your audio with a visual scrub bar

Technology stack

This project is built with modern web technologies optimized for performance:
  • Bun - All-in-one JavaScript runtime with built-in dev server and hot reload
  • React 19 - Latest version of React for building the UI
  • Tailwind CSS - Utility-first CSS framework
  • Radix UI - Accessible component primitives
  • @elevenlabs/elevenlabs-js - Official ElevenLabs JavaScript SDK

Get started

Follow the quickstart

Learn how to transcribe your first audio file in under 5 minutes

View installation guide

Complete setup instructions and verification steps

Build docs developers (and LLMs) love