FunASR Engine for RealtimeSTT

FunASR is Alibaba DAMO’s industrial-grade speech recognition toolkit. It supports 50+ languages, speaker diarization, emotion detection, and streaming inference, and runs at up to 170× real-time speed. It is especially well suited to Chinese speech recognition through models such as SenseVoiceSmall and Paraformer-zh, but it works for many other languages as well.

Install

FunASR is not bundled with any RealtimeSTT extra. Install it directly:

pip install funasr
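
To confirm that the package landed in the same environment RealtimeSTT runs in, a quick check from Python can help. This is a minimal sketch using only the standard library; funasr_available is a hypothetical helper name, not part of FunASR or RealtimeSTT:

```python
import importlib.util


def funasr_available() -> bool:
    """Return True if the funasr package can be located by this interpreter."""
    # find_spec looks the package up without importing it, so the check is
    # cheap and does not load any models.
    return importlib.util.find_spec("funasr") is not None


if __name__ == "__main__":
    if funasr_available():
        print("funasr is installed")
    else:
        print("funasr is missing; run: pip install funasr")
```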

Engine Name

Pass "funasr" as the transcription_engine parameter:

transcription_engine="funasr"

Basic Usage

CUDA:

from RealtimeSTT import AudioToTextRecorder

recorder = AudioToTextRecorder(
    transcription_engine="funasr",
    model="iic/SenseVoiceSmall",
    device="cuda",
)

CPU:

from RealtimeSTT import AudioToTextRecorder

recorder = AudioToTextRecorder(
    transcription_engine="funasr",
    model="iic/SenseVoiceSmall",
    device="cpu",
)
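
When the same script has to run on machines with and without a GPU, the device string can be chosen at start-up instead of being hard-coded. The sketch below assumes PyTorch is installed alongside FunASR and falls back to "cpu" if it is not; pick_device and make_recorder are hypothetical helper names introduced for illustration only:

```python
def pick_device() -> str:
    """Return "cuda" when PyTorch reports a usable GPU, otherwise "cpu"."""
    try:
        import torch
    except ImportError:
        # Assumption: without PyTorch there is no GPU path, so use the CPU.
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


def make_recorder():
    """Create a recorder on whichever device pick_device() selects."""
    # Imported lazily so pick_device() can be used without RealtimeSTT installed.
    from RealtimeSTT import AudioToTextRecorder

    return AudioToTextRecorder(
        transcription_engine="funasr",
        model="iic/SenseVoiceSmall",
        device=pick_device(),
    )


if __name__ == "__main__":
    print(f"Selected device: {pick_device()}")
```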

Model Selection

Known model names such as SenseVoiceSmall, Fun-ASR-Nano, and Paraformer-zh are downloaded automatically through ModelScope when first used. Pass the model name or a full ModelScope repository path:

recorder = AudioToTextRecorder(
    transcription_engine="funasr",
    model="iic/SenseVoiceSmall",
    device="cuda",
)

Configuration Options

The following RealtimeSTT parameters map directly to FunASR AutoModel arguments:

RealtimeSTT parameter	FunASR mapping
`model`	`model`
`device`	`device`
`beam_size`	`beam_size`
`batch_size`	`batch_size`
`transcription_engine_options: {"vad_filter": bool, "vad_model": str}`	`vad_model`
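
The mapping in the table can be collected in one place before constructing the recorder. The following is a sketch only: funasr_recorder_kwargs is a hypothetical helper, and the default values are illustrative assumptions rather than defaults documented by RealtimeSTT or FunASR. It builds a plain dictionary and leaves out any option that was not set:

```python
from typing import Any, Dict, Optional


def funasr_recorder_kwargs(
    model: str = "iic/SenseVoiceSmall",
    device: str = "cpu",
    beam_size: Optional[int] = None,
    batch_size: Optional[int] = None,
    vad_model: Optional[str] = None,
) -> Dict[str, Any]:
    """Assemble keyword arguments for AudioToTextRecorder with the funasr engine."""
    kwargs: Dict[str, Any] = {
        "transcription_engine": "funasr",
        "model": model,
        "device": device,
    }
    # Only forward tuning options the caller actually set.
    if beam_size is not None:
        kwargs["beam_size"] = beam_size
    if batch_size is not None:
        kwargs["batch_size"] = batch_size
    # The VAD settings travel together inside transcription_engine_options.
    if vad_model is not None:
        kwargs["transcription_engine_options"] = {
            "vad_filter": True,
            "vad_model": vad_model,
        }
    return kwargs


if __name__ == "__main__":
    print(funasr_recorder_kwargs(device="cuda", beam_size=5, vad_model="fsmn-vad"))
```

The resulting dictionary can then be unpacked into the constructor, for example AudioToTextRecorder(**funasr_recorder_kwargs(device="cuda")).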

VAD Integration

To use FunASR’s built-in VAD model, pass vad_filter and vad_model together via transcription_engine_options:

recorder = AudioToTextRecorder(
    transcription_engine="funasr",
    model="iic/SenseVoiceSmall",
    device="cuda",
    transcription_engine_options={
        "vad_filter": True,
        "vad_model": "fsmn-vad",
    },
)

Notes and Limitations

The FunASR integration is still under active development. If you encounter an issue, please open a GitHub issue on the RealtimeSTT repository.

For more details about FunASR itself, see the official FunASR GitHub repository.
