Speech to Text Converter

Descript instantly turns speech into text in real time. Just start recording and watch our AI speech recognition transcribe your voice—with 95% accuracy—into text that’s ready to edit or export.

Get started →

How to automatically convert speech to text with Descript

Step 1

Start a recording session or upload voice audio

Create a project in Descript, select record, and choose your microphone input to start a recording session. Or upload a voice file to convert the audio to text.

Step 2

Talk and let the AI transcribe

As you speak into your mic, Descript’s speech-to-text software turns what you say into text in real time. Don’t worry about filler words or mistakes; Descript makes it easy to find and remove those from both the generated text and recorded audio.

Step 3

Edit and export your text

Enter Correct mode (press the C key) to edit, apply formatting, highlight sections, and leave comments on your speech-to-text transcript. Filler words will be highlighted, which you can remove by right clicking to remove some or all instances. When ready, export your text as HTML, Markdown, Plain text, Word file, or Rich Text format.