PublicSoftTools

Speech to Text Online Free

Convert speech to text in real time using your microphone. Supports 15 languages with live transcription. No signup, no upload — uses your browser's built-in voice recognition.

Press Start Recording, then speak

How Speech to Text Works

  1. 1Select your language from the dropdown. The default is English (US).
  2. 2Click Start Recording. Your browser will ask for microphone permission — allow it.
  3. 3Speak clearly. Words appear in real time — grey text is interim (being processed), black text is finalised.
  4. 4Click Stop when done, then Copy text to paste your transcript anywhere.

Powered by the Web Speech API

This tool uses the browser's native Web Speech API — no external library or API key required. Chrome and Edge route the audio through Google's speech recognition infrastructure; Safari uses Apple's on-device recognition. No data passes through PublicSoftTools' servers. Accuracy is equivalent to the dictation feature built into your operating system.

Use Cases

Meeting Notes

Dictate meeting notes hands-free and copy the transcript into your note-taking app or email in seconds.

Accessibility

Useful for anyone who types slowly or has difficulty using a keyboard — just speak and get text.

Content Drafting

Dictate first drafts of articles, emails, or social media posts faster than typing, then edit the transcript.

Language Practice

Check your pronunciation by speaking in a foreign language — if the transcription is correct, the recognition engine understood you.

Frequently Asked Questions

Which browsers support speech to text?

The tool uses the Web Speech API, which is supported in Chrome, Edge, and Safari. Firefox does not currently support the Web Speech API. For the best experience, use the latest version of Chrome or Edge on desktop or Android, or Safari on iOS.

Which languages are supported?

The tool supports 15 languages: English (US and UK), Spanish, French, German, Italian, Portuguese (Brazilian), Arabic, Chinese (Simplified), Japanese, Korean, Hindi, Russian, Turkish, and Dutch. Select your language from the dropdown before starting.

Is my voice recorded or sent to a server?

The transcription uses your browser's built-in Web Speech API. On Chrome and Edge, the audio is processed by Google's speech recognition servers, which is how the browser's native API works. On Safari, it is processed on-device by Apple. No data is sent to PublicSoftTools' servers.

Can I transcribe a long recording?

Yes. The tool uses continuous recognition mode, so it keeps transcribing as long as you are speaking and have not pressed Stop. For very long sessions, pause and resume as needed. There is no enforced time limit, though browser tab focus and microphone permissions must remain active.

Why does it stop after a few seconds of silence?

The Web Speech API automatically pauses recognition after a period of silence as a browser-level behaviour. Click Resume to continue from where you left off — your transcript is preserved.

How do I improve transcription accuracy?

Speak clearly at a moderate pace in a quiet environment. Use a good-quality microphone and position it close to your mouth. Select the correct language before starting — using the wrong language will produce garbled output. Avoid background music or TV during transcription.