Text to Speech Online Free
Paste any text and hear it read aloud instantly using your browser's built-in voice engine. Choose voice, speed, and pitch — no upload, no signup, runs entirely in your browser.
Loading voices… if no voices appear, try refreshing the page.
How to Use the Text-to-Speech Tool
- 1Paste or type any text into the box — articles, emails, notes, summaries, or any document you want to hear.
- 2Choose a voice from the dropdown. Voices are grouped by language and sourced directly from your operating system and browser.
- 3Adjust speed (0.5× to 2×) and pitch to your preference. These are locked during playback — set them before pressing Play.
- 4Press Play to start. Use Pause to hold your place and Resume to continue. Stop resets to the beginning.
What Is Speech Synthesis?
Speech synthesis (text-to-speech, or TTS) is the process of converting written text into spoken audio. Modern browsers expose this capability through the Web Speech API, which routes text to the operating system's speech engine — the same engine used by screen readers and voice assistants. The voices you see in the dropdown are the same high-quality voices available to accessibility tools on your device.
Because everything runs inside the browser, there is no audio file to upload or download, no account to create, and no data sent to an external server when using offline voices. It is an ideal tool for proofreading by ear, creating personal voice notes, following along with long documents, or improving accessibility for any piece of text.
Tips for the Best Listening Experience
Proofread by Ear
Listening to your own writing reveals errors that eyes skip over — repeated words, awkward sentences, and punctuation mistakes become obvious when heard aloud. Set speed to 0.9× for comfortable proofreading pace.
Pick the Right Voice
System voices vary widely in quality. On Windows, try Microsoft Zira or Aria (if available). On macOS, Samantha and Daniel sound very natural. In Chrome, Google UK English Female is a popular choice.
Speed Up for Skimming
Set speed to 1.4×–1.6× to skim long articles or reports quickly. After listening once at speed, replay key sections at normal rate for detail. This works especially well with document summaries.
Combine with the Summarizer
Use the Document Summarizer to condense a long article, then paste the summary here for a quick audio briefing. A 2,000-word article becomes a 90-second listen at 1.0× speed.
Use Offline Voices for Privacy
System voices (not labelled as "online" or "Google") process text entirely on your device. Choose these when reading private documents, contracts, or sensitive personal notes.
Adjust Pitch for Fatigue
Slightly lowering pitch (0.8–0.9) can make long listening sessions feel less fatiguing. Higher pitch (1.1–1.2) can help with clarity when listening to technical or dense text.
Frequently Asked Questions
How does the text-to-speech tool work?
The tool uses the Web Speech API built into your browser — specifically the SpeechSynthesis interface — to convert text into spoken audio. Your browser hands the text to the operating system's speech engine, which uses the voices installed on your device to generate audio. No audio files are created on a server, and no data ever leaves your browser.
Which voices are available?
The voice list shows every voice installed on your operating system and browser. On Windows, this typically includes Microsoft Zira, David, and Mark (English) plus voices for other installed languages. On macOS it includes Siri-quality voices like Samantha, Alex, and many more. Chrome on any platform adds Google's online voices (marked with an accent in the name) which sound especially natural when you have an internet connection.
Is my text sent to a server?
For built-in system voices (those without a network indicator in the name), speech is generated entirely on your device — no data leaves your browser. If you select a Google online voice in Chrome, the text is sent to Google's speech service to generate audio. Stick to offline system voices if you are reading sensitive or private text.
What do the Speed and Pitch controls do?
Speed (rate) controls how fast the voice reads — 1.0× is the default natural pace. Below 1.0× slows it down (useful for learning or transcription), above 1.0× speeds it up (useful for quickly skimming long documents). Pitch adjusts the tone of the voice: lower values produce a deeper sound, higher values a lighter, higher-pitched sound. Both controls are locked during playback — stop first, then adjust, and press Play again.
Why does the voice stop or restart partway through long text?
Some browsers impose a character limit per speech utterance (Chrome limits utterances to roughly 200–300 words). The tool automatically splits your text into chunks of 200 words and queues them sequentially, so playback continues seamlessly across the whole document. You may notice a very brief pause between chunks — this is normal.
Can I pause and resume without losing my place?
Yes. The Pause button suspends speech at the current word. Resume continues from exactly where it stopped. The Stop button cancels playback entirely — pressing Play after stopping restarts from the beginning of the text.
Which browsers support this tool?
The Web Speech API (SpeechSynthesis) is supported in Chrome (desktop and Android), Edge, Safari (desktop and iOS), and Firefox. Internet Explorer and very old browser versions are not supported. If you see an unsupported message, try opening the page in Chrome or Edge.
Can I use this to create audio files or voice notes?
The tool plays audio directly in your browser — it does not export an audio file. To save the output as an MP3 or WAV, you can use your operating system's audio capture feature (e.g., the Windows Sound Recorder set to record the system audio output, or Audio Hijack on macOS) while the tool is playing.