Is Read Aloud really free?

The extension is free and open-source. But good voice quality requires API keys from Google WaveNet ($4/1M chars), Amazon Polly, or Azure. Without API keys, you get basic browser voices.

Is CastReader better than Read Aloud?

For most users: yes. CastReader has better built-in voices (no API setup), paragraph highlighting, and works on Kindle. Read Aloud is better for power users who want full control over their TTS engine.

Does Read Aloud work on Kindle?

No. Read Aloud reads DOM text, which is encrypted/scrambled on Kindle Cloud Reader. CastReader uses OCR to bypass this.

Can I use my own API keys with CastReader?

No, CastReader uses its built-in Kokoro AI voices. Read Aloud is better if you want to use your own Google/Azure/OpenAI API keys.

Which has better voice quality?

CastReader out of the box (Kokoro AI, 8/10). Read Aloud with OpenAI or Azure API keys can match or exceed this, but requires setup and costs money per character.

Do they conflict if I install both?

No, they don't conflict. You can install both and use whichever works better on each site.

Read Aloud vs CastReader: Two Free TTS Extensions, Very Different Approaches

Read Aloud and CastReader are both free Chrome extensions that read webpages aloud. They're both good. They're also almost nothing alike in how they work, what they prioritize, and who they're built for. Read Aloud is open-source, minimal, and connects to almost every cloud TTS engine that exists — bring your own API key and you're running Google WaveNet or OpenAI voices at raw cost. CastReader ships with built-in Kokoro AI voices, paragraph highlighting, and 15+ dedicated site extractors including Kindle Cloud Reader. One is a toolkit. The other is an appliance.

This matters because "read aloud Chrome extension" sounds like a solved problem. It is not. The extension you pick determines whether you spend two minutes configuring API keys and debugging voice engine settings, or whether you click one button and hear the article. Neither approach is wrong. But they're different enough that picking the wrong one for your workflow means you'll uninstall it within a week.

I've used both extensively. Here's what I found.

The Philosophy Gap

Read Aloud is built like a Linux tool. It does one thing — sends text to a speech engine — and gives you complete control over which engine, what settings, how it behaves. The extension itself is open-source on GitHub, maintained by a community, and has over a million users on the Chrome Web Store. It deserves every single one of those installs. The philosophy is: we handle the text extraction and playback controls, you bring the voice.

CastReader is built like an iPhone. It ships with everything included — voices, content extraction, highlighting, platform-specific handlers — and the goal is that you never open a settings panel. Install it, click the icon, hear the page. The philosophy is: we handle everything, you just listen.

These aren't competing visions of TTS. They're competing visions of software. And your preference probably says more about how you use computers in general than how you feel about text-to-speech specifically.

Voice Quality: The Central Tradeoff

This is where most people's decision gets made, so let me be specific.

Read Aloud's default voices are your browser's built-in Web Speech API voices. On macOS, these are passable — a solid 4 out of 10. Robotic but intelligible. You can listen to an article without wincing. On Windows, they're worse. The default Microsoft voices sound like they learned English from a pronunciation dictionary and have never heard an actual conversation. Maybe a 2 out of 10. On Chrome OS, somewhere in between.

Read Aloud with API keys is a completely different product. Plug in a Google Cloud API key and you get WaveNet voices — natural cadence, proper intonation, 7 out of 10 easily. Plug in an Azure key and you get their neural TTS, which is an 8 or 9 depending on the voice. Plug in an OpenAI API key and you get voices that are genuinely hard to distinguish from human narration. The ceiling is as high as you want it to be, because you're connecting directly to the best TTS engines in the world.

The cost for this is real but small. Google WaveNet charges about $4 per million characters. A typical 2,000-word article is roughly 12,000 characters. So you're looking at about $0.05 per article. Read a hundred articles a month and that's $5. Azure and Amazon Polly are in the same ballpark. OpenAI's TTS is slightly more expensive but still pennies per article. If you already have cloud accounts, this is genuinely the most cost-effective way to get premium TTS.

CastReader's built-in voices are Kokoro AI, which runs locally in the browser. No API key, no cloud calls, no per-character billing. The quality sits at about an 8 out of 10 — noticeably better than browser voices, slightly below the best cloud offerings from Azure or OpenAI. Natural enough that you forget you're listening to synthesis after a few minutes. Not natural enough that you'd mistake it for a podcast host. For most people reading articles over breakfast, this is more than sufficient.

Here's the honest summary: if you want the absolute best voice quality and you're comfortable managing API keys, Read Aloud with Azure or OpenAI will beat CastReader. If you want good voice quality without touching a single setting, CastReader wins by a mile.

Setup and First-Run Experience

I sent both extensions to a friend last month. She's a teacher, reads a lot online, not particularly technical. Her experience:

Read Aloud: Installed it, clicked the icon on an article. It started reading. The voice was the default browser voice. She said it sounded "like a GPS." She opened the settings to find something better. There were options for speech rate, pitch, volume, voice selection, and an "engine" dropdown that listed "WebSpeech," "Google Translate," and several others that required API keys. She closed the settings. "I don't know what any of this means." She kept using it with the default voice for about three days and then stopped. "It works but the voice annoys me."

CastReader: Installed it, clicked the icon. It started reading in a Kokoro voice. She texted me: "This one sounds way better." She's still using it.

That anecdote contains the entire setup comparison. Read Aloud's power is behind a configuration wall. CastReader's quality is immediate. For the subset of users who know what a Google Cloud API key is and how to create one, Read Aloud unlocks a higher ceiling. For everyone else, CastReader's floor is higher than Read Aloud's floor.

The Read Aloud team knows this, by the way. Their documentation is solid and they've written guides for setting up each cloud provider. The problem isn't documentation quality — it's that most people who want a "read aloud Chrome extension" don't want to read documentation at all. They want to press a button.

Platform Support: Where CastReader Pulls Ahead

Read Aloud uses standard DOM text extraction. It finds the text on the page, sends it to the speech engine, done. This works on most websites. It works on blog posts, news articles, Wikipedia, documentation sites. For the standard web, it's fine.

Where it breaks down is on platforms that don't render text in the normal way.

Kindle Cloud Reader. Amazon encrypts their font rendering — the text you see on screen isn't actually selectable text in the DOM. It's rendered using custom font subsets that scramble character mappings. Read Aloud can't read it. Neither can any other extension that relies on DOM text extraction. CastReader solves this with local OCR that reads directly from the rendered page, bypassing the font encryption entirely. It's the only Chrome extension that works on Kindle.

ChatGPT and Claude. AI chat interfaces stream text dynamically, use complex nested DOM structures, and don't look like "articles" to standard extractors. Read Aloud struggles here. CastReader has dedicated extractors for AI chat pages that understand the conversation structure and read responses cleanly.

Google Docs, Notion, Substack. Each of these platforms has a custom rendering approach that confuses generic text extractors. CastReader has specific handlers for Notion, Substack, Medium, Wikipedia, arXiv, and about ten more. Read Aloud treats them all the same way — extract whatever text is in the DOM — which sometimes works and sometimes reads menus, sidebars, and metadata.

CastReader's approach is more labor-intensive to build (every new platform needs a dedicated extractor) but produces cleaner results on the platforms it supports. Read Aloud's generic approach works everywhere but excels nowhere in particular.

Paragraph Highlighting

CastReader highlights the current paragraph on the actual page and auto-scrolls to follow along. This sounds like a minor visual feature until you've used it for a week and then tried going back. You're cooking dinner, you glance at the screen, and the highlighted paragraph tells you exactly where the voice is. You look away again. Zero cognitive effort to re-orient.

Read Aloud doesn't have paragraph-level highlighting. It has a word-level highlight option, but in my testing it jitters — lands on the wrong word, corrects, jumps ahead. Distracting enough that I turned it off. The extension was designed primarily as an audio tool, not a read-along tool. If you just want audio in your ears and don't care about visual tracking, this doesn't matter. If you like to glance at the screen occasionally and see where you are, CastReader's highlighting is a meaningful advantage.

The Open-Source Advantage

Read Aloud is open-source. The code is on GitHub. Anyone can inspect it, fork it, contribute to it, or verify that it's not doing anything sketchy with your data. For privacy-conscious users, this is a genuine and important advantage.

You can see exactly what text is sent where. You can verify that your API keys aren't being logged. You can read the content extraction logic and understand why it reads a particular piece of text on a particular page. Transparency isn't just a philosophical preference — it's a practical safeguard.

CastReader is not open-source. The Kokoro voice model runs locally in your browser (no text sent to external servers for speech synthesis), but the extension code itself is proprietary. If "I need to read the source code" is a hard requirement for you, Read Aloud is the only choice. That's not a minor point. For some users, especially those in security-sensitive environments, this is the deciding factor and everything else is irrelevant.

The community around Read Aloud is also worth mentioning. Bug reports get addressed. Feature requests get discussed. There's a real ecosystem of users who care about the project and contribute to it. That kind of community sustainability is something money can't easily replicate.

Side-by-Side Comparison

Feature	Read Aloud	CastReader
Price	Free (open-source)	Free
Default voice quality	Browser voices (4/10)	Kokoro AI (8/10)
Best possible voice quality	9/10 (with OpenAI/Azure API key)	8/10 (built-in)
Setup required for good voices	Yes (API key configuration)	No
Kindle Cloud Reader	No	Yes (OCR-based)
ChatGPT / Claude	Limited	Yes (dedicated extractors)
Google Docs / Notion	Basic DOM extraction	Dedicated extractors
Paragraph highlighting	No (word-level, unreliable)	Yes (on-page, auto-scroll)
Open source	Yes	No
Voice engine options	6+ (Google, Azure, Amazon, OpenAI, etc.)	Kokoro AI (built-in)
BYOK (Bring Your Own Key)	Yes	No
Chrome Web Store users	1,000,000+	Growing
Account required	No	No
Languages	40+ (depends on engine)	40+

Who Should Use Which

Choose Read Aloud if:

You already have Google Cloud, Azure, or OpenAI API keys
You want the absolute best possible voice quality and don't mind configuration
Open-source and code transparency are requirements, not preferences
You primarily read standard blog posts and articles (no Kindle, no AI chats)
You enjoy tweaking settings and optimizing your tools

Choose CastReader if:

You want good TTS immediately with zero setup
You read on Kindle Cloud Reader
You want to listen to ChatGPT or Claude responses
Paragraph highlighting and visual tracking matter to you
You don't want to manage API keys or cloud billing

Install both if:

You want CastReader for daily reading and Kindle, but Read Aloud for those moments when you need a specific cloud voice engine for a specific task. They don't conflict. They coexist peacefully in your extension bar.

The Verdict

Read Aloud is the better tool. CastReader is the better product.

That distinction matters. Read Aloud gives you more power, more options, more control, and more transparency. If you're the kind of person who runs their own home server, uses Vim, and has opinions about shell configurations — Read Aloud is your extension. It rewards investment. The more you put into configuring it, the better it gets.

CastReader gives you a better experience out of the box. If you're the kind of person who just wants their computer to read an article while they make coffee — and that describes the vast majority of people searching for a read aloud Chrome extension — CastReader is the right choice. Kokoro AI voices that sound natural without configuration. Paragraph highlighting that keeps you oriented. Platform support that handles Kindle, AI chats, and complex web apps without you even knowing there was a problem to solve.

The good news is that both are free, both are respectful of your time (no sign-up, no trial expiration, no premium upsell), and both can live in your browser simultaneously. Install CastReader for the daily stuff. Keep Read Aloud around for the edge cases. Or pick one and commit. Either way, you're listening to the web instead of staring at it, and that's the point.

If you're still comparing options, check out our broader TTS extension comparison or the full Read Aloud review. And if you just want to try CastReader right now, it's at castreader.com/free-text-to-speech — one click install, no account needed.

Read Aloud vs CastReader: Two Free TTS Extensions, Very Different Approaches (2026)

Table of Contents