guides · Yomite Team

The Japanese TTS reader you actually own — offline, one-time purchase, no subscription

Tired of cloud TTS subscriptions that expire the moment you stop paying? Yomite is a fully offline Japanese text-to-speech reader you buy once and own forever on iPhone, iPad, and Mac.

The Japanese TTS reader you actually own — offline, one-time purchase, no subscription

You’ve got a novel you want to listen to right now. But before you hit play, there’s a wall.

“You’ve used up your synthesis credits.” “No Wi-Fi, no playback.” “We updated our model — your saved audio sounds different now.” If you’ve used a cloud-based TTS reader long enough, you’ve hit at least one of these. Probably more.

Yomite was built to route around every one of them. 100% offline. One-time purchase. No subscription. Your text never leaves your device.

The quiet cost of cloud TTS

Speechify is the most prominent TTS reader in the English-speaking world. It works. But at $139/year, the pricing has generated a consistent stream of complaints that are easy to find and verify.

“Speechify charges $139/year and the quality jumps whenever they switch model versions. You don’t own anything.” — App Store review (verified public listing)

“Every time I’m on a plane, it can’t synthesize. What’s the point of a reading app that needs a data connection?” — Reddit r/productivity (2024)

The pattern across these complaints is the same: the tool is useful, but the relationship is adversarial. You pay, and in return you get access — not ownership. Stop paying, and access disappears.

The Audible catalog gap

Audible solves a different problem: it sells professionally produced audiobooks with real voice actors. The quality is excellent. But the catalog is the catalog. If you want to listen to a web novel currently serializing on a popular site, or an EPUB you purchased from an indie author, Audible cannot help you. Professional voice recording is expensive and slow. New titles lag by months or years — if they’re recorded at all.

For readers who follow web fiction, light novels, and self-published authors, the catalog gap is the entire problem.

Offline TTS for Japanese specifically

Japanese TTS faces an additional layer of complexity. Kanji readings are context-dependent. Proper nouns — character names, place names, invented terminology — trip up every synthesizer unless you can teach it the correct reading. Cloud systems update their models without warning, which means the reading you loved last week may sound different today.

For Japanese web novels in particular, where proper nouns are dense and idiosyncratic, you need a tool you can tune once and rely on permanently.

How Yomite is different

Yomite runs all speech synthesis on-device, using Apple Silicon’s Neural Engine. The text you read is processed locally. Nothing is sent to a server. Nothing is metered.

Download a voice once. After that, you’re offline for life.

Plane, subway, mountain cabin, emergency scenario with no data signal — Yomite sounds identical in all of them.

31 anime-style character voices, 101 style variations

This is the number that matters: 31 character voices with 101 total style variations. These aren’t generic “male voice” or “female voice” options. They’re named characters with distinct personalities and multiple emotional registers.

A few examples:

VoiceCharacterNotable
凛音エル (Rinnne El)Cool, urbanStrong protagonist narration
桜音 (Sakurane)Fresh, clearSlice-of-life, romance
天深シノ (Amane Shino)Whisper registerBedtime, late-night reads
阿井田 茂 (Aida Shigeru)Baritone7 styles including shout, whisper, grief
まお (Mao)Expressive6 styles
らせつん (Rasetsun)Wide range8 styles
みちのくあいり (Michinoku Airi)Warm7 styles
れな (Rena)5 styles
コハク (Kohaku)4 styles
TANAKA5 styles

A curated set of starter voices — including まい (Mai) — is free forever. The Premium Voice Pack unlocks all 31 voices as a one-time purchase, no subscription. One purchase covers iPhone, iPad, and Mac for life. See the App Store for the current price.

Ownership is the feature

In the streaming era, “ownership” can sound quaint. It isn’t.

When you subscribe to a TTS service, you’re renting access to synthesis. Miss a payment, switch banks, let a card expire — your library pauses. The service can reprice, change the model, or shut down entirely, and you have limited recourse.

When you buy Yomite’s Premium Voice Pack, those voices are yours. They run on your device. They don’t need the internet to function. They’ll work the same in two years as they do today.

Your bookshelf should belong to you.

Choosing the right voice for the book

With 31 voices and 101 style variations, the real power is matching the voice to the material. You choose a voice and emotion style per book or chapter — set it manually at the chapter header and it holds for the entire chapter. Switch any time you want a different feel.

Some natural fits: 阿井田 茂 (Aida Shigeru) — baritone, 7 styles including shout and grief — suits heavy fantasy and serious drama. 桜音 (Sakurane) works cleanly for slice-of-life and romance. 天深シノ (Amane Shino)‘s whisper register is ideal for late-night listening. For conversation-heavy isekai where the tone shifts fast, picking a voice with multiple emotion styles — らせつん (8 styles), まお (6 styles) — gives you range within a single character’s voice.

Web novels, EPUB, and anything you can import

Yomite works with popular Japanese web novel sites, EPUB files, and saved web articles. Import your own text; Yomite reads it aloud.

(Note: Yomite is not affiliated with or endorsed by any third-party platform. Always review each site’s terms before importing content.)

The vertical reader (縦書き / tategaki) with furigana support means classical Japanese prose, Meiji-era literature, and novels with heavy kanji are readable as intended. The per-novel pronunciation dictionary lets you register correct readings for character names and invented terminology — permanently, across every book.

Getting your reading time back

Most people who say they “don’t have time to read” have more passive time than they realize.

  • 30-minute commute each way = 1 hour daily
  • Housework, cooking, cleaning = 20–30 minutes
  • Winding down before sleep = 15 minutes

That’s 90 minutes to two hours every day that can become listening time. A standard light novel runs 5–8 hours of audio at 1.0× speed. At 1.2× — which is surprisingly comfortable — you finish one in three to four days.

The sleep timer’s end-of-chapter mode stops playback at a chapter boundary, not mid-sentence. If you fall asleep, you’ll know exactly where you left off.

Honest comparison

Speechify / Cloud TTSAudibleYomite
Offline synthesisNo (requires connection)Downloaded titles only100% offline
Pricing modelAnnual subscriptionMonthly subscriptionOne-time purchase
Content catalogDocuments / your ownPublisher catalog onlyAnything you import
Voice selectionGeneric / uniformFixed per title31 voices, your choice
Text privacySent to cloudCloudOn-device only
Japanese-specific tuningLimitedN/APer-novel pronunciation dict

This isn’t a knock on Speechify or Audible. They do what they do well. But if your use case is offline Japanese text-to-speech for web fiction and self-sourced content, they’re the wrong tool.

The wedge

Every subscription you pay for TTS is money you’re spending on access, not ownership. Every cloud dependency is a single point of failure waiting for a bad moment.

Yomite is the opposite of that: a tool you buy once, that runs entirely on your device, that gets out of the way and lets you listen to whatever you want, wherever you are, with a voice that sounds like a character rather than a machine.

You own it. That’s not a marketing line. That’s the architecture.


Download Yomite on the App Store →

Questions or feedback: [email protected]

— Yomite Team