Japanese Voice Studio

Design a voice.
Generate Japanese speech.

Describe a voice in plain words or clone one from a clip, then render studio-quality Japanese speech — characters, full dialogue scenes, sound effects, and more, right in your browser.

No sign-up · Runs in your browser · Free

Samples

Hear it

Real output from the model — the same quality you get in the Studio.

To isolate the effect of captions, the samples within each group below were generated with the same text and the same random seed. The variations in delivery come purely from the different captions.
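For the curious, here is a minimal sketch of why a fixed seed makes the comparison fair. It is a toy illustration, not the Studio's actual code: `toy_generate` and `sample_noise` are hypothetical stand-ins, and the seed value is arbitrary. The point is only that a seeded generator reproduces the same random draw, so the caption is the one thing that changes between two runs.

```python
import random


def sample_noise(seed: int, n: int = 4) -> list[float]:
    """Draw the random part of a generation from a seeded generator."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(n)]


def toy_generate(text: str, caption: str, seed: int) -> dict:
    """Hypothetical stand-in for a TTS call (NOT the Studio's real API).

    A real model would turn text + caption + noise into audio; here we
    just return the inputs so the comparison can be inspected.
    """
    return {"text": text, "caption": caption, "noise": sample_noise(seed)}


SEED = 1234  # arbitrary value chosen for illustration
text = "本日はお越しいただき、誠にありがとうございます。"

a = toy_generate(text, "落ち着いた大人の男性。", SEED)
b = toy_generate(text, "若く元気な女性の声。", SEED)

# Same seed gives an identical random draw; only the caption differs.
same_noise = a["noise"] == b["noise"]
captions_differ = a["caption"] != b["caption"]
print(same_noise, captions_differ)
```

With a different seed for each sample, the random draw would differ too, and you could no longer tell which changes came from the caption.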

Pure voice design

Text + Caption

Generate diverse voices and styles purely through descriptive text captions — no reference audio needed.

Text 1

本日はお越しいただき、誠にありがとうございます。どうぞごゆっくりお過ごしください。

Thank you so much for coming today. Please make yourself at home.

1

落ち着いた大人の男性。フォーマルな場で、深く響く声で丁寧かつ歓迎の意を込めて話している。

Calm adult man — formal, deep resonant voice, polite and welcoming.

2

若く元気な女性の声。カフェの店員のように、明るくハキハキとした少し高めのトーンで話している。

Young, upbeat woman — like a café clerk, bright and crisp, slightly high tone.

Text 2

すみません!この近くにコンビニってありますか?ちょっと急いでて、道に迷っちゃったみたいで

Excuse me! Is there a convenience store nearby? I'm in a hurry and seem to have gotten lost.

1

低めの声の男性が、丁寧に道を尋ねている。穏やかで礼儀正しく、余裕のある口調。

Low-voiced man asking directions politely — calm, courteous, composed.

2

若い女性が、慌てた様子で早口に話している。焦りと不安が声ににじんでいる。

Young woman speaking fast and flustered, with urgency and anxiety seeping into her voice.

Toolkit

The whole studio, end to end

Scrub the signal chain — from designing a voice to producing the scene and keeping it organized. Every tool, one tab.

Workflow

How it works

Three steps from blank page to finished clip.

01

Type your Japanese

Paste or write a line. Built-in translation and furigana help you get the reading right.

02

Design or clone a voice

Describe the voice in plain words, or upload a short reference clip to clone its character.

03

Generate & refine

Render studio-quality speech in seconds, then trim, mix, and export — all in the browser.

Who it’s for

Made for creators

Indie game & visual-novel devs · VTubers & streamers · Dubbers & video creators · Language learners · Writers & hobbyists

Principles

Free, private, open

Free & open

Built on open models — Irodori-TTS for voice and Stable Audio for sound effects.

Private by default

Your voices, scripts, and notes live in your browser — not on a server. Generation runs transiently and isn’t stored.

No install

Everything runs in the browser, including the AI noise removal. Nothing to download or set up.

Get started

Ready to create?

Open the studio and generate your first line in seconds — free, no sign-up.