Skip to main content

Quick TTS

Quick TTS is for one-character checks and short audio generation. It appears in the Voice Bank character inspector and should be the first validation step after configuring a provider and character.

Quick TTS page

When to Use It

ScenarioPurpose
New providerVerify URL, API key, model, and voice with one sentence.
New characterConfirm the provider binding and task mode.
Voice design tuningCompare voice instruction changes quickly.
Free quota testingAvoid spending many requests through Dialogue or Phase batches.

Basic Steps

  1. Select a voice bank.
  2. Select a character that is bound to a configured provider.
  3. Check the provider, model, voice, and task mode in the right panel.
  4. Type short text in the Quick Test input.
  5. Click the purple generate button.
  6. The synthesis job enters the shared TTS queue.
  7. When complete, the audio is saved to disk and played automatically.

First Test Sentence

Start with a short sentence instead of a long paragraph:

Hello, this is a short Neiroha voice test.

For multilingual or Chinese voices, use a short sentence in the target language. For Gemini or MiMo voice design, keep the synthesis text short and put style requirements in the character's voice instruction.

Output Archive

Quick TTS results are stored in the Quick TTS archive for reuse, cleanup, and storage scans. On Windows the default voice asset root is:

%APPDATA%\com.neiroha.neiroha\voice_asset\quick_tts\

The voice asset root can be changed from Settings -> Storage.

Troubleshooting Order

SymptomCheck First
Fails immediately after clickThe character must bind to an enabled provider.
401 / 403Cloud API key, Azure region, or MiMo key is wrong.
404Base URL may include or omit /v1 incorrectly.
429Provider limit is too high, or the free quota has been reached.
Stays queuedProvider max concurrency is 0, or a previous job is blocking the queue.
Audio is generated but silentCheck system volume, audio format support, and player permission.

After Quick TTS passes, continue to Dialogue TTS or Phase TTS.