Neiroha Wiki

Neiroha

AI audio middleware and dubbing workstation

Neiroha is a Flutter desktop application that connects one unified UI to local or cloud TTS backends. It brings voice characters, voice banks, long-form reading, dialogue dubbing, video dubbing, and an OpenAI-compatible HTTP API into one workflow.

Documentation Entry Points

Feature Overview

| Module | What It Does |
| --- | --- |
| Providers | Connects TTS backends, including OpenAI-compatible services, Azure, GPT-SoVITS, CosyVoice, VoxCPM2, Gemini, and Windows system voices. |
| Voice Characters | Bind a provider, model, voice, speed, mode, and optional reference audio into a reusable character. |
| Voice Banks | Group multiple characters and expose them to projects and the API model list. |
| Quick TTS | Single-character test synthesis; generated files are archived for later reuse and cleanup. |
| Dialogue TTS | Multi-character dialogue projects with chat-style lines and per-line audio. |
| Phase TTS | Long-form script splitting, per-segment character assignment, and batch synthesis. |
| Novel Reader | Import TXT files or folders, synthesize missing audio on demand, cache output, prefetch, and keep reading across screens. |
| Video Dubbing | Import video, audio, and subtitles, synthesize speech per cue, and export audio or dubbed video on desktop platforms. |
| Settings / Tasks | Inspect shared TTS queue state, provider limits, API logs, storage, and media tool configuration. |
| Local API | Expose an OpenAI-compatible TTS endpoint for scripts, games, DAWs, and other tools. |
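
As a rough illustration of the Local API row, the sketch below shows how a script might call an OpenAI-compatible speech endpoint using only the Python standard library. The request shape follows OpenAI's `/v1/audio/speech` convention (`model`, `input`, `voice`, `response_format`). The base URL, port, API key handling, and the mapping of `model` to a voice bank and `voice` to a voice character are assumptions, not confirmed Neiroha behavior; check the actual values in the app's settings.

```python
import json
import urllib.request

# Assumption: the address and port of the local API are placeholders.
BASE_URL = "http://127.0.0.1:8080/v1"


def build_speech_request(text, model, voice, base_url=BASE_URL, api_key="not-needed"):
    """Build a POST request shaped like OpenAI's /audio/speech endpoint."""
    payload = {
        "model": model,            # assumed to name a voice bank
        "input": text,             # the text to synthesize
        "voice": voice,            # assumed to name a voice character
        "response_format": "wav",  # assumed to be accepted by the server
    }
    return urllib.request.Request(
        url=f"{base_url}/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def synthesize(text, model, voice, out_path, **kwargs):
    """Send the request and write the returned audio bytes to out_path."""
    request = build_speech_request(text, model, voice, **kwargs)
    with urllib.request.urlopen(request, timeout=60) as response:
        audio = response.read()
    with open(out_path, "wb") as f:
        f.write(audio)
    return out_path
```

A call such as `synthesize("Hello from a script.", "my-voice-bank", "narrator", "hello.wav")` would then write the synthesized audio to disk, where `my-voice-bank` and `narrator` are hypothetical names. Keeping request construction separate from sending makes the payload easy to inspect before anything goes over the network.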

Platform Scope

Neiroha treats platform support as concrete capability boundaries, not as a promise that every platform exposes identical native features.

| Capability | Windows | Linux | Android phones / tablets |
| --- | --- | --- | --- |
| Release build | ✓ | ✓ | ✓ |
| Main UI and project management | ✓ | ✓ | ✓ |
| Cloud TTS backend connection | ✓ | ✓ | ✓ |
| Same-device local inference backend connection | ✓ | ✓ | - |
| LAN-hosted local inference backend connection | ✓ | ✓ | ✓ |
| Windows SAPI system voice | ✓ | - | - |
| External FFmpeg CLI detection and invocation | ✓ | ✓ | - |
| Video dubbing export, muxing, trimming, and waveform extraction | ✓ | ✓ | - |

✓ means the capability is covered by current documentation and release builds. - means it is not currently promised or exposed in the UI.

System TTS currently uses Windows SAPI only. Android and Linux system TTS should only appear in the UI after native adapters exist.
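
One way to keep such capability boundaries explicit in tooling is to encode the table as data and query it before showing a feature. The sketch below is illustrative only and is not taken from Neiroha's codebase: the capability identifiers are hypothetical, and the per-platform values assume that each `-` in the table applies to the Android column (and also to the Linux column for Windows SAPI).

```python
from enum import Enum


class Platform(Enum):
    WINDOWS = "windows"
    LINUX = "linux"
    ANDROID = "android"


_ALL = frozenset(Platform)
_DESKTOP = frozenset({Platform.WINDOWS, Platform.LINUX})

# Hypothetical capability names; values mirror the assumed reading of the table.
CAPABILITIES = {
    "release_build": _ALL,
    "main_ui": _ALL,
    "cloud_tts": _ALL,
    "same_device_local_inference": _DESKTOP,
    "lan_local_inference": _ALL,
    "windows_sapi_voice": frozenset({Platform.WINDOWS}),
    "ffmpeg_cli": _DESKTOP,
    "video_dubbing_export": _DESKTOP,
}


def is_supported(capability, platform):
    """Return True if the capability is listed for the given platform."""
    return platform in CAPABILITIES[capability]


def visible_capabilities(platform):
    """Return the sorted capability names that a UI could expose on a platform."""
    return sorted(
        name for name, platforms in CAPABILITIES.items() if platform in platforms
    )
```

With this layout, `is_supported("windows_sapi_voice", Platform.LINUX)` is false until the entry is widened, which matches the rule that system TTS on other platforms should appear only once a native adapter exists.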