Skip to main content

Phase TTS

Phase TTS is designed for long narration, audiobooks, and scripts that need paragraph-by-paragraph handling.

Phase TTS page

Suitable Text

Text TypeFit
Narration manuscriptGood fit; split by blank lines or paragraph boundaries.
Novel chapter with a small amount of dialogueGood fit; split first, then assign characters manually.
Heavy multi-character dialoguePrefer Dialogue TTS and clean speaker labels before import.
Very long bookSplit by chapter and process one file or project at a time.

Basic Flow

  1. Create a project.
  2. Select a voice bank.
  3. Paste the full script into the text box.
  4. Use Split to divide the script by blank lines or sentence boundaries.
  5. Review each phase and fix segments that are too long, too short, or incorrectly punctuated.
  6. Assign a character to each phase.
  7. Generate 1 to 3 phases for preview.
  8. Click Generate All for batch synthesis.
  9. Export or copy audio from the output directory shown in the status bar.

Split Recommendations

ProblemRecommendation
Segment is too longInsert blank lines manually to avoid cloud context or TPM limits.
Segment is too shortMerge pure interjections or punctuation-only fragments.
Frequent character switchingSplit each spoken line separately for easier voice assignment.
Narration and dialogue mixed togetherSeparate narration phases from dialogue phases before assigning characters.

Before Batch Generation

CheckReason
Provider concurrencyCloud free tiers should use low concurrency.
RPD / TPMAvoid triggering 429 errors with long text.
Character assignmentPhases without a voice cannot be generated.
Output directoryConfirm disk space and storage path.

Character Assignment Tips

When long text contains several speakers, split it into readable short phases first, then manually assign each phase to a character from the selected voice bank.

SituationHandling
Mostly narrationAssign all phases to the narrator first, then adjust dialogue phases.
Speaker names appear before linesKeep names as review hints, then decide whether to remove them before generation.
One character has large emotional variationDuplicate the character and adjust voice instruction, speed, or reference audio.
Very long batchGenerate a small set first, confirm stability, then continue.

Export Tips

  • Create separate projects by chapter or scene for easier organization.
  • Collect audio from the output directory after generation.
  • If stable filenames matter, number phases inside the project before exporting or sorting files.