benchmarks/tts-quality

TTS quality benchmark — compare local vs remote voiceover models.

Category: benchmarks
Source: workflows/benchmarks/tts_quality.py

Input fields:

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| models | string[] | [] | Subset of models to benchmark. Empty = all available. |
| narration | string | "" | Generated narration text (populated by generate_script task). |
| regenerate | object | | When set, this run is a regeneration. Workflows may read direction / keep / extra_instructions to modulate prompts; the engine persists parent_run_id and parent_variant_index as run lineage columns. |
| script | string | "" | Direct narration text. If empty, a script is generated from the topic. |
| topic | string | "The future of artificial intelligence and its impact on creative industries" | Topic to generate a narration script about (ignored if script is provided). |
| variants | integer | 1 | Number of independent variant executions (1–10). When > 1, the engine runs the workflow N times with different sampling, producing N outputs. |
| voice_gender | string | "female" | Voice gender: male or female. |
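The constraints stated in the input field descriptions above (variants in the range 1–10, voice_gender limited to male or female, script taking precedence over topic) can be checked before a run is submitted. The following is a minimal sketch, not the engine's own validation: `validate_input` and `script_source` are hypothetical helper names, and treating `variants` as a key of the same mapping as the other fields is an assumption.

```python
from typing import Any

# Allowed values taken from the voice_gender field description.
ALLOWED_GENDERS = {"male", "female"}


def validate_input(inputs: dict[str, Any]) -> list[str]:
    """Return a list of problems found in an input mapping (hypothetical helper)."""
    problems: list[str] = []

    # variants: integer in 1..10; bool is a subclass of int, so reject it explicitly.
    variants = inputs.get("variants", 1)
    if isinstance(variants, bool) or not isinstance(variants, int) or not 1 <= variants <= 10:
        problems.append("variants must be an integer from 1 to 10")

    # voice_gender: defaults to "female" when omitted.
    if inputs.get("voice_gender", "female") not in ALLOWED_GENDERS:
        problems.append("voice_gender must be 'male' or 'female'")

    # models: a list of model names; an empty list means "all available".
    models = inputs.get("models", [])
    if not isinstance(models, list) or not all(isinstance(m, str) for m in models):
        problems.append("models must be a list of strings")

    return problems


def script_source(inputs: dict[str, Any]) -> str:
    """Report whether narration comes from `script` or is generated from `topic`."""
    return "script" if inputs.get("script", "") else "topic"
```

An empty mapping passes, because every field has a default or is optional; a non-empty `script` means `topic` is ignored.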
Output fields:

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| kind | object | | Variant card shape: video / carousel / image / text. Surfaced on the per-variant entry of the run-output API and used by gallery UIs to pick the right layout. |
| output_dir | string | "" | Directory containing audio files. |
| results | object[] | | Per-model benchmark results. |
| summary | object | | Aggregated comparison summary. |
Pipeline: generate_script → benchmark_tts_models

| Task | Description |
| --- | --- |
| generate_script | Generate or use a provided narration script (~60 seconds when spoken). |
| benchmark_tts_models | Run TTS generation with each model and collect results. |

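Save the YAML below as my-run.yaml, edit the values, and run it with the CLI or POST it to the API. Required fields are uncommented; this workflow has none, so input: is an empty mapping. Optional fields are listed as comments above the input: block. To set one, replace the empty {} with a mapping, copy the field's line under input:, and uncomment it.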

workflow: benchmarks/tts-quality
# Optional fields — copy any line(s) under `input:` and uncomment to set:
# Subset of models to benchmark. Empty = all available.
# models: []
#
# Generated narration text (populated by generate_script task).
# narration: ""
#
# Direct narration text. If empty, a script is generated from the topic.
# script: ""
#
# Topic to generate a narration script about (ignored if script is provided).
# topic: The future of artificial intelligence and its impact on creative industries
#
# Voice gender: male or female.
# voice_gender: female
#
input: {}
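The same run-spec can also be assembled programmatically. The sketch below emits it as JSON, which this page lists as a run-spec format alongside YAML and TOML. It assumes the JSON form uses the same top-level keys as the YAML above (`workflow` and `input`); `build_run_spec` is a hypothetical helper, not part of the Fabric tooling.

```python
import json

WORKFLOW = "benchmarks/tts-quality"


def build_run_spec(**inputs) -> str:
    """Serialize a run-spec with the given optional input fields (hypothetical helper).

    Assumption: the JSON run-spec mirrors the YAML keys `workflow` and `input`.
    """
    spec = {"workflow": WORKFLOW, "input": dict(inputs)}
    return json.dumps(spec, indent=2)


if __name__ == "__main__":
    # Override two optional fields; everything else keeps its default.
    print(build_run_spec(voice_gender="male", script="Hello from the benchmark."))
```

Calling `build_run_spec()` with no arguments yields the same empty `input` mapping as the YAML template.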

Run it locally:

fab-workflow --from-file my-run.yaml

Or submit it over HTTP; the same file is the request body:

curl -X POST 'https://gofabric.dev/v1/workflows/runs?name=benchmarks/tts-quality' \
-H 'Authorization: Bearer fab_xxx' \
-H 'content-type: application/yaml' \
--data-binary @my-run.yaml
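For callers that prefer Python to curl, the request above can be reproduced with the standard library. This is a sketch under stated assumptions: the URL, query parameter, and headers are copied from the curl command, while `build_run_request` is a hypothetical helper and the response format is not documented here, so the example only constructs the request.

```python
import urllib.request
from urllib.parse import urlencode

# Endpoint taken from the curl example.
API_URL = "https://gofabric.dev/v1/workflows/runs"


def build_run_request(
    spec_bytes: bytes,
    token: str,
    workflow: str = "benchmarks/tts-quality",
) -> urllib.request.Request:
    """Build (but do not send) the POST request for a run-spec (hypothetical helper)."""
    # safe="/" keeps the slash in the workflow name unescaped, matching the curl URL.
    query = urlencode({"name": workflow}, safe="/")
    return urllib.request.Request(
        f"{API_URL}?{query}",
        data=spec_bytes,  # raw file contents, like --data-binary
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/yaml",
        },
    )


if __name__ == "__main__":
    with open("my-run.yaml", "rb") as fh:
        request = build_run_request(fh.read(), token="fab_xxx")
    # Sending is left to the caller, e.g. urllib.request.urlopen(request).
    print(request.get_method(), request.full_url)
```

Keeping request construction separate from sending makes it easy to inspect the URL and headers before any network call is made.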

Every workflow also accepts the universal WorkflowInput fields — variants (1–10 fan-out) and regenerate (creative-direction hints with run lineage). See Run-specs (YAML / TOML / JSON) for the full top-level shape (metadata, priority, bundle, parent, etc.).