How video-pipeline turns a brief into a film

Cinema, frame by frame. A one-line brief becomes a finished film, reasoned and gated before a single frame renders. This reference walks the gated pipeline, the five-seat committee, the pluggable providers and the environment it all runs on.

What it is

video-pipeline is a strategy-first AI video studio. A one-line brief becomes a finished film and a client-ready brief, with the reasoning surfaced at every step.

Research, brief and blueprint are reasoned by Claude. Image, video and audio are pluggable providers, so the best model for a shot is never welded into the pipeline. Two artifacts ship at the end of every standard run:

G5_final.mp4: Finished film. Stitched, scored, captioned MP4.

G6_brief.pdf: Client brief. Puppeteer-rendered strategy PDF with projections.

The gated pipeline

Ten gates run in order. Strategy is reasoned and voted on first; a human approves the blueprint before a single frame is generated; only then do stills, clips, the stitch and the PDF run.

Gate  Stage            Kind
G0    Research         Stage
G1    Brief            Stage
G2    Blueprint        Stage
GA    Audio Plan       Stage
QC    Committee QC     Committee
      Strategy Review  Human gate
G3    Stills           Stage
G4    Clips            Stage
G5    Stitch           Stage
G6    Brief PDF        Stage

Human gate. Strategy Review () requires a person to approve the blueprint before any frame renders. The committee advises, the human decides.

Render path. G3 stills run in parallel, G4 clips route per shot, G5 stitches with ffmpeg, and G6 renders the brief PDF with puppeteer.
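
The ordering above can be sketched as a simple loop that stops at the first gate that does not pass. This is a minimal illustration, not the pipeline's actual code: the `run_gates` function and its two callbacks are hypothetical, and `strategy_review` is a placeholder name because this reference gives no identifier for the human gate.

```python
from typing import Callable, List, Tuple

# Gate order as listed in the table above. "strategy_review" is a placeholder:
# the reference does not give the human gate an identifier.
GATES: List[Tuple[str, str]] = [
    ("G0", "stage"), ("G1", "stage"), ("G2", "stage"), ("GA", "stage"),
    ("QC", "committee"), ("strategy_review", "human"),
    ("G3", "stage"), ("G4", "stage"), ("G5", "stage"), ("G6", "stage"),
]


def run_gates(committee_ok: Callable[[], bool],
              human_approves: Callable[[], bool]) -> List[str]:
    """Walk the gates in order and return the ones that completed.

    The walk stops at the committee gate if the committee blocks, and at the
    human gate if the reviewer does not approve, so no render gate (G3-G6)
    can run without both.
    """
    completed: List[str] = []
    for gate, kind in GATES:
        if kind == "committee" and not committee_ok():
            break
        if kind == "human" and not human_approves():
            break
        completed.append(gate)  # a real stage would do its work here
    return completed
```

With a rejecting reviewer, `run_gates(lambda: True, lambda: False)` returns only the strategy gates and the committee gate; nothing from G3 onward runs.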

The Counter-Propaganda Committee

Before you're ever asked to approve, five independent seats (each its own model and brief) vote on the strategy. A FAIL from a veto seat blocks the run. Verdicts are PASS · WARN · FAIL.

genz

Gen Z taste-checker

Will this actually land with a 22-year-old, or does it read as a brand trying too hard?

Claude Opus 4.7 · Universal veto #1

cultural

UAE / Khaleeji authenticity

Does it respect the culture it's speaking to: language, symbols, setting?

Claude Sonnet 4.6 · Veto #2

director

Story · pacing · shot logic

Does the cut sequence actually tell the story, with the right rhythm?

Claude Sonnet 4.6 · Tie-break

red_team

Attack, then steelman

Find the strongest case against this idea, then the strongest case for it.

Claude Sonnet 4.6 · Advisory

hook

First 3 seconds · platform fit

Does the open earn the scroll on the platform it's built for?

Claude Sonnet 4.6 · Advisory

PASS: strategy proceeds.

WARN: noted, not blocking.

FAIL from a veto seat blocks it.

When a veto seat returns FAIL, the run ends in a failed state with the committee summary attached. The standard tier runs three seats (genz · director · hook); the special tier runs all five.
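
The blocking rule can be sketched as follows. This is an illustration under stated assumptions, not the committee's real implementation: `committee_blocks` and the two lookup tables are hypothetical names, the test tier is assumed to seat nobody, and the director's tie-break role is not modelled because this reference does not define it.

```python
from typing import Dict, List

# Seats marked as veto seats in the committee listing above.
VETO_SEATS = {"genz", "cultural"}

# Seats per tier, as described in this section. The empty list for "test"
# reflects the "Committee: None" entry under Run tiers.
SEATS_BY_TIER: Dict[str, List[str]] = {
    "test": [],
    "standard": ["genz", "director", "hook"],
    "special": ["genz", "cultural", "director", "red_team", "hook"],
}


def committee_blocks(tier: str, verdicts: Dict[str, str]) -> bool:
    """Return True if a veto seat that sits in this tier voted FAIL.

    Verdicts are "PASS", "WARN" or "FAIL", keyed by seat id. WARN never
    blocks, and a FAIL from a non-veto seat is treated as advisory.
    """
    seated = SEATS_BY_TIER[tier]
    return any(
        verdicts.get(seat) == "FAIL"
        for seat in seated
        if seat in VETO_SEATS
    )
```

Note that under this reading a cultural FAIL can only block a special-tier run, since the cultural seat does not sit in the standard tier.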

Pluggable providers

Pick the best model for the shot, not the one your pipeline is welded to. Backends are selected by env, with sensible fallbacks. Shots can even mix video providers.

Image

Selected by IMAGE_BACKEND, else the first available in the order fal, higgsfield, openai.

fal (default)
Models: nano-banana-2 · nano-banana-pro
Key: FAL_API_KEY

Higgsfield
Models: soul-standard · reve
Key: HIGGSFIELD_API_KEY (+ secret)

OpenAI
Models: gpt-image-2 · gpt-image-1.5 · dall-e-3
Key: OPENAI_API_KEY
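
A minimal sketch of the image backend selection, in the fallback order fal, higgsfield, openai. The `pick_image_backend` function is hypothetical, and it assumes that "available" means every key for that backend is set in the environment; the real pipeline may test availability differently.

```python
from typing import Dict, List, Mapping, Optional

# Keys per image backend, in fallback order (dicts preserve insertion order).
# HIGGSFIELD_API_KEY_SECRET is the "+ secret" named in the environment reference.
IMAGE_BACKEND_KEYS: Dict[str, List[str]] = {
    "fal": ["FAL_API_KEY"],
    "higgsfield": ["HIGGSFIELD_API_KEY", "HIGGSFIELD_API_KEY_SECRET"],
    "openai": ["OPENAI_API_KEY"],
}


def pick_image_backend(env: Mapping[str, str]) -> Optional[str]:
    """Return the pinned backend, else the first one whose keys are all set."""
    pinned = env.get("IMAGE_BACKEND")
    if pinned:
        return pinned
    for name, keys in IMAGE_BACKEND_KEYS.items():
        if all(env.get(key) for key in keys):
            return name
    return None  # no image backend is usable
```

In practice `env` would be `os.environ`; passing a plain mapping keeps the function easy to test.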

Video

Picked per shot from video_model. Shots can mix providers; routing.video_default is the fallback.

Seedance (Ark)
Models: seadance_2 · seadance_2_fast
Key: SEADANCE_API_KEY
Direct via Volcengine Ark.

Higgsfield
Models: kling3_0 · seedance_2_0
Key: HIGGSFIELD_API_KEY (+ secret)
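
Per-shot routing might look like the sketch below. The model ids are copied verbatim from the provider list; everything else is an assumption made for illustration: the `route_shot` function, the provider labels `seedance_ark` and `higgsfield`, the dict shape of a shot, and the reading that `routing.video_default` holds a model id.

```python
from typing import Dict, Tuple

# Model id -> provider label. The labels are hypothetical names for the two
# video providers described above.
VIDEO_PROVIDER_BY_MODEL: Dict[str, str] = {
    "seadance_2": "seedance_ark",
    "seadance_2_fast": "seedance_ark",
    "kling3_0": "higgsfield",
    "seedance_2_0": "higgsfield",
}


def route_shot(shot: dict, routing: dict) -> Tuple[str, str]:
    """Return (model, provider) for one shot.

    Uses the shot's own video_model when present, otherwise falls back to
    routing["video_default"].
    """
    model = shot.get("video_model") or routing["video_default"]
    if model not in VIDEO_PROVIDER_BY_MODEL:
        raise ValueError(f"unknown video model: {model}")
    return model, VIDEO_PROVIDER_BY_MODEL[model]
```

Because the lookup happens per shot, two shots in the same film can land on different providers, which is the mixing behaviour described above.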

Audio

ElevenLabs drives voiceover (GA1) and the music bed (GA2).

ElevenLabs
Used for: Voiceover (GA1) · Music bed (GA2)
Key: ELEVENLABS_API_KEY

Run tiers

Three tiers trade ceremony for confidence. standard is the production default; special adds the full cultural review and two QC passes.

test

Fast iteration, no ceremony

Strategy: Inline mini-blueprint
Committee: None
QC: None
Audio: Not included
Brief PDF: Not included

standard (default)

The production default

Strategy: Full package + rationale + PDF projections
Committee: 3 seats · genz / director / hook
QC: Auto-pass
Audio: Included
Brief PDF: Included

special

Flagship, fully gated

Strategy: Full package + cultural review
Committee: 5 seats · full committee
QC: QC1 after stills · QC2 after brief
Audio: Included
Brief PDF: Included
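
The three tiers can be captured as a small lookup table. This is a sketch of one possible representation, not the pipeline's configuration format: the `Tier` dataclass, its field names and the `get_tier` helper are all hypothetical, and standard's "Auto-pass" QC is modelled here as no QC checkpoints actually running.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple


@dataclass(frozen=True)
class Tier:
    strategy: str
    committee_seats: int
    qc_checkpoints: Tuple[str, ...]  # QC passes that actually run
    audio: bool
    brief_pdf: bool


TIERS: Dict[str, Tier] = {
    "test": Tier("inline mini-blueprint", 0, (), False, False),
    "standard": Tier("full package + rationale + PDF projections", 3, (), True, True),
    "special": Tier("full package + cultural review", 5, ("QC1", "QC2"), True, True),
}


def get_tier(name: Optional[str] = None) -> Tier:
    """Look up a tier by name; standard is the default when none is given."""
    return TIERS[name or "standard"]
```

For example, `get_tier().brief_pdf` is true for the default tier, while `get_tier("test").brief_pdf` is false.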

Environment reference

Every key the pipeline reads, and what it powers. ANTHROPIC_API_KEY is always required; provider keys are needed only for the backends you actually route to.

Variable                   Status
ANTHROPIC_API_KEY          required
FAL_API_KEY                conditional
HIGGSFIELD_API_KEY         conditional
HIGGSFIELD_API_KEY_SECRET  conditional
OPENAI_API_KEY             conditional
SEADANCE_API_KEY           conditional
ELEVENLABS_API_KEY         conditional
IMAGE_BACKEND              optional

Higgsfield needs both HIGGSFIELD_API_KEY and HIGGSFIELD_API_KEY_SECRET. Set IMAGE_BACKEND to pin the image backend; leave it unset to fall through fal, higgsfield, openai in that order.
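
A preflight check along these lines can catch a missing key before a run starts. It is a sketch only: the `missing_keys` function and the backend labels (`seedance_ark`, `elevenlabs` and so on) are hypothetical, while the variable names are the ones listed in the table above.

```python
from typing import Dict, Iterable, List, Mapping

# Keys each backend needs. Backend labels are illustrative names.
KEYS_BY_BACKEND: Dict[str, List[str]] = {
    "fal": ["FAL_API_KEY"],
    "higgsfield": ["HIGGSFIELD_API_KEY", "HIGGSFIELD_API_KEY_SECRET"],
    "openai": ["OPENAI_API_KEY"],
    "seedance_ark": ["SEADANCE_API_KEY"],
    "elevenlabs": ["ELEVENLABS_API_KEY"],
}


def missing_keys(env: Mapping[str, str], backends: Iterable[str]) -> List[str]:
    """List the unset variables needed for the backends a run routes to.

    ANTHROPIC_API_KEY is always checked; provider keys are checked only for
    the backends passed in.
    """
    needed = ["ANTHROPIC_API_KEY"]
    for backend in backends:
        for key in KEYS_BY_BACKEND[backend]:
            if key not in needed:  # a backend may be listed more than once
                needed.append(key)
    return [key for key in needed if not env.get(key)]
```

Calling `missing_keys(os.environ, ["fal", "higgsfield", "elevenlabs"])` before a run would return an empty list when everything that run needs is configured.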