data-slots=image, heading, text
data-background=rgb(233, 80, 80)
Audio/Video APIs
Overview
Audio/Video APIs are a collection of resources that leverage Firefly Services' AI to create and customize audio and video content.
data-slots=heading, link, text
Explore our APIs
data-slots=link, text
data-slots=link, text
data-slots=link, text
data-slots=link, text
Dynamic Graphics Render API
Programmatically generate video variations from Motion Graphics Templates (MOGRTs).
What is the Dynamic Graphics Render API?
Use the Dynamic Graphics Render API to automate branded video creation. The service consumes Motion Graphics Templates (MOGRTs) and lets you override essential graphics parameters to deliver fully customized clips in seconds.
This API ingests Adobe After Effect's (AE) Motion Graphics Templates (MOGRTs), exposes editable Essential Graphics controls, and renders finalized video assets using predefined or custom encoding presets. Users can:
- Inspect editable template controls.
- Override text, image, video (fixed, per defined slot length), audio, and design parameters.
- Render brand‑ready, social‑optimized video outputs at scale.
Why choose this API?
Common use cases:
- Localize on-screen text to distribute branded video in multiple geographies.
- Personalizing marketing videos with dynamic text and images.
- Generate social-ready videos for different products from the same video template.
- Automating brand‑compliant video creation for campaigns.
For Marketers/Designers:
- Create a custom MOGRT template and use the Dynamic Graphics Render API to dynamically control the Essential Graphics parameters defined in the template to generate video variations.
How it works
The API workflow is simple and powerful. It mirrors other video API's asynchronous job model:
-
Discover presets using the Get Presets API.
Use our predefined social presets or your own custom ones. -
Describe a template to retrieve editable controls.
These are Essential Graphics parameters defined in MOGRT. -
Render template variations with defined assets and presets.
Render up to 10 variations of the same template in a single call. -
Get Status with a job ID to get results of Describe and Render template API calls.
Explore what you can do with this API in the Dynamic Graphics Render API guides.
Reframe API
The Reframe API intelligently analyzes video content to dynamically adjust frame composition to fit the aspect ratios that you've specified, generating seamless content where it's needed from the existing video characteristics.
data-slots=video, heading, text
Reframe your videos with AI
data-variant=info
data-slots=text
Reframe features include:
- Generate Video Variations: The API accepts video input, processes it, and delivers output with specific aspect ratios (including but not limited to 4:3, 9:16, and 1:1) via downloadable links.
- Analyze Scenes: Enable scene edit detection to analyze video transitions and use the existing video characteristics to maintain compositional integrity across different aspect ratio outputs.
- Track Status: Check a job's progress using a designated endpoint. Response times and update frequencies are subject to system load and configuration.
- Add Overlays: Apply pre-generated graphic overlays, such as GIFs or PNGs, over videos with precise control over timing, positioning, scaling, and looping behavior. Customization ensures that overlays align across different aspect ratios and remain consistent with the visual layout.
Why choose Reframe v2?
Consider the Reframe v2 API to take your video workflows to the next level. Whether you're optimizing for e-commerce, brand storytelling, or high-volume creative production, Reframe v2 delivers unmatched flexibility and control.
-
Pixel-Perfect Resolution
Define exact output sizes, like1920x1080, for precision processing. -
Semantic Subject Lock
Keep your subject in focus across every frame, every shot, every time. Just provide a keyword or prompt (for example, "Frisbee" or "man in yellow jacket") and let AI automatically reframe around your chosen subject. -
Media Destination
Define where your rendered video should go with a simple, secure upload flow. Just provide a pre-signed PUT URL to your storage bucket, and we'll handle the rest.
Explore what you can do with this API in the Reframe API guides.
Translate and Lip Sync API
The Translate and Lip Sync (TLS) API uses transcriptions to generate audio and video with precise, accurate dubbing and composited lip sync. This feature supports multi-speaker scenarios.
What is this API?
The Translate and Lip Sync (TLS) API allows you to:
- Transcribe audio and video.
- Generate captions for audio and video.
- Automated Dubbing for audio and video.
- Dubbing with edited transcripts.
- Dubbing with pre-existing translations.
Lip Sync is also included as a parameter of the Dub API to create high-quality composited videos with precise lip-syncing. Content Authenticity Initiative (CAI) support ensures protection against deepfakes.
Explore what you can do with this API in the Translate and Lip Sync API guides.
Text to Speech API
The Text to Speech (TTS) API generates lifelike spoken audio from a provided transcript.
What is this API?
The Text to Speech API allows you to:
- Choose voices from Firefly's catalog of voices.
- Turn prompts into spoken audio.
- Generate speech in a variety of languages and accents.
Explore what you can do with this API in the Text to Speech API guides.
Text to Avatar API
Using the Avatar API you can generate an Avatar video with a text prompt or audio input.
What is this API?
Options with the endpoint allow you to:
- Select an avatar from a catalog of stock actors.
- Select a voice from a catalog of stock voices.
- Use your own voice file to create avatar videos.
- Set your own image/video as a video background.
Explore what you can do with this API in the Text to Avatar API guides.
data-slots=heading, text, button
data-variant=secondary
data-backgroundColor=background-color-gray