Video Services API Usage Notes
Here you'll find what's currently supported, known limitations and workarounds, and the current usage limits for Adobe's Translate and Lip Sync (TLS) API.
Known limitations and workarounds
- Speaker Mismatch: Speaker mismatches or additional/missing speakers may occasionally occur in output transcripts. This has been observed in approximately 9% of cases. Content where speakers overlap may not produce the best results and should be avoided.
- Voice Modulation: Voices in the output may vary in pitch or show significant modulation. Regenerating the video/audio can often resolve this issue.
- Re-dubbing Dubbed Content: Avoid using deepfake content for re-dubbing purposes.
- Singing Isn't Supported: A music video or a song won't be dubbed correctly.
For editing transcripts
Only sentence editing is currently supported. Do not modify the timestamps.
Speakers can be updated, however don't remove speakers before dubbing. Also, dub using the edited transcripts in different target languages.
Language support
Dubbing is supported for the following languages:
- English (Indian) (
en-IN
) - English (American) (
en-US
) - English (British) (
en-GB
) - Spanish (Spanish) (
es-ES
) - Spanish (Argentina) (
es-AR
) - Spanish (Latin America) (
es-419
) - French (France) (
fr-FR
)) - French (Canada) (
fr-CA
) - Danish (Denmark) (
da-DK
) - Norwegian (Norway) (
nb-NO
) - German (
de-DE
) - Italian (
it-IT
) - Portuguese (Brazil) (
pt-BR
) - Portuguese (Portugal) (
pt-PT
) - Hindi (India) (
hi-IN
) - Japanese (Japan) (
ja-JP
) - Korean (South Korea) (
ko-KR
)
Input video support
Technical details for videos used as input:
- Duration (max): 30 mins
- FPS: 24 fps, 25 fps, 29.97, 30, 50, 59.94, 60
- Resolution (max): Full HD
1920*1080px
or1080*1920px
- CODEC:
H.264, HEVC
- Formats/container:
.mp4, .mov
- Input medium: Pre-signed URL
- Render time: 3x the video length, 10x the video length (for 30 fps and 1080 resolution) if
lipSync
is enabled - Speaker speech (min): 5 secs
- Dubbing and Lip Sync: Multi-speaker support
Input audio support
Technical details for audio used as input:
- Duration (max): 30 mins
- CODEC:
MPEG, PCM
- Formats/container:
.mp3, .wav, .aac
- Input medium: Pre-signed URL
- Render time: 3x the audio length
- Dubbing: Multi-speaker support
Request limits per API
To ensure equitable peak performance, Adobe places limits on the volume, frequency, and concurrency of API calls, and monitors API usage in order to proactively reach out and resolve any risks to performance.
These usage limits apply to your entire organization.
The current limitations are:
Transcribe endpoint: 5 requests per minute.
Dubbing/Lip Sync endpoint: 5 requests per minute and 150 requests per day.
Get Result endpoint: 100 requests per minute.