Video Services API Usage Notes

Here you'll find what's currently supported, known limitations and workarounds, and the current usage limits for Adobe's Translate and Lip Sync (TLS) API.

Known limitations and workarounds

  • Speaker Mismatch: Speaker mismatches or additional/missing speakers may occasionally occur in output transcripts. This has been observed in approximately 9% of cases. Content where speakers overlap may not produce the best results and should be avoided.
  • Voice Modulation: Voices in the output may vary in pitch or show significant modulation. Regenerating the video/audio can often resolve this issue.
  • Re-dubbing Dubbed Content: Avoid using already dubbed or deepfake content as input for re-dubbing.
  • Singing Isn't Supported: A music video or a song won't be dubbed correctly.

For editing transcripts

Only sentence editing is currently supported. Do not modify the timestamps.

Speakers can be updated; however, don't remove speakers before dubbing. The edited transcripts can also be used to dub into different target languages.
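The editing rules above can be checked before an edited transcript is submitted. This is a minimal sketch that assumes a hypothetical transcript shape: a list of segments, each a dict with "start", "end", "speaker", and "text" keys. The real transcript schema is not described in these notes, so the field names and the function name are illustrative only. Treating a changed segment count as a problem is also an assumption, based on the statement that only sentence editing is supported.

```python
def transcript_edit_problems(original, edited):
    """Compare an edited transcript with the original and list rule violations.

    Both arguments are lists of segment dicts in a HYPOTHETICAL shape:
    {"start": float, "end": float, "speaker": str, "text": str}.
    """
    problems = []
    # Assumption: sentence editing does not add or drop segments.
    if len(original) != len(edited):
        problems.append("segment count changed")
        return problems
    for index, (before, after) in enumerate(zip(original, edited)):
        # Timestamps must be left exactly as generated.
        if (before["start"], before["end"]) != (after["start"], after["end"]):
            problems.append(f"segment {index}: timestamps modified")
        # Speakers may be updated but not removed.
        if not after.get("speaker"):
            problems.append(f"segment {index}: speaker removed")
    return problems
```

An empty list means the edit only touched sentence text or speaker labels, which is what the rules above allow.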

Language support

Dubbing is supported for the following languages:

  • English (Indian) (en-IN)
  • English (American) (en-US)
  • English (British) (en-GB)
  • Spanish (Spain) (es-ES)
  • Spanish (Argentina) (es-AR)
  • Spanish (Latin America) (es-419)
  • French (France) (fr-FR)
  • French (Canada) (fr-CA)
  • Danish (Denmark) (da-DK)
  • Norwegian (Norway) (nb-NO)
  • German (de-DE)
  • Italian (it-IT)
  • Portuguese (Brazil) (pt-BR)
  • Portuguese (Portugal) (pt-PT)
  • Hindi (India) (hi-IN)
  • Japanese (Japan) (ja-JP)
  • Korean (South Korea) (ko-KR)
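A client can reject unsupported target languages before calling the API. The sketch below copies the locale codes from the list above; the constant and function names are illustrative, and the exact-case comparison is an assumption about how the codes should be written in a request.

```python
# Locale codes copied from the supported-language list above.
SUPPORTED_DUBBING_LOCALES = frozenset({
    "en-IN", "en-US", "en-GB",
    "es-ES", "es-AR", "es-419",
    "fr-FR", "fr-CA",
    "da-DK", "nb-NO",
    "de-DE", "it-IT",
    "pt-BR", "pt-PT",
    "hi-IN", "ja-JP", "ko-KR",
})


def unsupported_locales(target_locales):
    """Return the requested codes that are not in the supported list, in input order."""
    return [code for code in target_locales if code not in SUPPORTED_DUBBING_LOCALES]
```

For example, calling unsupported_locales(["fr-FR", "zh-CN"]) returns ["zh-CN"], so the request can be corrected before it is sent.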

Input video support

Technical details for videos used as input:

  • Duration (max): 30 mins
  • FPS: 24, 25, 29.97, 30, 50, 59.94, 60
  • Resolution (max): Full HD, 1920 x 1080 px or 1080 x 1920 px
  • CODEC: H.264, HEVC
  • Formats/container: .mp4, .mov
  • Input medium: Pre-signed URL
  • Render time: 3x the video length, or 10x the video length (for 30 fps at 1080 resolution) if lipSync is enabled
  • Speaker speech (min): 5 secs
  • Dubbing and Lip Sync: Multi-speaker support
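The video constraints above lend themselves to a local pre-flight check. The following is a minimal sketch, not part of the API: the VideoInfo fields are assumed to come from whatever media probe you already use, the codec labels "h264" and "hevc" are an assumed spelling, and the render estimate simply applies the 3x and 10x multipliers listed above (the 10x figure is stated for 30 fps at 1080 resolution, so other inputs may differ).

```python
from dataclasses import dataclass

MAX_DURATION_S = 30 * 60                      # 30 minutes
ALLOWED_FPS = (24, 25, 29.97, 30, 50, 59.94, 60)
ALLOWED_CODECS = {"h264", "hevc"}             # assumed labels for H.264 and HEVC
ALLOWED_EXTENSIONS = {".mp4", ".mov"}
MAX_LONG_SIDE, MAX_SHORT_SIDE = 1920, 1080    # Full HD, landscape or portrait


@dataclass
class VideoInfo:
    """Hypothetical container for values read from a media probe."""
    duration_s: float
    fps: float
    width: int
    height: int
    codec: str
    extension: str


def video_problems(info):
    """Return a list of constraint violations; an empty list means the input looks acceptable."""
    problems = []
    if info.duration_s > MAX_DURATION_S:
        problems.append("duration exceeds 30 minutes")
    if not any(abs(info.fps - allowed) < 0.01 for allowed in ALLOWED_FPS):
        problems.append("unsupported frame rate")
    long_side, short_side = max(info.width, info.height), min(info.width, info.height)
    if long_side > MAX_LONG_SIDE or short_side > MAX_SHORT_SIDE:
        problems.append("resolution exceeds Full HD")
    if info.codec.lower() not in ALLOWED_CODECS:
        problems.append("unsupported codec")
    if info.extension.lower() not in ALLOWED_EXTENSIONS:
        problems.append("unsupported container")
    return problems


def estimated_render_seconds(duration_s, lip_sync):
    """Apply the documented multipliers: 10x with lip sync, 3x without."""
    return duration_s * (10 if lip_sync else 3)
```

A 10 minute clip would therefore be estimated at about 30 minutes of render time without lip sync and about 100 minutes with it.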

Input audio support

Technical details for audio used as input:

  • Duration (max): 30 mins
  • CODEC: MPEG, PCM
  • Formats/container: .mp3, .wav, .aac
  • Input medium: Pre-signed URL
  • Render time: 3x the audio length
  • Dubbing: Multi-speaker support
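Because audio is supplied as a pre-signed URL, the container check has to ignore the query string that carries the signature. The sketch below shows one way to do that with the Python standard library; the function names are illustrative, and checking only the file extension is an assumption (it does not verify the codec inside the file).

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

AUDIO_EXTENSIONS = {".mp3", ".wav", ".aac"}
MAX_AUDIO_SECONDS = 30 * 60  # 30 minutes


def audio_extension_ok(presigned_url):
    """Check the extension of the URL path, ignoring the signed query string."""
    path = urlparse(presigned_url).path
    return PurePosixPath(path).suffix.lower() in AUDIO_EXTENSIONS


def audio_render_estimate_seconds(duration_s):
    """Apply the documented 3x multiplier, rejecting audio over the maximum duration."""
    if duration_s > MAX_AUDIO_SECONDS:
        raise ValueError("audio exceeds the 30 minute maximum")
    return 3 * duration_s
```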

Request limits per API

To ensure equitable peak performance, Adobe places limits on the volume, frequency, and concurrency of API calls, and monitors API usage in order to proactively reach out and resolve any risks to performance.

The current limits are:

  • Transcribe endpoint: 5 requests per minute.
  • Dubbing/Lip Sync endpoint: 5 requests per minute and 150 requests per day.
  • Get Result endpoint: 100 requests per minute.
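Clients can stay under these limits by throttling themselves before each call. The sketch below is a generic sliding-window limiter, not something provided by the API; the class and function names are illustrative. Modelling the daily limit as a rolling 24 hour window is an assumption, since these notes do not say when the daily count resets.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Allow at most max_calls within any window of period_s seconds."""

    def __init__(self, max_calls, period_s, clock=time.monotonic):
        self.max_calls = max_calls
        self.period_s = period_s
        self.clock = clock
        self._calls = deque()  # timestamps of recorded calls, oldest first

    def wait_time(self):
        """Seconds to wait before the next call is allowed (0.0 if allowed now)."""
        now = self.clock()
        # Drop calls that have fallen out of the window.
        while self._calls and now - self._calls[0] >= self.period_s:
            self._calls.popleft()
        if len(self._calls) < self.max_calls:
            return 0.0
        return self.period_s - (now - self._calls[0])

    def record(self):
        """Register that a call is being made now."""
        self._calls.append(self.clock())


def wait_for_slot(limiters, sleep=time.sleep):
    """Block until every limiter allows a call, then record the call on each."""
    while True:
        delay = max(limiter.wait_time() for limiter in limiters)
        if delay <= 0:
            break
        sleep(delay)
    for limiter in limiters:
        limiter.record()


# Limits taken from the list above; the daily window is an assumed rolling 24 hours.
transcribe_limits = [SlidingWindowLimiter(5, 60)]
dubbing_limits = [SlidingWindowLimiter(5, 60), SlidingWindowLimiter(150, 24 * 60 * 60)]
get_result_limits = [SlidingWindowLimiter(100, 60)]
```

Calling wait_for_slot(dubbing_limits) immediately before each Dubbing/Lip Sync request keeps a single-process client within both the per-minute and the per-day limit. Several processes sharing one set of credentials would need a shared counter instead.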
