How do I lip sync a video with multiple speakers or a two-person dialogue?

Last updated: March 31, 2026

Sync currently supports lip syncing one speaker at a time per generation. Here's how to handle multi-speaker content:

For videos with multiple people visible

Sync will detect faces in your video. Use the face selection feature to choose which person's lips should be synced to the audio. See: How do I select which person to lipsync?

For two-person dialogues (podcasts, interviews, etc.)

The recommended workflow is:

  1. Split your audio into separate tracks — one per speaker

  2. Run separate generations — one for each speaker, using their respective audio track

  3. Combine the outputs in your video editor (Premiere Pro, DaVinci Resolve, etc.)

Tips for best results

  • Keep each speaker's audio isolated — remove crosstalk and overlap where possible

  • Use the same source video for both generations so timing stays aligned

  • If using the API, you can run both generations concurrently to save time

Using the Premiere Pro Plugin

The Sync Premiere Plugin works per-clip. For multi-speaker timelines, apply lip sync to each speaker's segment individually within your timeline.

Need help?

If you're working on a complex multi-speaker project and need guidance, reach out to our support team — we're happy to help you plan the best workflow for your specific use case.