
Introducing Hummingbird-0: A Leap in Lip Sync
Today, we're releasing Hummingbird-0, a photorealistic, zero-shot lip-sync model that emerged as a research artifact during the development of Phoenix-3, Tavus’ full-face rendering model.

Hummingbird-0: Advancing Zero-Shot Lip Synchronization in AI-Generated Video
We made an unexpected discovery while developing our premium conversational AI technology: components of our advanced video pipeline could be isolated and explicitly optimized for lip synchronization, with remarkable results. This research byproduct evolved into Hummingbird, a specialized zero-shot lip-sync model that achieves state-of-the-art performance compared with other leading solutions.

Sparrow-0: Advancing Conversational Responsiveness in Video Agents with Transformer-Based Turn-Taking
In this paper, we describe the research and development behind Sparrow-0: its transformer-based approach to turn-taking and its integration with the Raven and Phoenix models in our Conversational Video Interface (CVI), an end-to-end operating system for building responsive video agents.

Phoenix-1: Realistic Avatar Generation in the Wild
This research paper, written by the Tavus team, details the development of Phoenix, a groundbreaking generative model for realistic avatar creation and text-to-video generation. Phoenix leverages audio- and text-driven 3D models, combining volumetric rendering techniques with 2D Generative Adversarial Networks (GANs) to create lifelike replicas from short video clips.

