Senior Researcher - Generative Video and Audio Models

Tavus • United States

Relocation • Remote


AI Summary

Join Tavus as a Senior Researcher to lead research efforts on generative video and audio models. Work with the Applied ML team to productionize research and stay up to date with the latest advancements. Contribute to building lifelike, expressive avatars for real-time applications.

Key Highlights

Lead research efforts on generative video and audio models

Work with the Applied ML team to productionize research

Stay current with the latest advancements in AI and machine learning

Technical Skills Required

PyTorch, deep learning models, autoregressive networks, diffusion models, flow matching

Benefits & Perks

Competitive salary ($160K - $250K)

Flexible work schedule

Unlimited PTO

Competitive healthcare

Gear stipends

Remote work options

Job Description

About Us

Tavus is a research lab pioneering human computing. We’re building AI Humans: a new interface that closes the gap between people and machines, free from the friction of today’s systems. Our real-time human simulation models let machines see, hear, respond, and even look real—enabling meaningful, face-to-face conversations. AI Humans combine the emotional intelligence of humans with the reach and reliability of machines, making them capable, trusted agents available 24/7, in every language, on our terms.

Imagine a therapist anyone can afford. A personal trainer that adapts to your schedule. A fleet of medical assistants that can give every patient the attention they need. With Tavus, individuals, enterprises, and developers can all build AI Humans to connect, understand, and act with empathy at scale.

We’re a Series A company backed by world-class investors including Sequoia Capital, Y Combinator, and Scale Venture Partners.

Be part of shaping a future where humans and machines truly understand each other.

The Role

We’re looking for a Senior Researcher to join our core AI team. Our ideal partner-in-crime works well in startup environments, is comfortable prioritizing for themselves, and is always down to take calculated risks. We’re moving fast, and we’re not looking for people to come along for the ride; we’re looking for people to pave the path.

Your Mission 🚀

Lead research efforts on generative video and audio models (e.g., text-to-speech, speech-to-speech, audio-to-expression, and other speech and multimodal AI topics)
Work with the Applied ML team to help productionize our research
Stay current with the latest advancements (and help us create them!)

Requirements

Have proven experience with flow matching, diffusion models, and autoregressive networks in the audio domain.
Have experience training deep learning models: from medium-sized to large models.
Have experience building streaming text-to-speech or speech-to-speech models.
Have strong foundations in audio modeling and demonstrated ability to innovate rapidly through prototyping.
Know state-of-the-art architectures in representation learning: audio or image domain, face animation (in addition to having a deep understanding of the direct field of expertise above)
Have excellent programming skills and be fluent in PyTorch
Show evidence of original research, with publications in top-tier or solid second-tier venues (e.g., CVPR, NeurIPS, BMVC or equivalent).
Be excited about building lifelike, expressive avatars for real-time applications.

Additionally, having some of the following experiences may help you be successful in this position:

Skills in 3D graphics, Gaussian splatting
Other, additional experience with generative models
PhD or equivalent experience preferred
Experience leading research teams
Knowledge of software development best practices

Please note that this position is preferably hybrid in San Francisco, and we offer relocation. However, we are open to remote candidates as well.

Benefits & Culture

When you join Tavus, you’re joining a diverse and supportive team. Our work is driven by our people, and our success is shared by all. This position has a flexible work schedule, unlimited PTO, competitive healthcare, and gear stipends, as well as plenty of fun. At the end of the day, we want Tavus to be a place for you to learn, directly drive impact, and work with a team you love.

To learn more about our team culture and benefits, check out our hiring page.

Tavus is growing fast, and we’d like you to grow with us. If you’re excited to get your hands dirty and help make machines more human, drop your resume and we’ll be in touch.

We are not looking for cultural fits; we are looking for culture creators. Diversity is what drives our success – it’s at the core of how we hire, communicate, and work. We are inclusive to all and combine our diverse backgrounds, skill sets, and perspectives to build the best experiences for our clients.

Compensation Range: $160K - $250K

Job Overview

Posted Date: Nov 28, 2025

Employment Type: Full-time

Experience Level: Mid-Senior level

Location: United States

Category: Graphic Design

Company: Tavus
