Senior Researcher - Generative Video and Audio Models

tavus United State
Relocation Remote
Apply
AI Summary

Join Tavus as a Senior Researcher to lead research efforts on generative video and audio models. Work with the Applied ML team to productionize research and stay up-to-date with the latest advancements. Contribute to building lifelike, expressive avatars for real-time applications.

Key Highlights
Lead research efforts on generative video and audio models
Work with the Applied ML team to productionize research
Stay relevant with the latest advancements in AI and machine learning
Technical Skills Required
PyTorch Deep learning models Auto regressive networks Diffusion models Flow matching
Benefits & Perks
Competitive salary ($160K - $250K)
Flexible work schedule
Unlimited PTO
Competitive healthcare
Gear stipends
Remote work options

Job Description


About Us

Tavus is a research lab pioneering human computing. We’re building AI Humans: a new interface that closes the gap between people and machines, free from the friction of today’s systems. Our real-time human simulation models let machines see, hear, respond, and even look real—enabling meaningful, face-to-face conversations. AI Humans combine the emotional intelligence of humans with the reach and reliability of machines, making them capable, trusted agents available 24/7, in every language, on our terms.

Imagine a therapist anyone can afford. A personal trainer that adapts to your schedule. A fleet of medical assistants that can give every patient the attention they need. With Tavus, individuals, enterprises, and developers can all build AI Humans to connect, understand, and act with empathy at scale.

We’re a Series A company backed by world-class investors including Sequoia Capital, Y Combinator, and Scale Venture Partners.

Be part of shaping a future where humans and machines truly understand each other.

The Role

We’re looking for a Senior Researcher to join our core AI team. Our ideal partner-in-crime works well in startup environments, is comfortable prioritizing for themselves, and is always down to take calculated risks. We’re moving fast and not looking for people to come along for the ride - we’re looking for people to pave the path.

Your Mission 🚀

  • Lead research efforts on generative video and audio models (ex: text-to-speech, speech-to-speech, audio-to-expression and other speech and multimodal AI topics)
  • Work with the Applied ML team to help productionize our research
  • Stay relevant with the latest advancements (and help us create the latest advancements!)

Requirements

  • Have proven experience with flow matching, diffusion models, auto regressive networks in the audio domain.
  • Have experience training deep learning models: from medium-sized to large models.
  • Have experience building streaming text-to-speech models or speech-to-speech models
  • Have strong foundations in audio modeling and demonstrated ability to innovate rapidly through prototyping.
  • Know state-of-the-art architectures in representation learning: audio or image domain, face animation (in addition to having a deep understanding of the direct field of expertise above)
  • Have excellent programming skills and be fluent in PyTorch
  • Show evidence of original research, with publications in top-tier or solid second-tier venues (e.g., CVPR, NeurIPS, BMVC or equivalent).
  • Be excited about building lifelike, expressive avatars for real-time applications.

Additionally, having some of the following experiences may help you be successful in this position:

  • Skills in 3D graphics, Gaussian splatting
  • Other, additional experience with generative models
  • PhD or equivalent experience preferred
  • Experience leading research teams
  • Knowledge of best practices in Software Development

Please note that this position is preferably hybrid in San Francisco and we offer relocation. However we are open to remote candidates as well.

Benefits & Culture

When you join Tavus, you’re joining a diverse and supportive team. Our work is driven by our people, and our success is shared by all. This position has a flexible work schedule, unlimited PTO, competitive healthcare, and gear stipends, as well as plenty of fun. At the end of the day, we want Tavus to be a place for you to learn, directly drive impact, and work with a team you love.

To learn more about our team culture and benefits, check out our hiring page.

Tavus is growing fast, and we’d like you to grow with us. If you’re excited to get your hands dirty and help make machines more human, drop your resume and we’ll be in touch.

We are not looking for cultural fits, we are looking for culture creators. Diversity is what drives our success – it’s at the core of how we hire, communicate, and work. We are inclusive to all and combine our diverse backgrounds, skill sets, and perspectives to build the best experiences for our clients.

Compensation Range: $160K - $250K


Subscribe our newsletter

New Things Will Always Update Regularly