Senior Data Scientist for AI Training and Evaluation

david joseph & company • United State
Visa Sponsorship
Apply
AI Summary

We are seeking a Senior Data Scientist to work with frontier AI labs and enterprise customers on highly specific dataset problems. The ideal candidate will own end-to-end dataset projects, build custom algorithms, and move between research prototypes and production systems. Strong Python development skills and experience shipping technical work in a startup or high-velocity environment are required.

Key Highlights
Own end-to-end dataset projects for customers
Build custom algorithms, models, and large-scale data pipelines
Move between research prototypes and production systems
Key Responsibilities
Work directly with customers to translate ambiguous dataset needs into concrete technical systems and delivery timelines
Build custom algorithms, models, and large-scale data pipelines spanning computer vision, audio processing, text processing, and metadata analysis
Move between research prototypes and production systems, using models and APIs creatively to solve customer problems
Technical Skills Required
Python Custom algorithms Model workflows Large-scale data pipelines Computer vision Audio processing Text processing Metadata analysis PyTorch
Benefits & Perks
Salary: $150,000-$250,000
Competitive equity
401k
Full health insurance
Breakfast/lunch/dinner covered
Snacks
Ubers home
Nice to Have
Experience building custom algorithms or ML workflows for production video, audio, or multimodal data
Depth in computer vision, audio, or video domains
Hands-on work with large-scale video, audio, or multimodal data processing at scale

Job Description


Our client works with frontier AI labs and enterprise customers on highly specific dataset problems, building custom algorithms, models, and data pipelines at scale. The company focuses on video, audio, and multimodal data processing for AI training and evaluation. The team is small and moving fast to scale deployments across leading AI labs.

About The Role

You'll own end-to-end dataset projects for customers, from untangling ambiguous requirements through shipping production systems that find, generate, filter, transform, evaluate, and package high-quality datasets. This is a high-agency role working directly with customers and internal teams, combining research prototypes with reliable production pipelines. You'll ship fast, move between technical domains within each project, and own customer outcomes directly.

What You'll Own

  • Work directly with customers to translate ambiguous dataset needs into concrete technical systems and delivery timelines
  • Build custom algorithms, models, and large-scale data pipelines spanning computer vision, audio processing, text processing, and metadata analysis
  • Move between research prototypes and production systems, using models and APIs creatively to solve customer problems
  • Break down customer-level goals into the models, heuristics, infrastructure, and QA steps needed to deliver
  • Optimize performance through pre/post-processing, parallelism, inference optimization, fine-tuning, and evaluation loops


Must-Have

  • Strong Python developer with hands-on experience building custom algorithms, model workflows, or large-scale data pipelines
  • Comfortable working directly with customers or external teams to translate ambiguous needs into technical systems
  • Deep intuition for dataset quality, filtering, labeling, evaluation, and edge cases
  • Able to move quickly between research prototypes and reliable production systems without creating brittle code
  • 1 to 3 years of experience shipping technical work in a startup or high-velocity environment.


Nice-to-Have

  • Experience building custom algorithms or ML workflows for production video, audio, or multimodal data
  • Depth in computer vision, audio, or video domains
  • Hands-on work with large-scale video, audio, or multimodal data processing at scale
  • Track record shipping data pipelines or algorithms in production
  • Background with PyTorch or similar ML frameworks in production
  • Experience working directly with customers or external stakeholders
  • Prior experience in a startup or other high-velocity environment, ideally as an early hire
  • Active contributor to open source projects.


Details

Experience: 1-3+ years. Salary: $150,000-$250,000 plus competitive equity. Visa sponsorship: H-1B, OPT. Benefits: 401k, full health insurance, breakfast/lunch/dinner covered, snacks, Ubers home, competitive equity.

Similar Jobs

Explore other opportunities that match your interests

Founding Engineer - AI-Native Mental Health Care

Programming
•
6h ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

legion health

United State
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Not Applicable

Jobs via Dice

United State

Senior Developer Productivity Engineer - CI/CD & Release Specialist

Programming
•
7h ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

anthropic

United State

Subscribe our newsletter

New Things Will Always Update Regularly