Senior Staff Machine Learning Engineer (Large Language Models & Mixture-of-Experts Architectures)

Orbis Group • United State
Relocation
Apply
AI Summary

Join a fast-growing AI platform as a Staff Machine Learning Engineer to develop advanced Large Language Models and Mixture-of-Experts architectures.

Key Highlights
Develop advanced Large Language Models and Mixture-of-Experts (MoE) architectures
Experiment with novel algorithms to improve efficiency and scalability
Collaborate with engineering and product teams to deploy ML capabilities into production
Technical Skills Required
Machine Learning PyTorch TensorFlow AWS Azure GCP Spark Hadoop Docker Kubernetes
Benefits & Perks
Competitive compensation and performance incentives
Comprehensive medical, dental, and vision benefits
Monthly wellness stipend + annual continuing education credit
Flexible work environment
Parental and bereavement leave and other employee support programs

Job Description


Staff Machine Learning Engineer - LLMs / Mixture-of-Experts

(Hybrid Austin, US Citizens Only)


The Role


Are you excited by the challenge of pushing the boundaries of what modern AI models can do - especially when data is limited? A fast-growing AI platform is looking for a Staff Machine Learning Engineer to help shape the next generation of large-scale intelligent systems.


In this role, you’ll take the lead on developing advanced Large Language Models (LLMs) and Mixture-of-Experts (MoE) architectures, driving innovation that directly influences product capabilities and performance. If you thrive at the intersection of research and real-world impact, you’ll feel right at home here.


This hybrid role is based in Austin, Texas, with relocation support available for qualified candidates. U.S. citizenship is required due to work involving security-sensitive projects.


What You’ll Work On


  • Architect, train, and optimize cutting-edge LLMs and MoE-based systems.
  • Experiment with novel algorithms to improve efficiency, scalability, and model performance.
  • Collaborate closely with engineering and product teams to deploy ML capabilities into production.
  • Contribute to pioneering research in ML and NLP, driving methodological advancements.
  • Mentor engineers and help shape technical best practices across the organisation.


What We’re Looking For


  • Advanced degree in Computer Science or a related field (PhD preferred).
  • 6+ years of industry experience building and deploying machine learning models at scale.
  • Deep expertise in LLMs, Mixture-of-Experts architectures, and modern ML frameworks such as PyTorch or TensorFlow.
  • Demonstrated innovation through impactful research, patents, or production-grade ML systems.
  • Ability to lead complex, cross-functional technical initiatives.
  • Strong problem-solving skills and a passion for pushing the boundaries of AI.


Bonus Skills


  • Publications or conference presentations at leading ML/NLP venues such as NeurIPS, ICML, ICLR, AAAI, EMNLP, NACL, ACL, EACL, CoNLL, or similar.
  • Experience with cloud platforms (AWS, Azure, GCP) and distributed computing tools (Spark, Hadoop).
  • Familiarity with containerization and orchestration (Docker, Kubernetes).


Why You’ll Love It


You’ll join a team that embodies transparency, ownership, tenacity, and humility - values that guide both technical decision-making and collaboration.


You’ll also enjoy:


  • Competitive compensation and performance incentives
  • Comprehensive medical, dental, and vision benefits
  • Monthly wellness stipend + annual continuing education credit
  • A flexible work environment and unlimited approved PTO
  • Parental and bereavement leave and other employee support programs


This role is hybrid in Austin with relocation assistance offered for the right candidate.

Only U.S. citizens are eligible due to the security clearance requirements.


Subscribe our newsletter

New Things Will Always Update Regularly