Senior Reinforcement Learning Engineer

Jobgether • United State

Remote

Apply

AI Summary

Design, train, and deploy advanced reinforcement learning systems to solve complex sequential decision-making problems. Collaborate with applied scientists and product teams to identify and prioritize high-impact RL applications. Stay current with reinforcement learning research and translate novel techniques into production-ready solutions.

Key Highlights

Design and implement reinforcement learning systems

Collaborate with applied scientists and product teams

Stay current with reinforcement learning research

Key Responsibilities

Design and implement reinforcement learning systems

Develop and maintain high-fidelity simulation environments

Implement and evaluate RL algorithms

Technical Skills Required

Python PyTorch TensorFlow Probability Optimization Reinforcement Learning Theory

Benefits & Perks

Competitive compensation

Fully remote position

Long-term, multi-year engineering engagement

Job Description

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Reinforcement Learning Engineer in the United States.

This role focuses on designing, training, and deploying advanced reinforcement learning systems that solve complex sequential decision-making problems where traditional supervised learning approaches are insufficient. You will work on building intelligent agents that learn through interaction, with applications spanning simulation environments and real-world production systems. The position blends deep research in modern RL methods with hands-on engineering to ensure models are scalable, stable, and safe in production. You will contribute to shaping reward systems, training infrastructure, and evaluation frameworks that directly influence model behavior. The environment is highly technical and research-driven, requiring close collaboration with applied scientists and product teams. This is a high-impact role where your work will transition cutting-edge RL techniques into production-ready systems. You will help define how intelligent agents are trained, evaluated, and continuously improved at scale.

Accountabilities

Design and implement reinforcement learning systems for sequential decision-making problems across simulated and real-world environments.
Develop and maintain high-fidelity simulation environments to support scalable agent training and experimentation.
Implement and evaluate RL algorithms including policy gradient, actor-critic, off-policy, and offline reinforcement learning methods.
Engineer reward functions and shaping strategies that align model behavior with performance, safety, and business objectives.
Apply offline RL and imitation learning techniques in environments where exploration is constrained or unsafe.
Utilize RLHF, DPO, and related approaches to fine-tune large-scale models where applicable.
Build distributed training infrastructure for RL, including experience replay systems and scalable data pipelines.
Improve training stability and sample efficiency through algorithmic optimization and systems-level enhancements.
Design rigorous evaluation frameworks, including adversarial testing and out-of-distribution validation.
Implement safety mechanisms such as constraints, guardrails, and human-in-the-loop oversight systems.
Collaborate with applied scientists and product teams to identify and prioritize high-impact RL applications.
Monitor production models for drift, performance degradation, and unintended behaviors, building observability tools and alerting systems.
Document methodologies, system design, and operational processes for long-term maintainability and knowledge sharing.

Interested in remote work opportunities in Development & Programming? Discover Development & Programming Remote Jobs featuring exclusive positions from top companies that offer flexible work arrangements.

Stay current with reinforcement learning research and translate novel techniques into production-ready solutions.

Requirements

Master’s or PhD in Computer Science, Machine Learning, or equivalent practical experience.
6+ years of combined experience in reinforcement learning research and engineering.
Strong programming skills in Python and deep learning frameworks such as PyTorch or TensorFlow.
Hands-on experience with RL libraries or custom RL training stacks.
Solid understanding of probability, optimization, and reinforcement learning theory.
Experience designing reward functions in complex or high-dimensional environments.
Familiarity with simulation environments and large-scale training pipelines.
Experience training neural policies on GPU-based distributed systems.
Strong debugging, experimentation, and analytical skills.
Excellent communication skills with a track record of shipping or publishing impactful RL work.

Benefits

Browse our curated collection of remote jobs across all categories and industries, featuring positions from top companies worldwide.

Competitive compensation aligned with experience, typically in the $100,000-$150,000 range.
Fully remote position within the United States.
Long-term, multi-year engineering engagement.
Direct W2 employment with full benefits package.
Opportunity to work on cutting-edge reinforcement learning systems in production environments.
Exposure to large-scale AI training infrastructure and advanced model optimization techniques.
Strong focus on research-to-production impact in a high-growth technical environment.
Collaborative, research-driven engineering culture.

How Jobgether Works

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.

We appreciate your interest and wish you the best!

Why Apply Through Jobgether?

Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Job Overview

Posted Date May 22, 2026

Employment Type Full-time

Experience Level Mid-Senior level

Location United State

Annual Salary 100,000 USD

Category Programming

Company Jobgether

Mentioned Skills

Similar Jobs

Explore other opportunities that match your interests

Software Engineer

Programming

•

4m ago

Premium Job

•••••• •••••• ••••••

Job Type ••••••

Experience Level ••••••

schireson

United State

Senior C++ Engineer - AI Code Review & Reference Implementation

Programming

•

24m ago

Premium Job

•••••• •••••• ••••••

Job Type ••••••

Experience Level ••••••

sme careers

United State

Remote Full-Stack .NET Developer (C#/.NET)

Programming

•

1h ago

Visa Sponsorship Relocation Remote

Job Type Contract

Experience Level Mid-Senior level

upkoi, inc

United State

Senior Reinforcement Learning Engineer

Key Highlights

Key Responsibilities

Technical Skills Required

Benefits & Perks

Job Description

Job Overview

Mentioned Skills

Industries

Similar Jobs

Software Engineer

Premium Job

schireson

Senior C++ Engineer - AI Code Review & Reference Implementation

Premium Job

sme careers

Remote Full-Stack .NET Developer (C#/.NET)

upkoi, inc

Subscribe our newsletter