Browse Jobs Find Talent Pricing Why Us Blog Post a Job
Advertisement
C

Tether: AI Research Engineer

Not Specified Global / Not Specified Engineering & Tech Senior

Salary not specified
Posted May 3, 2026
13 views
0 apply clicks
Apply Now ← Browse Jobs

About the Role

As a member of the AI model team, you will drive innovation in reinforcement learning approaches for advanced models. Your work will optimize decision-making and adaptive behavior to deliver enhanced intelligence, improved performance, and domain-specific capabilities for real-world challenges.

Responsibilities
  • Develop and implement state-of-the-art reinforcement learning algorithms designed to optimize decision-making processes in simulated and real-world settings.
  • Build, run, and monitor controlled reinforcement learning experiments, tracking key performance indicators and documenting iterative results.
  • Identify and curate high-quality simulation environments and training datasets tailored to specific domain challenges.
  • Systematically debug and optimize the reinforcement learning pipeline by analyzing computational efficiency and learning performance metrics.
  • Collaborate with cross-functional teams to integrate reinforcement learning agents into production systems and ensure continuous monitoring.
Requirements
  • A degree in Computer Science or a related field; ideally a PhD in NLP, Machine Learning, or a related field with a track record in AI R&D.
  • Proven experience with large-scale reinforcement learning experiments, including online RL techniques such as Group Relative Policy Optimization (GRPO).
  • Deep understanding of reinforcement learning algorithms, including policy gradients, actor-critic, and other gradient-based optimization approaches.
  • Strong expertise in PyTorch and relevant reinforcement learning frameworks.
  • Demonstrated ability to apply empirical research to overcome challenges like sample inefficiency and training instability.
Advertisement

Required Skills

Reinforcement Learning Machine Learning NLP PyTorch GRPO Actor-Critic Policy Gradients AI Research Multi-modal Architectures
Interested in this role?
Sign in to your free seeker account to apply.
Sign In to Apply →
Advertisement
← Back to all remote jobs