Reinforcement Learning Engineer

Appit LLC

montreal (administrative region), qc, Canada
Full-time
Posted May 24, 2026

Job Description

APPIT Software Solutions is hiring a Reinforcement Learning Engineer in Montreal, Canada . Design reinforcement learning systems at APPIT Software in Montreal, building adaptive AI agents for optimization, autonomous decision-making, and RLHF alignment of large language models.

Responsibilities

  • Design and implement reinforcement learning algorithms for enterprise optimization problems
  • Build RLHF and reward modeling pipelines for LLM alignment and fine-tuning
  • Develop simulation environments for training and evaluating RL agents
  • Implement multi-agent reinforcement learning systems for complex coordination tasks
  • Optimize RL training stability and sample efficiency using state-of-the-art techniques
  • Collaborate with research teams to translate RL advances into production applications

Requirements

  • 5+ years of ML experience with 2+ years focused on reinforceme...