Reinforcement Learning Engineer

Appit LLC

Montreal (administrative region), QC, Canada

Full-time

Posted May 24, 2026

Job Description

APPIT Software Solutions is hiring a Reinforcement Learning Engineer in Montreal, Canada. The role involves designing reinforcement learning systems and building adaptive AI agents for optimization, autonomous decision-making, and RLHF alignment of large language models.

Responsibilities

Design and implement reinforcement learning algorithms for enterprise optimization problems
Build RLHF and reward modeling pipelines for LLM alignment and fine-tuning 
Develop simulation environments for training and evaluating RL agents 
Implement multi-agent reinforcement learning systems for complex coordination tasks 
Optimize RL training stability and sample efficiency using state-of-the-art techniques 
Collaborate with research teams to translate RL advances into production applications 

Requirements

5+ years of ML experience with 2+ years focused on reinforceme...
