Montreal Reinforcement Learning Engineer Role

Appit LLC

Montreal (administrative region), QC, Canada

Full-time

Posted June 12, 2026


Job Description

Join APPIT Software Solutions in Montreal as a Reinforcement Learning Engineer, designing innovative systems that drive AI optimization and autonomous decision-making. Leverage your skills in cutting-edge reinforcement learning methodologies to push boundaries within the AI landscape.
This position is ideal for candidates with a solid ML background, including 5+ years of experience and 2 years focused on reinforcement learning. Your main responsibilities will include designing algorithms for enterprise applications and contributing to RLHF efforts for large language models. The role also involves developing simulation environments and multi-agent systems, so collaboration with research teams is essential.
Key Responsibilities:
• Design and implement reinforcement learning systems
• Build RLHF pipelines for model fine-tuning
• Develop simulation environments for RL evaluations
• Create multi-agent reinforcement systems
• Optimize training stability and efficiency
Requir...
            
