Agentic RL Researcher for Distributed AI Systems

Huawei

Markham, York Region, Canada
Full-time
Posted June 15, 2026

Job Description

A leading technology firm in York Region is seeking a Researcher specializing in reinforcement learning and multi-agent systems. The role involves designing advanced learning algorithms, building scalable training platforms, and optimizing performance on distributed systems. Ideal candidates hold an MS or PhD in a related field and have strong programming skills, particularly in Python and C++. Compensation is competitive, ranging from $106,000 to $156,000 annually depending on qualifications and experience.