Member of Technical Staff – AI Inference Platform

Lyceum

Zürich, Zürich, Switzerland

Full-time

Posted June 16, 2026

Job Description

You will make Lyceum's AI inference platform reliable, secure, and scalable, ensuring it performs under pressure as we grow to thousands of concurrent users. While others on the team expand what the platform can do, your job is to make sure it keeps working, fails gracefully, and gets faster over time.

Your focus

Scalability: Architect and implement the systems that allow our inference platform to scale to thousands of concurrent users. This includes request routing, load balancing, autoscaling, and resource scheduling across GPU clusters.
Reliability and observability: Build robust monitoring, alerting, and incident response tooling. Design for graceful degradation, automatic recovery, and minimal downtime.
Performance engineering: Profile and optimise the full inference path from request ingestion through model execution to response delivery. Identify and eliminate bottlenecks at every layer.
Infrastructure...
            
