Job Description
Drive cloud reliability as a Site Reliability Engineer at Intact. This hybrid role focuses on Azure, AWS, and GCP to enhance operational excellence and incident management across platforms.
Join the Intelligent Operations Department's SRE & Resiliency team. As a hands-on SRE, you’ll conduct investigations, implement observability tools like OpenTelemetry and Dynatrace, and lead proactive reliability improvements. Collaboration with various support teams will be key to driving system resilience and business-aligned reporting.
Key Responsibilities:
• Lead high-severity investigations and root cause analysis
• Implement observability tools and anomaly detection
• Build auto-healing and reliability tools
• Define user-centric SLIs/SLOs and drive reporting
• Upskill support teams and promote resilience culture
Requirements:
• 8+ years in SRE/Platform/Infrastructure roles
• Proficiency in Dynatrace, OpenTelemetr...
Join the Intelligent Operations Department's SRE & Resiliency team. As a hands-on SRE, you’ll conduct investigations, implement observability tools like OpenTelemetry and Dynatrace, and lead proactive reliability improvements. Collaboration with various support teams will be key to driving system resilience and business-aligned reporting.
Key Responsibilities:
• Lead high-severity investigations and root cause analysis
• Implement observability tools and anomaly detection
• Build auto-healing and reliability tools
• Define user-centric SLIs/SLOs and drive reporting
• Upskill support teams and promote resilience culture
Requirements:
• 8+ years in SRE/Platform/Infrastructure roles
• Proficiency in Dynatrace, OpenTelemetr...