Job Description
Elevate reliability standards at Confluent as a Senior Site Reliability Engineer. Focus on proactive reliability improvements within a multi-cloud streaming platform while managing incident response practices.
In this senior role, you'll devote 75% of your time to engineering, improving tooling, analyzing failure patterns, and designing solutions. The remaining 25% involves teaching and coordinating incident response enhancements, coaching teams, and driving organizational changes in reliability practices. Your expertise will help minimize incidents across Confluent Cloud's dynamic environment.
Key Responsibilities:
• Analyze failure patterns for proactive reliability design
• Own configuration of Rootly and integrations with key tools
• Define and maintain SLO/SLA frameworks
• Edit customer-facing incident documents for quality
• Develop training programs and coach teams through post-mortems
Requirements:
• 10+...
In this senior role, you'll devote 75% of your time to engineering, improving tooling, analyzing failure patterns, and designing solutions. The remaining 25% involves teaching and coordinating incident response enhancements, coaching teams, and driving organizational changes in reliability practices. Your expertise will help minimize incidents across Confluent Cloud's dynamic environment.
Key Responsibilities:
• Analyze failure patterns for proactive reliability design
• Own configuration of Rootly and integrations with key tools
• Define and maintain SLO/SLA frameworks
• Edit customer-facing incident documents for quality
• Develop training programs and coach teams through post-mortems
Requirements:
• 10+...