Site Reliability Engineer Leader
Product: Global Platform Engineering.
Your role:
• Supervise a team of Site Reliability Engineers
• Report metrics on application performance and incidents
• Act proactively to prevent infrastructure and application failures and respond promptly when they occur
• Build and automate failover and recovery workflows
• Implement an observability and monitoring stack for the infrastructure and application layers
• Improve high availability and scalability for existing solutions
• Manage application downtime by defining and measuring SLAs and error budgets
• Design backup and recovery strategies
Your background:
• You have an Information Technology degree or similar
• You have hands-on experience with the AWS cloud
• You know CI/CD automation tools (Jenkins, GitHub, or similar)
• You know how to automate and script cloud workloads with Infrastructure as Code (IaC) and Configuration as Code (CaC) techniques (Terraform, CloudFormation, Ansible, Helm)
• You know monitoring tools (Datadog, Prometheus, Grafana, Splunk, or similar)
Locations: Poland
Remote status: Fully Remote
About Infotree Global Solutions
At Infotree, meeting your career needs is a top priority. Client satisfaction depends largely on the resources we can provide, and we take pride in our delivery. Our supportive team gives quality people a chance to grow and challenge themselves in their roles. As a result, we have placed many employees in positions that have grown into lifelong careers.
Our dedicated recruiters and consultant care representatives are committed to your success and well-being. Check out our open roles to get started.
Infotree Poland Sp. z o.o. is part of Infotree Global Solutions. Agency number: 15970.