
Site Reliability Engineer III
- Galway
- Permanent
- Full-time
- Utilise technologies and languages like Terraform, Helm, Python, Go, container orchestration services including Docker and Kubernetes, and a variety of GCP services to drive service reliability.
- Implement software development practices to build observability, alerting, tracing, automation, and self-healing capabilities to maintain the highest levels of platform availability.
- Coordinate end-to-end across platforms: support, identify, respond to, and report issues, then escalate promptly to the respective teams for remediation.
- Develop maintenance and operations automation through CI/CD.
- Passion for CI/CD: Demonstrated enthusiasm for developing and improving Continuous Integration/Continuous Deployment processes.
- Orchestration Technology: 2 years of hands-on experience with orchestration tools such as Kubernetes and/or Helm.
- Coding and Scripting: Proficiency in Terraform, Ansible, or Helm, with an understanding of CI/CD tools like GitHub, GitLab, and Artifactory.
- Monitoring Solutions Expertise: Practical experience with monitoring, alerting, and logging tools, including Splunk and GCP Monitoring.
- Production Environment Support: 2 years of experience in maintaining production environments across cloud platforms like GCP, AWS, or Azure.
- Software Development: Some experience in developing and delivering products using programming languages such as Bash, Python, Golang, or Java is desirable.
- System Optimisation: Track record of enhancing existing systems, building robust infrastructure, and automating processes to reduce workload.
- Agile Methodology: Experience working within Agile teams, adhering to sprint cadences and delivery timelines.
- Problem-Solving Skills: Ability to effectively triage issues and conduct root-cause analyses when necessary.
- Team Collaboration: Strong team player with the ability to work collaboratively within diverse groups.
- On-Call Duties: Willingness to participate in an on-call rotation, troubleshoot production issues, perform Root Cause Analyses, and share insights with the Engineering and Operations teams.
- Generous Paid Time Off including annual leave, paid bereavement, and family sick leave - every employee needs time to take care of themselves and their family.
- Universal Paid Parental Leave for both parents + flexible return to work program - because we know your newest family member(s) deserve your undivided attention.
- Paid Sabbatical after 5 years of continuous service - unplug, recharge, and have some fun.
- Competitive Stakeholder Pension - taking care of your future.
- Comprehensive health, dental, and dependents' care from day 1 of employment - your health comes first and we've got you covered.
- Company-wide events and outings - our team spirit is no joke - we know how to have fun!
- Hybrid Work - This is a hybrid role based in our Galway, Ireland, office 2-3 days per week.