Lead Site Reliability Engineer
OpenText
- Cork
- Permanent
- Full-time
- Strong hands-on experience with GitLab, including creating, maintaining, and optimizing CI/CD pipelines to reliably deploy code to production as well as test and staging environments.
- Strong hands‑on experience with AWS infrastructure, covering provisioning, monitoring, scaling, and troubleshooting cloud‑native systems in production environments.
- Collaborate with development teams to promote cloud-native application design, integration patterns, and automation opportunities that enhance system reliability, monitoring, and self-healing capabilities.
- Stay up to date with emerging cloud technologies, industry trends, and SRE methodologies, and provide recommendations for their adoption to enhance our cloud infrastructure and reliability practices.
- Hands-on involvement in incident response, root cause analysis, and post-mortem improvements
- Perform scheduled maintenance activities, upgrades, and deployments, occasionally during weekends or off-hours
- At least 8 years of relevant professional experience
- Experience with GitLab and CI/CD tools and practices. Strong DevOps mindset and hands‑on experience, driving automation, collaboration, and continuous improvement across development and operations.
- Proven ability to drive operational change, with strong resilience and accountability; capable of assuming end‑to‑end responsibility for the production environment and embedding it into established operational processes.
- A completed degree in computer science, business computer science, or a natural science with a computer science component, or comparable training
- Experience with automation, scripting, and/or configuration tools such as PowerShell (PowerCLI), Python, Ansible, or Salt
- Strong troubleshooting and problem-solving skills, keen attention to detail, and the ability to maintain detailed documentation.
- Experience in operating and optimizing AWS environments, including automation of infrastructure processes and monitoring of cloud resources
- Understanding of security and network protocols such as SFTP, VPN, HTTPS, and SSH