Principal Software Engineer - OpenShift AI Model Training
Red Hat
- Ireland
- Permanent
- Full-time
- Lead Red Hat’s participation in machine learning-related upstream communities to ensure the technologies work on OpenShift and can be integrated with Red Hat OpenShift AI (RHOAI)
- Architect and lead implementation of scalable open source solutions for Data Scientists to leverage distributed computing capabilities to train their Machine Learning models, running on OpenShift
- Act as an MLOps subject matter expert (SME) within Red Hat by supporting customer-facing discussions, presenting at technical conferences, and evangelizing OpenShift AI within the internal community of practice
- Architect and design new features for open source communities such as
- Mentor, influence, and coach a distributed team of engineers
- Present at OpenShift/Kubernetes and AI/ML-related technology conferences, and internally within the AI/ML communities of practice
- An existing contributor to one or more MLOps open source projects such as Ray/KubeRay, Kubeflow, PyTorch,
- Experience training and tuning ML models using tools like Ray,
- Advanced knowledge of, and development experience in, Go or Python
- Excellent system understanding and troubleshooting capabilities
- Solid innovation skills and a passion for technology
- Technical leadership acumen in a global team environment, and mentorship experience
- Passion for writing and maintaining reliable code
- Excellent written and verbal communication skills; good English language skills
- Bachelor's degree in statistics, mathematics, computer science, operations research, or a related quantitative field, or equivalent expertise; Master’s or PhD
- Experience in engineering, consulting or another field related to distributed model training or data processing in a customer environment or supporting a data science team
- Highly experienced in OpenShift