Senior Site Reliability Engineer
Core42 · Abu Dhabi
Job description
About the role
Core42 is seeking a Senior Site Reliability Engineer to design, implement and operate scalable, reliable and secure infrastructure for large‑scale AI and HPC workloads. You will work closely with engineering, product and operations teams to automate processes, enforce SRE best practices and ensure high availability across globally distributed platforms.
Key responsibilities
- Design, build and maintain robust CI/CD pipelines using GitLab CI, Azure DevOps or Jenkins.
- Operate and optimise Kubernetes clusters for performance, scalability and resilience.
- Develop infrastructure as code with Terraform, Helm and Ansible to automate provisioning and configuration.
- Implement observability solutions using Prometheus, VictoriaMetrics, Grafana and ELK/EFK stacks.
- Lead incident response, root‑cause analysis and post‑mortems to continuously improve reliability.
- Define and enforce SRE practices including SLAs, SLOs and error budgets.
- Build and maintain logging, alerting and tracing systems for proactive issue detection.
- Ensure security best practices and compliance across pipelines and runtime environments.
- Collaborate cross‑functionally and mentor junior engineers.
- Participate in on‑call rotations to support critical platform services.
Required profile
- Bachelor’s or Master’s degree in Computer Science, Engineering or a related technical field.
- Minimum 5 years of experience in DevOps, Site Reliability Engineering or platform engineering in production environments.
Required skills
- GitLab CI, Azure DevOps, Jenkins
- Kubernetes cluster management
- Terraform, Helm, Ansible
- Prometheus, VictoriaMetrics, Grafana, ELK/EFK
- Logging, alerting and tracing tools
- Security and compliance best practices
Published 13 hours ago
Expires 1 month from now