Azure Site Reliability Engineer (sre)

Dubai, DU, AE, United Arab Emirates

Job Description

Role Overview



We are looking for an

Azure Site Reliability Engineer (SRE)

with

5+ years of experience

to join our team. The ideal candidate will be responsible for maintaining the reliability, performance, and scalability of our cloud-based applications and infrastructure. You will work closely with development, DevOps, and security teams to ensure seamless deployment, monitoring, and automation in an Azure environment.

Key Responsibilities



Ensure System Reliability

- Maintain, optimize, and enhance the availability and performance of Azure cloud services.

Infrastructure as Code (IaC)

- Design and implement automated infrastructure using Terraform, ARM templates, or Bicep.

Monitoring & Incident Response

- Develop monitoring solutions (Azure Monitor, App Insights, Log Analytics) and respond to incidents proactively.

CI/CD & Automation

- Improve and maintain deployment pipelines using Azure DevOps, GitHub Actions, or Jenkins.

Performance Optimization

- Analyze system performance metrics and optimize cloud resources for efficiency.

Security & Compliance

- Implement best security practices, including identity management, role-based access control (RBAC), and threat detection.

Disaster Recovery & Backup

- Establish and test backup and recovery strategies using Azure Backup and Site Recovery.

Collaboration & Documentation

- Work closely with Software Engineers and Devops Engineers while maintaining detailed documentation of systems and processes.

Key Skills & Qualifications



5+ years of experience

in Site Reliability Engineering, Cloud Engineering, or DevOps.

Proficiency in Azure services

, including Virtual Machines, AKS, Azure Functions, App Services, Storage Accounts, and Networking. Experience with

Infrastructure as Code (IaC)

tools such as Terraform, ARM, or Bicep. Expertise in

CI/CD pipelines

with Azure DevOps, GitHub Actions, or Jenkins. Solid understanding of

monitoring tools

(Azure Monitor, Prometheus, Grafana, or New Relic). Knowledge of

containerization & orchestration

(Docker, Kubernetes, Helm). Familiarity with

security best practices

in cloud environments (RBAC, NSGs, Key Vault, Defender for Cloud). Experience with

troubleshooting and incident management

in production environments. Strong understanding of SRE processes and best practices for system reliability, availability, and performance. Experience with hybrid cloud environments and multi-cloud architectures. Hands-on experience in managing high-availability and scalable applications. Experience with payment gateway projects or similar high-transaction systems is preferred. Additional knowledge in advanced monitoring techniques, performance tuning, and capacity planning is a plus. Excellent analytical, problem-solving, and communication skills. Azure certifications (AZ-104, AZ-400, AZ-305, or equivalent) is an advantage.
Job Types: Full-time, Permanent

Application Question(s):

We need someone to who can start ASAP. Can you start immediately?
Experience:

as Azuure Site Reliability Engineer: 5 years (Required) Azure Services and CI/CD Pipeline: 5 years (Required)
License/Certification:

* SRE Certifications (Required)

Beware of fraud agents! do not pay money to get a job

MNCJobsGulf.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Job Detail

  • Job Id
    JD1824303
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Contract
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Dubai, DU, AE, United Arab Emirates
  • Education
    Not mentioned