to join our team. The ideal candidate will be responsible for maintaining the reliability, performance, and scalability of our cloud-based applications and infrastructure. You will work closely with development, DevOps, and security teams to ensure seamless deployment, monitoring, and automation in an Azure environment.
Key Responsibilities
Ensure System Reliability
- Maintain, optimize, and enhance the availability and performance of Azure cloud services.
Infrastructure as Code (IaC)
- Design and implement automated infrastructure using Terraform, ARM templates, or Bicep.
Monitoring & Incident Response
- Develop monitoring solutions (Azure Monitor, App Insights, Log Analytics) and respond to incidents proactively.
CI/CD & Automation
- Improve and maintain deployment pipelines using Azure DevOps, GitHub Actions, or Jenkins.
Performance Optimization
- Analyze system performance metrics and optimize cloud resources for efficiency.
Security & Compliance
- Implement best security practices, including identity management, role-based access control (RBAC), and threat detection.
Disaster Recovery & Backup
- Establish and test backup and recovery strategies using Azure Backup and Site Recovery.
Collaboration & Documentation
- Work closely with Software Engineers and Devops Engineers while maintaining detailed documentation of systems and processes.
Key Skills & Qualifications
5+ years of experience
in Site Reliability Engineering, Cloud Engineering, or DevOps.
Proficiency in Azure services
, including Virtual Machines, AKS, Azure Functions, App Services, Storage Accounts, and Networking.
Experience with
Infrastructure as Code (IaC)
tools such as Terraform, ARM, or Bicep.
Expertise in
CI/CD pipelines
with Azure DevOps, GitHub Actions, or Jenkins.
Solid understanding of
monitoring tools
(Azure Monitor, Prometheus, Grafana, or New Relic).
Knowledge of
containerization & orchestration
(Docker, Kubernetes, Helm).
Familiarity with
security best practices
in cloud environments (RBAC, NSGs, Key Vault, Defender for Cloud).
Experience with
troubleshooting and incident management
in production environments.
Strong understanding of SRE processes and best practices for system reliability, availability, and performance.
Experience with hybrid cloud environments and multi-cloud architectures.
Hands-on experience in managing high-availability and scalable applications.
Experience with payment gateway projects or similar high-transaction systems is preferred.
Additional knowledge in advanced monitoring techniques, performance tuning, and capacity planning is a plus.
Excellent analytical, problem-solving, and communication skills.
Azure certifications (AZ-104, AZ-400, AZ-305, or equivalent) is an advantage.
Job Types: Full-time, Permanent
Application Question(s):
We need someone to who can start ASAP. Can you start immediately?
Experience:
as Azuure Site Reliability Engineer: 5 years (Required)
Azure Services and CI/CD Pipeline: 5 years (Required)
License/Certification:
* SRE Certifications (Required)
Beware of fraud agents! do not pay money to get a job
MNCJobsGulf.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.
Job Detail
Job Id
JD1824303
Industry
Not mentioned
Total Positions
1
Job Type:
Contract
Salary:
Not mentioned
Employment Status
Permanent
Job Location
Dubai, DU, AE, United Arab Emirates
Education
Not mentioned
Apply For This Job
Beware of fraud agents! do not pay money to get a job
MNCJobsGulf.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.