We are looking for an experienced and motivated DevOps Engineer to join our fast-paced engineering team. You will be responsible for building and maintaining secure, scalable, and resilient infrastructure for cloud-native applications across AWS and Azure platforms.
In This Role, You Will
- Design and implement highly available and scalable architectures for PaaS/SaaS solutions that are operationally mature and cost-effective
- Drive operational excellence by defining SLOs/SLIs, setting up observability dashboards, and refining incident response workflows
- Lead and support root cause analysis (RCA) and post-mortem processes with a mindset of continuous improvement
- Collaborate with development and QA teams to implement and optimize CI/CD pipelines and automated deployment strategies
- Promote best practices for infrastructure-as-code (IaC), change management, security, and cloud governance
- Automate repetitive tasks and standard operating procedures to reduce toil and increase system reliability
- Contribute to architectural discussions, platform roadmaps, and incident-preparedness reviews in cross-functional teams
- Champion innovation and be hands-on with modern DevOps tools, cloud-native services, and AI-assisted operations where applicable
- AWS: 1+ years of experience with a strong focus on EKS (Kubernetes), RDS Serverless, S3, IAM, VPC, and integration with Atlas MongoDB
- Experience with modern AWS tools and add-ons: Karpenter (cluster autoscaling), KEDA (event-driven autoscaling), and service meshes like Linkerd or Istio
- Azure: Hands-on experience with Azure Cosmos DB, Azure Search, and App Service Plans
- Strong scripting and automation capabilities (e.g., Bash, Python, PowerShell)
- Solid experience with CI/CD tools (GitHub Actions, Jenkins, GitLab CI, or AWS CodePipeline)
- In-depth understanding of Linux OS administration, systemd, and package management
- Strong knowledge in networking concepts (TCP/IP, DNS, routing, firewall rules, security groups)
- Experience with monitoring and observability tools (Prometheus, Grafana, CloudWatch, Azure Monitor, NewRelic, etc.)
- Good grasp of containerization, Docker image optimization, and Kubernetes workload management
- Experience with patch management, release strategies (blue/green, canary), and infrastructure governance
- Excellent troubleshooting, collaboration, and documentation skills
- Bachelor’s degree in Information Technology, Software Engineering, or a related field
- 2–5 years of hands-on experience in DevOps, Site Reliability Engineering, or Infrastructure Engineering
- AWS
- EKS
- S#
- RDS Serverless
- IAM
- VPC
- Atlas MongoDB
- Azure
Generating Apply Link...