Devops Engineer 3 (B2B SaaS)
Overview
Senior DevOps Engineer role focused on end-to-end ownership of multi-cloud infrastructure, CI/CD evolution, and Infrastructure as Code (IaC). Lead reliability, scalability, and observability improvements while mentoring junior engineers.
What You'll Do7
- 1Own the design, architecture, and reliability of cloud infrastructure across AWS, Azure, GCP, and Aliyun, supporting multi-region, global deployments.
- 2Lead the evolution of CI/CD ecosystem, optimize and refactor Jenkins-as-Code setup for scalability, performance, and developer efficiency.
- 3Drive the Infrastructure as Code (IaC) journey end-to-end, migrate existing cloud resources, alarms, and configurations fully into code with strong versioning, review, and rollback practices.
- 4Partner with engineering teams to identify and resolve performance, scalability, and reliability bottlenecks, deep dives into memory, CPU, networking, and storage constraints.
- 5Define and implement monitoring, alerting, and incident response best practices, improve MTTR, system observability, and operational readiness.
- 6Lead initiatives around cost optimization, security hardening, and capacity planning, keep infrastructure efficient and compliant as the platform scales.
- 7Act as a technical mentor for junior DevOps engineers and raise the overall DevOps maturity across teams.
Requirements7
- 15+ years in DevOps/SRE/Infrastructure roles with hands-on experience (clear scale signals like traffic, uptime, latency, infra size)
- 2B2B SaaS company experience with multi-tenant architecture or multiple production stacks (multi-env / multi-client systems)
- 3Expertise in AWS (VPC, EKS, EC2, RDS, networking), Kubernetes (EKS) at scale, designing high availability, multi-region systems
- 4Proficiency in Terraform (must-have), Helm/GitOps, and strong scripting (Python/Go/Bash)
- 5Experience with scalable CI/CD pipelines (GitHub Actions/Jenkins) and zero/low downtime deployments
- 6Knowledge of SRE principles (SLOs, SLIs, error budgets) and monitoring tools (Prometheus, Grafana, Datadog), alerting, on-call, incident management
- 7BTech in Computer Science or related fields
Who Should Apply
A senior DevOps engineer with over 5 years of experience in DevOps/SRE roles, ideally from a B2B SaaS product company with multi-tenant architecture. Must have deep expertise in multi-cloud infrastructure (AWS, Azure, GCP), Kubernetes at scale, and Infrastructure as Code using Terraform. Strong scripting skills and experience with CI/CD pipelines and observability are essential.
Salary Insight
Open to discussion
Required Skills
Application Tip
Emphasize your hands-on experience with multi-cloud environments (AWS, Azure, GCP) and Infrastructure as Code using Terraform. Provide specific examples of CI/CD pipeline optimizations and incident response improvements that reduced MTTR.