Cloud Engineer — Job Description
Role summary
Designs, deploys, and maintains cloud infrastructure and services to support applications and platforms, ensuring scalability, security, cost-efficiency, and reliability.
Key responsibilities
- Design, implement, and operate cloud architectures (IaaS/PaaS) on AWS, GCP, or Azure.
- Build and maintain automated provisioning using Infrastructure as Code (Terraform, CloudFormation, Pulumi).
- Implement CI/CD pipelines and deployment automation (GitHub Actions, Jenkins, GitLab CI).
- Manage networking, VPCs, load balancers, DNS, and secure connectivity (VPN, Direct Connect).
- Configure and operate compute, storage, database, and serverless services; optimize for performance and cost.
- Implement observability: monitoring, logging, tracing, and alerting (Prometheus, Grafana, ELK, Cloud-native tools).
- Enforce security best practices: IAM, key/secret management, encryption, vulnerability scanning, and compliance controls.
- Perform backup, disaster recovery planning, and capacity/cost forecasting.
- Troubleshoot production incidents, participate in on-call rotation, and run post-incident reviews.
- Collaborate with developers, SREs, and security teams to design resilient, testable systems and platform services.
- Mentor junior engineers and contribute to runbooks, documentation, and automation libraries.
Qualifications
- Education: BS in Computer Science, Engineering, or equivalent experience.
- Experience: 3–6+ years in cloud engineering or related infrastructure roles; senior roles 5+ years.
- Cloud platforms: Hands-on with AWS, GCP, or Azure services and console/CLI.
- IaC & automation: Strong with Terraform/CloudFormation/Pulumi and configuration management (Ansible, Helm).
- Containers & orchestration: Docker and Kubernetes (EKS/GKE/AKS) experience.
- Scripting/programming: Proficient in Python, Bash, or Go for automation.
- Databases & storage: Familiarity with managed DBs, object/block storage, and caching.
- Observability & tooling: Experience with monitoring, logging, and tracing solutions.
- Security & networking: Solid understanding of IAM, network design, and cloud security practices.
- Soft skills: Problem-solving, communication, and collaboration across teams.
Working conditions
- Cross-functional collaboration; may include on-call duties and occasional after-hours incident work.
- Fast-paced environment with project-driven timelines and cloud migrations.
Performance metrics
- System availability and SLA/SLO adherence.
- Deployment lead time and change failure rate.
- Mean time to recover (MTTR) and incident frequency.
- Infrastructure cost per service and automation coverage.
Pay: $55.39 – $75.75 per hour
Benefits:
- Dental insurance
- Health insurance
- Salary packaging
Work Location: In person