Promise is seeking a Cloud Site Reliability Engineer (SRE) to build, operate, and optimize the infrastructure that powers its products. The ideal candidate will have a strong background in software development, site reliability engineering, and infrastructure-as-code.
Requirements
- 4+ years of experience in Linux system administration, managing large-scale production environments
- Strong debugging skills, with experience in performance tuning, observability, and system-level troubleshooting
- Hands-on experience with cloud platforms (AWS, Azure, or GCP)
- Expertise in Infrastructure-as-Code (IaC) using Terraform or similar tools
- Proficiency in monitoring tools (e.g., Prometheus, Datadog) and health check implementation
- Experience with containers and container orchestration (Docker, Podman, Kubernetes)
- Scripting experience (Python, Bash, or equivalent) to automate infrastructure management
- Knowledge of networking and security best practices for cloud environments
Benefits
- Reasonable accommodations for qualified individuals with disabilities, pregnant individuals, and those with sincerely held religious beliefs, in accordance with applicable laws
- Promise provides equal opportunities for all applicants and employees, without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, genetic information, age, or military or veteran status
- Promise is an equal opportunity employer