We are looking for a Principal Site Reliability Engineer - Cloud to design, build, secure, monitor, and maintain the cloud infrastructure for our SaaS product. The ideal candidate has a background in SaaS cloud infrastructure on Azure or AWS, or a software engineering background with SaaS cloud infrastructure experience.
Requirements
- 8+ years experience designing, building, securing, monitoring, and maintaining cloud infrastructure in Azure or AWS
- 5+ years experience creating, configuring, maintaining, and monitoring Kubernetes clusters (AKS or EKS) in cloud infrastructure
- 5+ years experience building and deploying Infrastructure as Code with Terraform or similar technology
- 5+ years experience with common cloud networking, firewall, and load balancing configuration
- 5+ years experience writing software in a modern programming language such as C#.NET or Java
- 5+ years experience creating automated deployments with tools such as Harness, Azure DevOps, Ansible, or Jenkins
- 5+ years experience implementing monitoring and alerting for production performance, availability, and scalability
- 5+ years experience supporting public, client-facing, revenue-generating systems
- Experience monitoring and preventing issues with databases and database queries (SQL) using tools like SolarWinds Database Performance Analyzer, Idera SQL Diagnostic Manager, or Redgate SQL Monitor
- Experience planning, coordinating, developing, and executing all stages of post-deployment verification test scripts
- Experience securing Windows or Linux systems in a 24x7 production environment
- BS in Computer Science or equivalent work experience
Benefits
- Competitive compensation
- Flexible workplace
- Comprehensive benefits
- Opportunities for professional growth