Playson is seeking a talented Site Reliability Engineer/DevOps Engineer to join our dynamic Firex squad. As a Site Reliability Engineer, you will be responsible for managing day-to-day alerts, system checks, and issue escalation, as well as providing 24/7 on-call support for critical SaaS events.
Requirements
- Strong experience with incident analysis (root cause analysis, postmortems)
- Proficiency in Kubernetes (deployment, scaling, troubleshooting)
- Familiarity with AWS, Terraform, Docker, CI/CD
- Experience with monitoring tools like Datadog, Prometheus, and Grafana, and with logging solutions like Elasticsearch, Logstash, and Kibana (ELK Stack) or AWS CloudWatch
- Strong understanding of networking concepts and protocols
- Proficiency in at least one scripting or programming language (e.g., Python, JavaScript/Node.js, Go)
- Experience with GitOps continuous delivery tools like FluxCD/ArgoCD
- Proficiency in Git or other version control systems
- Familiarity with incident response and management tools like PagerDuty, Opsgenie, or VictorOps
- Ownership, proactiveness, persistence, and passion for maintaining a high-traffic online platform
Benefits
- Quarterly Bonuses based on transparent and systematic evaluation
- Flexible Work Schedule
- Remote Work Option for Enhanced Flexibility
- Comprehensive Medical Insurance for you and your significant other
- Financial Support for Life Events
- Unlimited Paid Vacation
- Unlimited Paid Sick Leave
- Reimbursement for professional development courses and training