Our data infrastructure Site Reliability Engineering (SRE) team is a pioneer in innovation, designing, building, and managing large-scale, highly distributed systems. We're currently building global teams around the world and are looking for professionals to join us on this transformative journey.
Requirements
- Participate in and enhance the complete service lifecycle, from inception and design, through development, capacity planning, launch reviews, deployment, operation, and refinement.
- Design and implement software platforms and monitoring frameworks to govern service-oriented architecture (SOA) efficiently, automatically, and intelligently.
- Develop and manage components of cloud-managed data infrastructure, encompassing technologies such as Kubernetes, Redis, MySQL, Flink, and more.
- Establish sustainable mechanisms for scaling systems, such as automation, to drive enhancements in reliability, efficiency, and velocity.
- Provide sustainable user support, manage incident responses, and conduct blameless postmortems as part of our ongoing efforts to improve our systems.
Benefits
- Health insurance
- Paid time off
- Retirement plan
- Stock options
- Other benefits not specified