The Onyx Research Data Platform organization represents a major investment by GSK R&D and Digital & Tech, designed to deliver a step change in our ability to leverage data, knowledge, and prediction to find new medicines. We are a full-stack shop consisting of product and portfolio leadership, data engineering, infrastructure and DevOps, data / metadata / knowledge platforms, and AI/ML and analysis platforms, all geared toward building an unified, automated, next-generation data experience for GSK’s scientists, engineers, and decision-makers, increasing productivity, and reducing data friction.
Requirements
- Designs, builds, and operates data tools, services, workflows, etc that deliver high value through the solution to key business problems by leveraging modern data engineering tools (e.g. Spark, Kafka, Storm,...) and orchestration tools (e.g. Google Workflow, AirFlow Composer)
- Confidently optimizes design and execution of complex solutions in data ingestion and data transformation
- Enables data products optimized for AI/ML and GenAI workloads—high throughput, observable, feature-ready and governed
- Produces well-engineered software, including appropriate automated test suites, technical documentation, and operational strategy
- Implements modular, reusable components and microservices that accelerate development and reduce operational overhead
- Provides input into the roadmaps of upstream teams (e.g. Data Platforms, DataOps, DevOps) to help improve the overall program of work
- Ensure consistent application of platform abstractions to ensure quality and consistency with respect to logging and lineage
- Fully versed in coding best practices and ways of working, and participates in code reviews and partnering to improve the team’s standards
- Adhere to QMS framework and CI/CD best practices and helps to guide improvements to them that improve ways of working
- Provides technical leadership, code reviews, architectural guidance, and mentorship to junior engineers and serves as an escalation point for complex operational issues across pipeline and data services.
Benefits
- Health care and other insurance benefits (for employee and family)
- Retirement benefits
- Paid holidays
- Vacation
- Paid caregiver/parental and medical leave