The Onyx Research Data Tech organization is GSK’s Research data ecosystem which has the capability to bring together, analyze, and power the exploration of data at scale. We partner with scientists across GSK to define and understand their challenges and develop tailored solutions that meet their needs. The goal is to ensure scientists have the right data and insights when they need it to give them a better starting point for and accelerate medical discovery.
Requirements
- Bachelor’s, Master’s or PhD degree in Computer Science, Software Engineering, or related discipline.
- 6+ years of experience with Bachelor's, 4+ Years of experience with Masters and 2+ years of experience with PhD using specialized knowledge in cloud computing, scalable parallel computing paradigms, software engineering, and CI/CD.
- 2 + years of experience in AIML engineering, including large-scale model training and production deployment.
- Deep experience using at least one interpreted and one compiled common industry programming language: e.g., Python, C/C++, Scala, Java, including toolchains for documentation, testing, and operations / observability
- Deep experience with application performance tuning and optimization, including in parallel and distributed computing paradigms and communication libraries such as MPI, OpenMP, Gloo, including deep understanding of the underlying systems (hardware, networks, storage) and their impact on application performance
- Deep cloud expertise (e.g., AWS, Google Cloud, Azure), including infrastructure-as-code tools (Terraform, Ansible, Packer,...) and scalable cloud compute technologies, such as Google Batch and Vertex AI
- Expert understanding of AIML training optimization, including distributed multi-node training best practices and associated tools and libraries as well as hands-on practical experience in accelerating training jobs
- Understanding of ML model deployment strategies, including agent systems as well as scalable LLM model inference systems deployed in multi-GPU, multi-node environments
- Experience with CI/CD implementations using git and a common CI/CD stack (e.g., Azure DevOps, CloudBuild, Jenkins, CircleCI, GitLab)
- Experience with Docker, Kubernetes, and the larger CNCF ecosystem including experience with application deployment tools such as Helm
- Experience with low level application builds tools (make, CMake) and understanding of optimization at the build and compile level
- Demonstrated excellence with agile software development environments using tools like Jira and Confluence
Benefits
- health care and other insurance benefits (for employee and family)
- retirement benefits
- paid holidays
- vacation
- paid caregiver/parental and medical leave