Who ❤️ PJ →

Full Search

This job listing has expired and may no longer be relevant!
6 Jun 2023

Full-Time Senior Data Engineer

GSK – Posted by kylegsk71 San Francisco, California, United States

Job Description

This role is responsible for architecting, building, and maintaining a world-beating Knowledge Graph Platform.  The Senior Knowledge Graph Engineer is a leading technical contributor who can consistently design, scope, and deliver data projects. They should be deeply familiar with the languages and tools of modern data engineering (e.g., Scala, Spark, Kafka, …), and engaged with the open-source community surrounding them. They support the Director of Knowledge Graph Platform Engineering in building a strong culture of accountability and ownership, as well as model best-in-class engineering practices (e.g., testing, code reviews, documentation, and DevOps-forward ways of working). They work in harmony with teammates and in close partnership with Product, Platform, and user groups such as AI/ML engineers to ensure the right data orchestration and robustness of our services.

 

 

Key responsibilities for the Sr. Data Engineer include:

  • Designs, builds, and operates data tools, services, workflows, etc on petabytes of data on Cloud by leveraging modern data engineering tools and orchestration tools. 
  • Measure, optimize, and architect high performance systems, especially, evaluate and optimize Knowledge Graph data storage and query performance.
  • Resolve customer-facing issues and fix bugs. Debug and resolve complex issues related to knowledge graph construction and management in a timely manner.
  • Stay up-to-date with emerging trends and technologies in knowledge graph and streaming data processing.
  • Collaborate with cross-functional teams (product, platform, Quality, and DevOps) to translate business problems into technical solutions that leverage the knowledge graph. 
  • Fully versed in coding best practices and ways of working, participates in code reviews and provide constructive feedback to improve code quality and team’s standards.
  • Design, debug, and scale core query language engine.
  • Deploy to GCP using CI/CD best practices, monitor and manage GCP resources.
  • Develop secure, auditable, and performant graph query services for consumers such as AI/ML and other research teams, and integrate the query services into data catalogue, governance, and security services.

 

 Why you?

 

Basic Qualifications:

We are looking for professionals with these required skills to achieve our goals:

  • Bachelor’s Degree in Computer Science, Software Engineering or related discipline.
  • Minimum 1 year of Cloud experience e.g., AWS, Google Cloud, Azure, Kubernetes
  • Minimum 1 year of experience with Spark and Scala

 

 

Preferred Qualifications:

If you have the following characteristics, it would be a plus:

  • Masters or PHD in CS, Software Engineering or related discipline.
  • Deep experience with industry standard big data technologies e.g., Spark, BigQuery, Kafka, HDFS, Delta Lake.
  • Deep experience using Scala, including toolchain, documentation, testing, and operations / observability.
  • Strong functional programming background. Experience with parser combinators, relational algebra.
  • Experience with linked data, especially RDF.
  • Experience with various data storage solutions (SQL, key-value, column, document, graph stores). 
  • Experience with data modelling, particularly involving the use of semantic data and ontologies/taxonomies/business data.
  • Deep experience utilizing infrastructure as code technologies to produce repeatable architectures e.g., Terraform, Cloud templates
  • Experience delivering microservices utilizing an event driven architecture
  • Application experience of CI/CD implementations using git and a common CI/CD stack: e.g., Jenkins, CircleCI, GitLab, Azure DevOps
  • Experience in modern software development tools / ways of working: e.g., git/GitHub, DevOps tools, metrics / monitoring
Share this role online (there may be a referral fee*)

How to Apply

Please apply directly for the Senior Data Engineer position here: Senior Data Engineer in Multiple Locations | GSK Careers

Job Categories: LGBT. Job Types: Full-Time. Salaries: 100,000 and above.

64 total views, 0 today

Apply for this Job