Job was saved successfully.
Job was removed from Saved Jobs.

Job Details

GlaxoSmithKline (GSK)

Senior Scientific Knowledge Engineer



Full Time

On Site


Collegeville, Pennsylvania, United States

Site Name: USA - Pennsylvania - Upper Providence, Cambridge 300 Technology Square, London The Stanley Building
Posted Date: Apr 26 2023

The Onyx Research Data Platform organization represents a major investment by GSK R&D and Digital & Tech, designed to deliver a step- change in our ability to leverage data, knowledge, and prediction to find new medicines. We are a full-stack shop consisting of product and portfolio leadership, data engineering, infrastructure and DevOps, data / metadata / knowledge platforms, and AI/ML and analysis platforms, all geared toward:

  • Building a next-generation, metadata- and automation-driven data experience for GSK’s scientists, engineers, and decision-makers, increasing productivity and reducing time spent on “data mechanics”

  • Providing best-in-class AI/ML and data analysis environments to accelerate our predictive capabilities and attract top-tier talent

  • Aggressively engineering our data at scale, as one unified asset, to unlock the value of our unique collection of data and predictions in real-time

The Scientific Knowledge Engineering team, which sits within the Onyx Product Management organization, is responsible for the data modeling, ontology definition and management, vocabulary mapping, and other key metadata activities that ensure Onyx platforms and data assets speak scientific language. They are a core factor in delivering the GSK R&D Knowledge Graph – the semantic layer that connects all of our data and metadata systems – as well as the core metadata experiences that ultimately allow us to build products and services that both delight our customers and enable impressive automation and intelligence.

* For this specific role, GSK is looking for biologists that have demonstrated experience working with genomics data and business analysis.

This role is responsible for maximizing the value of our data assets over a lifetime to bring purpose to data by acting as translators of highly technical information from domain experts into an appropriate data model – complete with significant ontology and vocabulary -- that can be utilized to effectively structure and index the data. Specifically working with Product managers and R&D subject matter expertise to define the language (data models, ontology, standards, etc.) of science into data products by acting as the voice of “Knowledgebase” and interoperability/value of asset. This includes responsibility for the understanding and translation of computational methods back through the data chain to maximise the quality and speed of data from source to drive experimental multi-variant analysis and data driven decision-making.

  • Definition of schemas and data models of scientific information required for the creation of value adding data products. This includes accountability for the quality control and mapping specifications to be industrialized by data engineering and maintained in platform provisioned tooling.

  • Accountable for the quality control (through validation and verification) of mapping specifications to be industrialised by data engineering and maintained in platform provisioned tooling – e.g., models, schemas, controlled vocab.

  • Working with Product managers/engineers confidently convert business need into defined deliverable business requirements to enable the integration of large-scale biology data to predict, model, and stabilize therapeutically relevant protein complex and antigen conformations for drug and vaccine discovery.

  • Collaborate with external groups to align GSK data standards with industry/ academic ontologies ensuring that data standards are defined with usage/analytics in mind. They may also provide data source profiling and advisory consultancy to R&D outside of Onyx.

  • Support effective ingestion of data by GSK through understanding the entry requirements required by platform engineering teams and ensuring that the “barrier for entry” is met e.g. Scientific information has the appropriate metadata to be indexed, structured, integrated and standardised as needed. This may require articulation of GSK engineering standards and metadata information needs to third parties to ensure efficient and automate ingestion at scale.

  • Provides bespoke subject matter expertise for R&D data to translate deep science into data for actionable insights

Why You?

Basic Qualifications

  • Bachelor’s degree (Bioinformatics, Biomedical Science, Biomedical Engineering, Molecular Biology, or Computer Science)

  • Biologist work experience

  • 5+ years of experience

  • Working experience querying relational databases - SQL

  • Experience with industry standard data management / metadata platforms e.g. Collibra, Datahub, Datum, Informatica

  • Data modeling, analysis, profiling (working experience with any data quality tool, SAS, Ataccama, Informatica Data Quality, Talend, OpenRefine)

  • Experience with industry standard tools for building data protocols e.g. Avro, Protocol Buffers, Thrift

  • Experience with at least one programming language – e.g. Python – for scripting vocabulary mappings, building data models, etc.

Preferred Qualifications

  • Masters or PhD

  • Awareness of RDF, Ontology

  • Demonstrated comfort operating and leading across organizational boundaries a matrixed team

  • Membership of industry committee, board, consortium, or data standards group

  • Participation in peer-reviewed research (both publication and review), particularly in genetics and/or bioinformatics

  • Specific experience with Knowledge Graph efforts



Why GSK?

GSK is a global biopharma company with a special purpose – to unite science, technology and talent to get ahead of disease together – so we can positively impact the health of billions of people and deliver stronger, more sustainable shareholder returns – as an organisation where people can thrive. Getting ahead means preventing disease as well as treating it, and we aim to positively impact the health of 2.5 billion people by the end of 2030.

Our success absolutely depends on our people. While getting ahead of disease together is about our ambition for patients and shareholders, it’s also about making GSK a place where people can thrive. We want GSK to be a workplace where everyone can feel a sense of belonging and thrive as set out in our Equal and Inclusive Treatment of Employees policy. We’re committed to being more proactive at all levels so that our workforce reflects the communities we work and hire in, and our GSK leadership reflects our GSK workforce.

If you require an accommodation or other assistance to apply for a job at GSK, please contact the GSK Service Centre at 1-[Register to View] (US Toll Free) or +1 [Register to View] (outside US).

GSK is an Equal Opportunity Employer and, in the US, we adhere to Affirmative Action principles. This ensures that all qualified applicants will receive equal consideration for employment without regard to race, color, national origin, religion, sex, pregnancy, marital status, sexual orientation, gender identity/expression, age, disability, genetic information, military service, covered/protected veteran status or any other federal, state or local protected class.

Important notice to Employment businesses/ Agencies

GSK does not accept referrals from employment businesses and/or employment agencies in respect of the vacancies posted on this site. All employment businesses/agencies are required to contact GSK's commercial and general procurement/human resources department to obtain prior written authorization before referring any candidates to GSK. The obtaining of prior written authorization is a condition precedent to any agreement (verbal or written) between the employment business/ agency and GSK. In the absence of such written authorization being obtained any actions undertaken by the employment business/agency shall be deemed to have been performed without the consent or contractual agreement of GSK. GSK shall therefore not be liable for any fees arising from such actions or any fees arising from any referrals by employment businesses/agencies in respect of the vacancies posted on this site.

Please note that if you are a US Licensed Healthcare Professional or Healthcare Professional as defined by the laws of the state issuing your license, GSK may be required to capture and report expenses GSK incurs, on your behalf, in the event you are afforded an interview for employment. This capture of applicable transfers of value is necessary to ensure GSK’s compliance to all federal and state US Transparency requirements. For more information, please visit GSK’s Transparency Reporting [Register to View] site.