Job was saved successfully.
Job was removed from Saved Jobs.

Job Details

GlaxoSmithKline (GSK)

Principal Data Architect



Full Time

On Site


Collegeville, Pennsylvania, United States

Site Name: USA - Pennsylvania - Upper Providence, Belgium-Wavre, Poznan Grunwaldzka, UK - Hertfordshire - Stevenage
Posted Date: Mar 10 2023

Principal Data Architect

Come join us as we supercharge GSK’s data capability!

At GSK we are building a best-in-class data and prediction powered team that is ambitious for patients.

Scientific Digital and Tech’s goal is to power the discovery, development and supply of medicines and vaccines to patients.

This means new tools to discover new medicines and vaccines, predictive capability for pre-clinical research, accelerated CMC and supply chain and an improved day-to-day laboratory experience for our scientists.

Our Digital & Tech solutions will automate workflows and speed up decisions; freeing hands and releasing minds to focus on science.

As R&D enters a new era of data driven science, we are building a data engineering capability to ensure we have high quality data captured with context and aligned data models, so that the data is useable and reusable for a variety of use cases.

GSK R&D and Digital and Tech’s collective goal is to deliver business impact, including the acceleration of the discovery and development of medicines and vaccines to patients.

The R&D Digital and Tech remit has expanded over the past 2 years, and to position GSK for the future, The change will strengthen R&D Tech, to provide more strategic impact, focus, accountability, and improved decision making in the use of Digital, Data and Analytics (DDA) to strengthen the pipeline.

These range from data driven decision making, through to data sciences and AI/ML capability to enable data driven science. As a result, it is essential that Digital & Tech can surface the data from operational systems into a harmonized storage layer and controlled presentation data fabrics and mesh.

To do this there is a need to build out a Data Engineering capability that can build the required architecture, data strategy and implement an engineering approach. And a leader is required to build this capability within the Scientific Data & Tech organization in R&D Digital & Tech, and then industrialize the approaches and practices around Data architecture and delivery.

The Principal Data Architect should bring deep knowledge about the highly complex, heterogeneous world of science and deep knowledge of the distinct data types applicable to CMC, laboratories and Research and ontological standards (e.g. from SEND safety format to S88 manufacturing standards).

R&D is a highly complex and regulated environment and therefore you need to have deep knowledge of the applicable business area to be successful in this role.

You must demonstrate excellent data modelling skills to utilize conceptual and Logical data models as a foundation for building a Data Mesh/Data Fabric.

This role will provide YOU the opportunity to lead key activities to progress YOUR career. These responsibilities include some of the following:

  • Producing conceptual, logical, and physical data models to build fit for purpose data products using Data Mesh/Data Fabric architectures on Target platforms.

  • Building data architecture spanning different business areas and drawing links between problems to build common solutions.

  • Building, maintaining and governing data modelling principles, standards, and the execution of the enterprise ontologies.

  • Work with business and application/solution teams to define/implement data strategies to support key business initiatives.

  • Collaborating with different business areas to ensure consistent application of data standards including metadata and data quality to enable data governance across workstreams.

  • Providing input into the roadmaps of upstream teams (e.g., Data Platforms, Data Governance) to help improve the overall program of work.

  • Designing data architecture aligned with Enterprise-wide standards and principles to promote interoperability.

  • Adopting a security-first design that embeds robust authentication, hardened infrastructure and resilient connectivity across the Data and data platform environments.

  • Providing leadership to team members to help others get the job done right.

  • Supporting engineering teams in the adoption and creation of data Mesh best practices.

  • Maintaining best practices for data modelling and architecture on our Confluence site.

  • Pro-actively engaging in experimentation and innovation to drive relentless improvement.

  • Providing leadership, Subject Matter, and GSK expertise to architecture and engineering teams composed of GSK FTEs, strategic partners, and software vendors.

Why you?

Basic Qualifications:

We are looking for professionals with these required skills to achieve our goals:

  • Bachelor’s Degree-Computer Science, Engineering, Data Science.

  • 5+ years’ experience of Data Modelling, Metadata Management & Data Governance.

  • 5+ years' pharma experience.

  • 5+ years of hands-on relational, dimensional, and/or analytic experience (using RDBMS, dimensional, NoSQL data platform technologies, and ETL and data ingestion protocols).

  • Experience with data warehouse, data lake, and enterprise big data platforms.

  • 2 plus years of experience Data Fabric/Data Mesh Architecture Designs.

Preferred Qualifications:

If you have the following characteristics, it would be a plus:

  • Familiarity Dev Sec Ops & Data Sec Ops.

  • Demonstrated skill in delivering high-quality Conceptual, Logical and Physical Data Models.

  • Knowledge of industry standards and technology platforms.

  • Deep understanding of Pharma Research and CMC processes.

  • Excellent communication, negotiation, influencing, and stakeholder management skills.

  • Customer focus and excellent problem-solving skills.

  • Familiarity with data modelling software such as Erwin DM, ER Studio etc.

  • Good understanding of various software paradigms: domain-driven, procedural, data-driven, object-oriented, functional.

  • Demonstrable knowledge depth in more than one area of software engineering and technology.

  • Demonstrated ability to develop talent and build effective teams.

  • Experience with Big Data technologies & data structures (i.e. information management), data models or relational database design.

  • Experience in applying Metadata, Data Security and Data Quality standards to build interoperable data products.

  • Subject matter expertise in Pharma Research, CMC and scientific domains.

  • Comprehensive understanding of Industry 4.0, Cloud Purdue Model, S88 / S95 Models including IoT, Streaming Analytics and Digital Twins.

  • Experience in building domain driven contextual data products to enable decision support across multiple products and assets to drive results across R&D business operations.


GSK is a global biopharma company with a special purpose – to unite science, technology and talent to get ahead of disease together – so we can positively impact the health of billions of people and deliver stronger, more sustainable shareholder returns – as an organisation where people can thrive. Getting ahead means preventing disease as well as treating it, and we aim to impact the health of 2.5 billion people around the world in the next 10 years.

Our success absolutely depends on our people. While getting ahead of disease together is about our ambition for patients and shareholders, it’s also about making GSK a place where people can thrive. We want GSK to be a workplace where everyone can feel a sense of belonging and thrive as set out in our Equal and Inclusive Treatment of Employees policy. We’re committed to being more proactive at all levels so that our workforce reflects the communities we work and hire in, and our GSK leadership reflects our GSK workforce.

If you require an accommodation or other assistance to apply for a job at GSK, please contact the GSK Service Centre at 1-[Register to View] (US Toll Free) or +1 [Register to View] (outside US).

GSK is an Equal Opportunity Employer and, in the US, we adhere to Affirmative Action principles. This ensures that all qualified applicants will receive equal consideration for employment without regard to race, color, national origin, religion, sex, pregnancy, marital status, sexual orientation, gender identity/expression, age, disability, genetic information, military service, covered/protected veteran status or any other federal, state or local protected class.

At GSK, the health and safety of our employees are of paramount importance. As a science-led healthcare company on a mission to get ahead of disease together, we believe that supporting vaccination against COVID-19 is the single best thing we can do in the US to ensure the health and safety of our employees, complementary workers, workplaces, customers, consumers, communities, and the patients we serve.

GSK has made the decision to require all US employees to be fully vaccinated against COVID-19, where allowed by state or local law and where vaccine supply is readily available. The only exceptions to this requirement are employees who are approved for an accommodation for religious, medical or disability-related reasons.

Important notice to Employment businesses/ Agencies

GSK does not accept referrals from employment businesses and/or employment agencies in respect of the vacancies posted on this site. All employment businesses/agencies are required to contact GSK's commercial and general procurement/human resources department to obtain prior written authorization before referring any candidates to GSK. The obtaining of prior written authorization is a condition precedent to any agreement (verbal or written) between the employment business/ agency and GSK. In the absence of such written authorization being obtained any actions undertaken by the employment business/agency shall be deemed to have been performed without the consent or contractual agreement of GSK. GSK shall therefore not be liable for any fees arising from such actions or any fees arising from any referrals by employment businesses/agencies in respect of the vacancies posted on this site.

Please note that if you are a US Licensed Healthcare Professional or Healthcare Professional as defined by the laws of the state issuing your license, GSK may be required to capture and report expenses GSK incurs, on your behalf, in the event you are afforded an interview for employment. This capture of applicable transfers of value is necessary to ensure GSK’s compliance to all federal and state US Transparency requirements. For more information, please visit GSK’s Transparency Reporting [Register to View] site.