National AI Awards 2025Discover AI's trailblazers! Join us to celebrate innovation and nominate industry leaders.

Nominate & Attend

Staff NLP Data Engineer and Team Lead

GSK
Epping
8 months ago
Applications closed

Related Jobs

View all jobs

AI / ML / Data Science Strategy & Delivery Lead (SM/D)

Senior Research Scientist: Data Science and Machine Learning AIP

Inkfish Research Scientist (Medical) in Large Language Models

Staff Software Engineer

Staff Software Engineer

Staff Data Scientist

The Onyx Research Data Tech organization is GSK’s Research data ecosystem which has the capability to bring together, analyze, and power the exploration of data at scale. We partner with scientists across GSK to define and understand their challenges and develop tailored solutions that meet their needs. The goal is to ensure scientists have the right data and insights when they need it to give them a better starting point for and accelerate medical discovery. Ultimately, this helps us get ahead of disease in more predictive and powerful ways.

Onyx is a full-stack shop consisting of product and portfolio leadership, data engineering, infrastructure and DevOps, data / metadata / knowledge platforms, and AI/ML and analysis platforms, all geared toward:​

  • Building a next-generation, metadata- and automation-driven data experience for GSK’s scientists, engineers, and decision-makers, increasing productivity and reducing time spent on “data mechanics”​
  • Providing best-in-class AI/ML and data analysis environments to accelerate our predictive capabilities and attract top-tier talent​
  • Aggressively engineering our data at scale, as one unified asset, to unlock the value of our unique collection of data and predictions in real-time​

Data Engineering is responsible for the design, delivery, support, and maintenance of industrialized automated end to end data services and pipelines. They apply standardized data models and mapping to ensure data is accessible for end users in end-to-end user tools through use of APIs. They define and embed best practices and ensure compliance with Quality Management practices and alignment to automated data governance. They also acquire and process internal and external, structure and unstructured data in line with Product requirements.

This role is responsible for building and leading a scrum team of world-class NLP data engineers focused on building automated, scalable, and sustainable pipelines to account for evolving scientific needs. They support the head of Data Engineering in building a strong culture of accountability and ownership in their team, as well as instilling best-in-class engineering practices (e.g., testing, code reviews, DevOps-forward ways of working). They work in close partnership with our Platforms teams to ensure we have the right tools and ways of working, and with our Bioinformatics teams to ensure the use of appropriate schemas, vocabularies, and ontologies.

Key responsibilities for the Staff NLP Data Engineer and Team Lead:
  • Lead a team of NLP data engineers in delivering data and knowledge products that advance GSK R&D
  • Architect the data delivery and operational strategy for the NLP data engineering team; Deconstruct a complex and ambiguous data or knowledge request into a detailed strategy to make decision, anticipates future issues, and drive engineering efficiencies
  • Partner with AIML and knowledge graph platform team to build, test, and deploy NLP pipelines, systems and solutions
  • Partner closely with other data engineering leads to conceptualize the design of new data flows aimed at maximizing reuse and aligning with an event-driven microservice enable architecture
  • Partner with other data engineering leads to architect an engagement model and optimal ways of working with the product management teams
  • Apply graph-based data modelling techniques for efficient organization, integration, and data retrieval to ensure system flexibility and maintainability
  • Design innovative strategies beyond the current enterprise way of working to create a better environment for the end users, and able to construct a coordinated, stepwise plan to bring others along with the change curve
  • Create standards for proper ways of working and engineering discipline, including the QMS framework and CI/CD best practices and proactively spearhead improvement within their engineering area
  • Exemplar leader in their field of technical knowledge, keen on bettering their understanding and acting as the knowledge holder for the organization 
Why you?Basic Qualifications:

We are looking for professionals with these required skills to achieve our goals:

  • Bachelor’s degree in Data Engineering, Computer Science, Software Engineering or related field.
  • Data Engineering experience
  • Experience in Natural Language Processing algorithms and deep learning methods
  • Experience with building end-to-end systems based on machine learning or deep learning methods
  • Cloud experience (e.g., AWS, Google Cloud, Azure, Kubernetes)
Preferred Qualifications:
  • If you have the following characteristics, it would be a plus:
  • Demonstratable experience overcoming high volume, high compute challenges
  • Familiarity with orchestrating tooling
  • Experience in automated testing and design
  • Experience with DevOps-forward ways of working
  • Nice to have good understanding of ontologies and semantic harmonization of data across sources
  • Deep knowledge and use of at least one common programming language: e.g., Python, Scala, Java
  • Deep experience with common big data tools (e.g., Spark, Kafka, Storm, …)
  • Proven experience with machine learning algorithms and NLP frameworks like Pytorch, Tensorflow, Spacy, etc.
  • Proven track record of working with knowledge graphs and graph databases, and in general good understanding of database concepts
  • Proficiency in semantic web technologies (SPARQL, RDF, OWL) and harmonization of data
  • Application experience of CI/CD implementations using git and a common CI/CD stack (e.g., Jenkins, CircleCI, GitLab, Azure DevOps)
  • Experience with agile software development environments using tools like Jira and Confluence 
  • Experience with Infrastructure as a Code and automation tools (i.e. Terraform)

#GSKOnyx, #LI-GSK and #GSKTech1

Why GSK?

Uniting science, technology and talent to get ahead of disease together.

GSK is a global biopharma company with a special purpose – to unite science, technology and talent to get ahead of disease together – so we can positively impact the health of billions of people and deliver stronger, more sustainable shareholder returns – as an organisation where people can thrive. We prevent and treat disease with vaccines, specialty and general medicines. We focus on the science of the immune system and the use of new platform and data technologies, investing in four core therapeutic areas (infectious diseases, HIV, respiratory/ immunology and oncology).

Our success absolutely depends on our people. While getting ahead of disease together is about our ambition for patients and shareholders, it’s also about making GSK a place where people can thrive. We want GSK to be a place where people feel inspired, encouraged and challenged to be the best they can be. A place where they can be themselves – feeling welcome, valued, and included. Where they can keep growing and look after their wellbeing. So, if you share our ambition, join us at this exciting moment in our journey to get Ahead Together.

As an Equal Opportunity Employer, we are open to all talent. In the US, we also adhere to Affirmative Action principles. This ensures that all qualified applicants will receive equal consideration for employment without regard to neurodiversity, race/ethnicity, colour, national origin, religion, gender, pregnancy, marital status, sexual orientation, gender identity/expression, age, disability, genetic information, military service, covered/protected veteran status or any other federal, state or local protected class*(*US only).

We believe in an agile working culture for all our roles. If flexibility is important to you, we encourage you to explore with our hiring team what the opportunities are.

Should you require any adjustments to our process to assist you in demonstrating your strengths and capabilities contact us on or . 

Please note should your enquiry not relate to adjustments, we will not be able to support you through these channels. However, we have created a UK Recruitment FAQ guide. Click thelinkand scroll to the Careers Section where you will find answers to multiple questions we receive

As you apply, we will ask you to share some personal information which is entirely voluntary. We want to have an opportunity to consider a diverse pool of qualified candidates and this information will assist us in meeting that objective and in understanding how well we are doing against our inclusion and diversity ambitions. We would really appreciate it if you could take a few moments to complete it.  Rest assured, Hiring Managers do not have access to this information and we will treat your information confidentially.

Important notice to Employment businesses/ Agencies

GSK does not accept referrals from employment businesses and/or employment agencies in respect of the vacancies posted on this site. All employment businesses/agencies are required to contact GSK's commercial and general procurement/human resources department to obtain prior written authorization before referring any candidates to GSK. The obtaining of prior written authorization is a condition precedent to any agreement (verbal or written) between the employment business/ agency and GSK. In the absence of such written authorization being obtained any actions undertaken by the employment business/agency shall be deemed to have been performed without the consent or contractual agreement of GSK. GSK shall therefore not be liable for any fees arising from such actions or any fees arising from any referrals by employment businesses/agencies in respect of the vacancies posted on this site.

Please note that if you are a US Licensed Healthcare Professional or Healthcare Professional as defined by the laws of the state issuing your license, GSK may be required to capture and report expenses GSK incurs, on your behalf, in the event you are afforded an interview for employment. This capture of applicable transfers of value is necessary to ensure GSK’s compliance to all federal and state US Transparency requirements. For more information, please visit GSK’s Transparency ReportingFor the Recordsite.

    

National AI Awards 2025

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

10 AI Recruitment Agencies in the UK You Should Know (2025 Job‑Seeker Guide)

Generative‑AI hype has translated into real hiring: Lightcast recorded +57 % year‑on‑year growth in UK adverts mentioning “machine learning”, “LLM” or “gen‑AI” during Q1 2025. Yet supply still lags. Roughly 18,000 core AI professionals work in the UK, but monthly live vacancies hover around 1,400–1,600. That mismatch makes specialist recruiters invaluable—opening stealth vacancies, advising on salary bands and fast‑tracking interview loops. But many tech agencies sprinkle “AI” on their website without an active desk. To save you time, we vetted 50 + consultancies and kept only those with: A registered UK head office (verified via Companies House). A named AI/Machine‑Learning or Data practice.

AI Jobs Skills Radar 2026: Emerging Frameworks, Languages & Tools to Learn Now

As the UK’s AI sector accelerates towards a £1 trillion tech economy, the job landscape is rapidly evolving. Whether you’re an aspiring AI engineer, a machine learning specialist, or a data-driven software developer, staying ahead of the curve means more than just brushing up on Python. You’ll need to master a new generation of frameworks, languages, and tools shaping the future of artificial intelligence. Welcome to the AI Jobs Skills Radar 2026—your definitive guide to the emerging AI tech stack that employers will be looking for in the next 12–24 months. Updated annually for accuracy and relevance, this guide breaks down the top tools, frameworks, platforms, and programming languages powering the UK’s most in-demand AI careers.

How to Find Hidden AI Jobs in the UK Using Professional Bodies like BCS, IET & the Turing Society

Stop Scrolling Job Boards and Start Tapping the Real AI Market Every week a new headline announces millions of pounds flowing into artificial-intelligence research, defence initiatives, or health-tech pilots. Read the news and you could be forgiven for thinking that AI vacancies must be everywhere—just grab your laptop, open LinkedIn, and pick a role. Yet anyone who has hunted seriously for an AI job in the United Kingdom knows the truth is messier. A large percentage of worthwhile AI positions—especially specialist or senior posts—never appear on public boards. They emerge inside university–industry consortia, defence labs, NHS data-science teams, climate-tech start-ups, and venture studios. Most are filled through referral or conversation long before a recruiter drafts a formal advert. If you wait for a vacancy link, you are already at the back of the queue. The surest way to beat that dynamic is to embed yourself in the professional bodies and grassroots communities where the work is conceived. The UK has a dense network of such organisations: the Chartered Institute for IT (BCS); the Institution of Engineering and Technology (IET) with its Artificial Intelligence Technical Network; the Alan Turing Institute and its student-driven Turing Society; the Royal Statistical Society (RSS); the Institution of Mechanical Engineers (IMechE) and its Mechatronics, Informatics & Control Group; public-funding engines like UK Research and Innovation (UKRI); and an ecosystem of Slack channels and Meetup groups that trade genuine, timely intel. This article is a practical, step-by-step guide to using those networks. You will learn: Why professional bodies matter more than algorithmic job boards Exactly which special-interest groups (SIGs) and technical networks to join How to turn CPD events into informal interviews How to monitor grant databases so you hear about posts months before they exist Concrete scripts, portfolio tactics, and outreach rhythms that convert visibility into offers Follow the playbook and you move from passive applicant to insider—the colleague who hears about a role before it is written down.