Job description
Site Name: USA - Massachusetts - Waltham, GSK HQ
Posted Date: Dec 1 2025
Senior Director, Data Science Innovation Lead
The Senior Director Data Science Innovation lead will pioneer transformative solutions in real-world evidence generation in the Real-World Data, Measurement, and Analytics (RWDMA) organization, supporting the entire drug development life cycle from early development to late-phase clinical trials and post-approval market access and reimbursement. Leveraging the latest advancements in data sciences, such as multimodal AI, generative AI, knowledge graphs, causal AI and agentic AI, the Data Science Innovation Lead will develop and optimize statistical methodologies in comparative effectiveness analyses, precision medicine, predictive modelling, and evidence synthesis. In addition, the Innovation Lead will support AI-driven automation tools and deployment of intelligent systems for more efficient data processing and automate complex data analyses and QC processes, thereby accelerating development timelines while ensuring compliance with regulatory standards.
Key Responsibilities
Data Science Strategy & Leadership
· Align RWDMA Data Science initiatives with RWD organizational drug development goals, regulatory requirements (e.g., FDA, EMA), and payer expectations, ensuring strategic impact and compliance, particularly in RWD analytics.
· Lead RWDMA Data Science through a matrix organization, collaborating with biostatisticians, clinical and other subject matter experts, and regulatory specialists to lead innovative applications of Data Science in RWE generation and embed Data Science into RWD workflows to improve efficiency of data processing and analysis.
Innovative Applications of Data Science in RWE Generation
· Design customized Data Science models tailored to specific RWD analytic applications, including:
· Comparative Effectiveness: Applying Data Science methodologies to evaluate treatment outcomes across diverse patient populations, supporting real world biostatistics and statistical programming efforts.
· Precision Medicine: Leveraging RWD to identify patient subgroups and biomarkers for tailored therapies.
· Predictive Modelling: Using advanced Data Science techniques (e.g., transformers, recurrent neural networks) to forecast disease progression, trial outcomes, and patient responses, and enhance insights from digital measurement and patient reported outcomes.
· Evidence Synthesis: Utilizing data science methodologies to integrate and synthesize findings from RWD and RCTs, including meta-analysis, indirect treatment comparisons, and network meta-analysis, to support comprehensive evaluations of treatment efficacy and safety.
Automation & Process Optimization
· Automate coding, including clinical coding and patient identification, and quality control (QC) processes using AI-driven anomaly detection and pattern recognition to ensure the validity of statistical programs, as well as data integrity across large-scale RWD datasets.
· Develop Natural Language Processing (NLP) tools to automate the creation, review, and validation of analytic plans and protocols, ensuring compliance with regulatory and payer standards, benefiting data strategy and operational efficiency.
· Build AI systems to streamline administrative tasks, such as assessing analytic consistency with market access requirements, enhancing operational efficiency across drug development phases.
Data strategy
· In alignment with DDF and D3 initiatives and the RWDSP team, assess the gaps in data needs in RWD and use potential Data Science applications to inform data strategy.
· Collaborate with the RWDSP, DDF, and data tech teams to optimize RWD storage, management, and access control to optimise RWD analytical workflows.
· Provide technical expertise and leadership on the usage of synthetic data in RWD and drug development.
Collaboration & Thought Leadership
· Mentor team members in advanced Data Science methodologies, fostering a culture of innovation and technical excellence across real world biostatistics, digital measurement, and other focus areas.
· Spearhead methodological innovation and development in RWD Data Science, providing opportunities for mentoring and professional growth of junior RWDMA staff.
· Develop and manage an external engagement strategy with academic partners and key opinion leaders (KOLs) to foster collaborative research and development in RWD Data Science data science.
· Present Data Science analyses and insights clearly and effectively at conferences, in publications, and during key stakeholder meetings, reinforcing the value of RWD Data Science contributions.
Qualifications
Education and experience:
· PhD in Data Science, Biostatistics, Computer Science, or a related field.
· 15+ years in healthcare and life sciences, with significant exposure to pharmaceutical and/or medical device industries.
· 10+ years in clinical development or RWE generation within regulated environments, including hands-on leadership of Data Science projects.
· Demonstrated success in deploying DataOps, ModelOps, or MLOps pipelines in cloud platforms (e.g., Azure, AWS).
Technical Skills:
· Expertise in statistical modelling, AI and machine learning techniques (e.g., Convolutional Neural Networks [CNNs], Recurrent Neural Networks [RNNs], Transformers).
· Proficiency in generative AI (e.g., LLMs, RAG, GANs, VAEs, and diffusion models) and the technical stack and tools (e.g., LangChain, LlamaIndex, CrewAI).
· Strong programming skills in Python, R, TensorFlow, PyTorch, and experience with cloud tools (e.g., Azure ML, AWS SageMaker), containerization (Docker), and version control (GitHub).
· Familiarity with multi-domain real-world data (e.g., clinical records, imaging, genomics, wearables, unstructured text).
Achievements:
· Proven track record of innovation in Data Science applications for healthcare, evidenced by publications, patents, or industry recognition.
Experience navigating ethical, privacy, and regulatory challenges in AI-driven healthcare solutions.
Senior Director, Data Science Innovation Lead
The Senior Director Data Science Innovation lead will pioneer transformative solutions in real-world evidence generation in the Real-World Data, Measurement, and Analytics (RWDMA) organization, supporting the entire drug development life cycle from early development to late-phase clinical trials and post-approval market access and reimbursement. Leveraging the latest advancements in data sciences, such as multimodal AI, generative AI, knowledge graphs, causal AI and agentic AI, the Data Science Innovation Lead will develop and optimize statistical methodologies in comparative effectiveness analyses, precision medicine, predictive modelling, and evidence synthesis. In addition, the Innovation Lead will support AI-driven automation tools and deployment of intelligent systems for more efficient data processing and automate complex data analyses and QC processes, thereby accelerating development timelines while ensuring compliance with regulatory standards.
Key Responsibilities
Data Science Strategy & Leadership
· Align RWDMA Data Science initiatives with RWD organizational drug development goals, regulatory requirements (e.g., FDA, EMA), and payer expectations, ensuring strategic impact and compliance, particularly in RWD analytics.
· Lead RWDMA Data Science through a matrix organization, collaborating with biostatisticians, clinical and other subject matter experts, and regulatory specialists to lead innovative applications of Data Science in RWE generation and embed Data Science into RWD workflows to improve efficiency of data processing and analysis.
Innovative Applications of Data Science in RWE Generation
· Design customized Data Science models tailored to specific RWD analytic applications, including:
· Comparative Effectiveness: Applying Data Science methodologies to evaluate treatment outcomes across diverse patient populations, supporting real world biostatistics and statistical programming efforts.
· Precision Medicine: Leveraging RWD to identify patient subgroups and biomarkers for tailored therapies.
· Predictive Modelling: Using advanced Data Science techniques (e.g., transformers, recurrent neural networks) to forecast disease progression, trial outcomes, and patient responses, and enhance insights from digital measurement and patient reported outcomes.
· Evidence Synthesis: Utilizing data science methodologies to integrate and synthesize findings from RWD and RCTs, including meta-analysis, indirect treatment comparisons, and network meta-analysis, to support comprehensive evaluations of treatment efficacy and safety.
Automation & Process Optimization
· Automate coding, including clinical coding and patient identification, and quality control (QC) processes using AI-driven anomaly detection and pattern recognition to ensure the validity of statistical programs, as well as data integrity across large-scale RWD datasets.
· Develop Natural Language Processing (NLP) tools to automate the creation, review, and validation of analytic plans and protocols, ensuring compliance with regulatory and payer standards, benefiting data strategy and operational efficiency.
· Build AI systems to streamline administrative tasks, such as assessing analytic consistency with market access requirements, enhancing operational efficiency across drug development phases.
Data strategy
· In alignment with DDF and D3 initiatives and the RWDSP team, assess the gaps in data needs in RWD and use potential Data Science applications to inform data strategy.
· Collaborate with the RWDSP, DDF, and data tech teams to optimize RWD storage, management, and access control to optimise RWD analytical workflows.
· Provide technical expertise and leadership on the usage of synthetic data in RWD and drug development.
Collaboration & Thought Leadership
· Mentor team members in advanced Data Science methodologies, fostering a culture of innovation and technical excellence across real world biostatistics, digital measurement, and other focus areas.
· Spearhead methodological innovation and development in RWD Data Science, providing opportunities for mentoring and professional growth of junior RWDMA staff.
· Develop and manage an external engagement strategy with academic partners and key opinion leaders (KOLs) to foster collaborative research and development in RWD Data Science data science.
· Present Data Science analyses and insights clearly and effectively at conferences, in publications, and during key stakeholder meetings, reinforcing the value of RWD Data Science contributions.
Qualifications
Education and experience:
· PhD in Data Science, Biostatistics, Computer Science, or a related field.
· 15+ years in healthcare and life sciences, with significant exposure to pharmaceutical and/or medical device industries.
· 10+ years in clinical development or RWE generation within regulated environments, including hands-on leadership of Data Science projects.
· Demonstrated success in deploying DataOps, ModelOps, or MLOps pipelines in cloud platforms (e.g., Azure, AWS).
Technical Skills:
· Expertise in statistical modelling, AI and machine learning techniques (e.g., Convolutional Neural Networks [CNNs], Recurrent Neural Networks [RNNs], Transformers).
· Proficiency in generative AI (e.g., LLMs, RAG, GANs, VAEs, and diffusion models) and the technical stack and tools (e.g., LangChain, LlamaIndex, CrewAI).
· Strong programming skills in Python, R, TensorFlow, PyTorch, and experience with cloud tools (e.g., Azure ML, AWS SageMaker), containerization (Docker), and version control (GitHub).
· Familiarity with multi-domain real-world data (e.g., clinical records, imaging, genomics, wearables, unstructured text).
Achievements:
· Proven track record of innovation in Data Science applications for healthcare, evidenced by publications, patents, or industry recognition.
Experience navigating ethical, privacy, and regulatory challenges in AI-driven healthcare solutions.
#LI-GSK*
#Hybrid*
• If you are based in Cambridge, MA; Waltham, MA; Rockville, MD; or San Francisco, CA, the annual base salary for new hires in this position ranges $207,075 to $345,125.
The US salary ranges take into account a number of factors including work location within the US market, the candidate’s skills, experience, education level and the market rate for the role. In addition, this position offers an annual bonus and eligibility to participate in our share based long term incentive program which is dependent on the level of the role. Available benefits include health care and other insurance benefits (for employee and family), retirement benefits, paid holidays, vacation, and paid caregiver/parental and medical leave.
If salary ranges are not displayed in the job posting for a specific country, the relevant compensation will be discussed during the recruitment process.
Please visit GSK US Benefits Summary to learn more about the comprehensive benefits program GSK offers US employees.
Why GSK?
Uniting science, technology and talent to get ahead of disease together.
GSK is a global biopharma company with a purpose to unite science, technology and talent to get ahead of disease together. We aim to positively impact the health of 2.5 billion people by the end of the decade, as a successful, growing company where people can thrive. We get ahead of disease by preventing and treating it with innovation in specialty medicines and vaccines. We focus on four therapeutic areas: respiratory, immunology and inflammation; oncology; HIV; and infectious diseases – to impact health at scale.
People and patients around the world count on the medicines and vaccines we make, so we’re committed to creating an environment where our people can thrive and focus on what matters most. Our culture of being ambitious for patients, accountable for impact and doing the right thing is the foundation for how, together, we deliver for patients, shareholders and our people.
If you require an accommodation or other assistance to apply for a job at GSK, please contact the GSK Service Centre at 1-877-694-7547 (US Toll Free) or +1 801 567 5155 (outside US).
GSK is an Equal Opportunity Employer. This ensures that all qualified applicants will receive equal consideration for employment without regard to race, color, religion, sex (including pregnancy, gender identity, and sexual orientation), parental status, national origin, age, disability, genetic information (including family medical history), military service or any basis prohibited under federal, state or local law.
Important notice to Employment businesses/ Agencies
GSK does not accept referrals from employment businesses and/or employment agencies in respect of the vacancies posted on this site. All employment businesses/agencies are required to contact GSK's commercial and general procurement/human resources department to obtain prior written authorization before referring any candidates to GSK. The obtaining of prior written authorization is a condition precedent to any agreement (verbal or written) between the employment business/ agency and GSK. In the absence of such written authorization being obtained any actions undertaken by the employment business/agency shall be deemed to have been performed without the consent or contractual agreement of GSK. GSK shall therefore not be liable for any fees arising from such actions or any fees arising from any referrals by employment businesses/agencies in respect of the vacancies posted on this site.
Please note that if you are a US Licensed Healthcare Professional or Healthcare Professional as defined by the laws of the state issuing your license, GSK may be required to capture and report expenses GSK incurs, on your behalf, in the event you are afforded an interview for employment. This capture of applicable transfers of value is necessary to ensure GSK’s compliance to all federal and state US Transparency requirements. For more information, please visit the Centers for Medicare and Medicaid Services (CMS) website at https://openpaymentsdata.cms.gov/