About the Role
We are seeking aGenAI/Data Scientistto join ourAI Innovation Team. This role will focus on experimenting, optimising and applyingGenerative AIoff the shelf models to extract valuable insights from large-scale patent datasets and enhance our search and analytics tools. You will collaborate with data scientists, product teams, and stakeholders across different geographies, driving innovation throughLLMs(Large Language Models) and advanced AI methodologies.
Responsibilities
Perform EDA, design, build, and deploy pipelines utilizingLLMmodels to enhance patent search and analytics applications.
Stay up to date with the latest research and developments in the field of natural language processing (NLP) and machine learning.
Work onGenAItechniqueslikePrompt Engineering,RAG (Retrieval-Augmented Generation)and perform evaluation using frameworks to optimise LLM performance.
Develop and implement machine learning workflows, focusing on the integration ofGenAIwith existing data infrastructure.
Collaborate with the team to exploreGenerative AIuse cases, includingautomated summarisation,natural language understanding, andtext generation.
Conduct experiments to evaluate model performance, identify areas for improvement, and implement enhancements.
Perform continuous evaluations and improvements of models to handle large volumes of patent data.
Work with stakeholders across teams to identify key areas for AI-driven innovation and enhancement indata products.
UsePython,SQL,PySparkand related technologies to develop scalable solutions, focusing on large-scale data processing.
Qualifications:
Demonstrate3+ yearsof experience indata science, with a focus onNLP,Generative AIandLLMs.
Proficiency inPythonand experience working with transformer basedLLMsandNLPframeworks (e.g.Hugging Face, Spacy, Pytorch/Tensorflow etc).
Knowledge ofPrompt Engineering,RAGtechniques and various evaluation methodologies for integratingGenAIwith search/retrieval systems andmeasure the quality.
Experience working withcloud platformslikeAzure,AWS, orGCPfor machine learning workflows.
Understanding ofdata engineering pipelinesanddistributed data processing(e.g.,Databricks, Apache Spark).
Strong analytical skills, with the ability to transform raw data into meaningful insights through AI techniques.
Experience withLangChain / LlamaIndex,vector databases (e.g., FAISS),fine-tuningmodels on domain-specific data would be an advantage
Work in a way that works for you
We promote a healthy work/life balance across the organisation. We offer an appealing working prospect for our people. With numerous wellbeing initiatives, shared parental leave, study assistance and sabbaticals, we will help you meet your immediate responsibilities and your long-term goals.
Working flexible hours - flexing the times when you work in the day to help you fit everything in and work when you are the most productive
Working for you
We know that your wellbeing and happiness are key to a long and successful career. These are some of the benefits we are delighted to offer:
Generous holiday allowance with the option to buy additional days Health screening, eye care vouchers and private medical benefits Wellbeing programs Life assurance Access to a competitive contributory pension scheme Save As You Earn share option scheme Travel Season ticket loan Electric Vehicle Scheme Optional Dental Insurance Maternity, paternity and shared parental leave Employee Assistance Programme Access to emergency care for both the elderly and children RECARES days, giving you time to support the charities and causes that matter to you Access to employee resource groups with dedicated time to volunteer Access to extensive learning and development resources Access to employee discounts scheme via Perks at Work