Be at the heart of actionFly remote-controlled drones into enemy territory to gather vital information.

Apply Now

Research Scientist - Large Language Model Post-Training (Must be in UK)

PolyAI
London
8 months ago
Applications closed

Related Jobs

View all jobs

Research Scientist (Machine Learning)

Research Scientist (Machine Learning)

Research Scientist -Machine Learning

Research Scientist (Machine Learning), London

Research Scientist, Machine Learning (PhD)

Research Scientist, Machine Learning (PhD) London, UK • Software Engineering +1 more • Engineer[...]

About PolyAI

PolyAI is a leader in automating customer service through cutting-edge voice technology. Our voice assistants empower businesses to deliver exceptional customer service at every interaction. We are seeking a dedicated and innovative Research Scientist to join our team and elevate our machine learning models to new heights.

Job Details

As a Research Scientist specialising in large language model post-training, you will play a key role in shaping and implementing strategies for aligning language models for use in our conversational AI platform. Your primary focus will be on post-training techniques such as preference- finetuning, reward modelling, synthetic data generation etc.

Responsibilities:

  • Train models and conduct experiments to assess model performance in live deployments
  • Work on experimental model architectures, exploring multimodal, efficient long-context etc.
  • Develop post-training strategies to achieve state of the art performance on domain-specific tasks
  • Generate, collect, and annotate contact center data from sources such as real customer calls, chats, online open datasets, and synthetic data
  • Develop robust evaluation benchmarks to track improvements in production models
  • Collaborate with the legal and compliance team to address any compliance or data privacy-related issues
  • Work closely with product and engineering teams to ensure alignment with business and production goals
  • Stay informed about the latest advancements in machine learning, ASR, TTS, and LLM to continuously enhance our technologies

Requirements

  • A degree in Computer Science, Machine Learning, or a related field, or equivalent industry experience
  • 3+ years of experience working with deep learning and statistical models
  • Strong knowledge of data quality standards and annotation processes, with the ability to independently evaluate and improve models
  • Proficiency in Python and familiarity with relevant ML frameworks and libraries (e.g., PyTorch)
  • Experience with cloud services such as AWS, GCP, or Azure
  • Excellent verbal and written communication skills, with the ability to convey complex technical concepts to diverse audiences
  • A passion for solving technical challenges and driving practical solutions

Preferred Qualifications:

  • Experience working with LLMs and data preparation pipelines.
  • Experience with speech models, such as ASR or TTS.

Why Join PolyAI: At PolyAI, we are dedicated to pushing the boundaries of voice technology and machine learning. You will have the opportunity to work with a talented and diverse team, contribute to groundbreaking projects, and make a significant impact on the future of customer service automation. We offer a dynamic and inclusive work environment, competitive compensation, and opportunities for professional growth.

PolyAI is an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.

Benefits

Participation in the company’s employee share options plan

25 days holiday, plus bank holidays

Flexible working from home policy plus a one-off WFH allowance when you join

Work from outside of the UK for up to 6 months each year

Enhanced parental leave

Bike2Work scheme

Annual learning and development allowance

‍ ‍Company-funded fertility and family-forming programmes

Menopause care programme with Maven

Private healthcare and dental cover, discounts on gym members and relaxation apps, and access to a range of mental health programs

Equal Opportunity Statement:

PolyAI is proud to be an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.

All employment decisions at PolyAI will be based on the business needs without attention to ethnicity, religion, sexual orientation, gender identity, family or parental status, national origin, neurodiversity status or disability status.

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

How to Write an AI CV that Beats ATS (UK examples)

Writing an AI CV for the UK market is about clarity, credibility, and alignment. Recruiters spend seconds scanning the top third of your CV, while Applicant Tracking Systems (ATS) check for relevant skills & recent impact. Your goal is to make both happy without gimmicks: plain structure, sharp evidence, and links that prove you can ship to production. This guide shows you exactly how to do that. You’ll get a clean CV anatomy, a phrase bank for measurable bullets, GitHub & portfolio tips, and three copy-ready UK examples (junior, mid, research). Paste the structure, replace the details, and tailor to each job ad.

AI Recruitment Trends 2025 (UK): What Job Seekers Must Know About Today’s Hiring Process

Summary: UK AI hiring has shifted from titles & puzzle rounds to skills, portfolios, evals, safety, governance & measurable business impact. This guide explains what’s changed, what to expect in interviews, and how to prepare—especially for LLM application, MLOps/platform, data science, AI product & safety roles. Who this is for: AI/ML engineers, LLM engineers, data scientists, MLOps/platform engineers, AI product managers, applied researchers & safety/governance specialists targeting roles in the UK.

Why AI Careers in the UK Are Becoming More Multidisciplinary

Artificial intelligence is no longer a single-discipline pursuit. In the UK, employers increasingly want talent that can code and communicate, model and manage risk, experiment and empathise. That shift is reshaping job descriptions, training pathways & career progression. AI is touching regulated sectors, sensitive user journeys & public services — so the work now sits at the crossroads of computer science, law, ethics, psychology, linguistics & design. This isn’t a buzzword-driven change. It’s happening because real systems are deployed in the wild where people have rights, needs, habits & constraints. As models move from lab demos to products that diagnose, advise, detect fraud, personalise education or generate media, teams must align performance with accountability, safety & usability. The UK’s maturing AI ecosystem — from startups to FTSE 100s, consultancies, the public sector & universities — is responding by hiring multidisciplinary teams who can anticipate social impact as confidently as they ship features. Below, we unpack the forces behind this change, spotlight five disciplines now fused with AI roles, show what it means for UK job-seekers & employers, and map practical steps to future-proof your CV.