Software Engineer - Model Evaluation and Productisation (Must be UK based)

PolyAI
London
11 months ago
Applications closed

PolyAI is a leader in automating customer service through innovative voice technology. Our voice assistants empower businesses to deliver exceptional customer service at every interaction.

We are seeking a talented and hands-on Software Engineer with a strong data science background to join our team.

In this role, you will work on building software to enhance the visibility and configurability of large language models (LLMs). You will be responsible for rapidly developing tools and platforms to evaluate, iterate, and productionalize models, ensuring their reliability and accuracy.

We are looking for the right candidate and are therefore flexible on the level for this position, which can range from mid-level to senior.

Responsibilities:

  • Must have at least 2 years of Python experience
  • Must have at least 2 years of working experience
  • Develop software that provides visibility into LLMs and offers configurability for tuning and evaluation.
  • Build and maintain evaluation datasets and tools, enabling the measurement of model performance across key metrics.
  • Take a hands-on approach to quickly prototype, test, and iterate on solutions, moving from proof-of-concept to production.
  • Employ a data-driven methodology to drive model accuracy, leveraging evaluation results to inform decisions.
  • Collaborate with cross-functional teams to integrate developed tools and ensure they meet production standards.
  • Formulate hypotheses, design experiments, and collect data to validate model assumptions, consistently striving for improved reliability.
  • Communicate findings and ranking metrics clearly to both technical and non-technical stakeholders.

Why Join Us:

Join a dynamic and innovative team at the forefront of LLM development. You will have the opportunity to work on challenging projects, rapidly build impactful solutions, and drive data-informed improvements that push the boundaries of what LLMs can achieve.

Requirements

  • Degree in Computer Science, Data Science, or related field, or equivalent experience.
  • Strong proficiency in Python
  • Experience developing evaluation suites, datasets, and data-driven tools for model reliability testing.
  • Ability to rapidly prototype and iterate on solutions while maintaining a focus on production-level quality.
  • Strong problem-solving skills and a creative mindset, with the ability to hypothesize and validate results through experimentation.
  • Familiarity with cloud platforms such as AWS, GCP, or Azure is a plus.

Benefits

Participation in the company’s employee share options plan

25 days holiday, plus bank holidays

Flexible working from home policy plus a one-off WFH allowance when you join

Work from outside of the UK for up to 6 months each year

Enhanced parental leave

Yearly learning budget

Bike2Work scheme

Company-funded fertility and family-forming programmes

Menopause care programme with Maven

Private healthcare and dental cover, discounts on gym memberships and relaxation apps, and access to a range of mental health programmes

Equal Opportunity Statement:

PolyAI is proud to be an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.

All employment decisions at PolyAI will be based on business needs, without regard to ethnicity, religion, sexual orientation, gender identity, family or parental status, national origin, neurodiversity status or disability status.
