Software Engineer - Model Evaluation and Productisation (Must be UK based)

PolyAI
London
1 year ago
Applications closed

Related Jobs

View all jobs

Software Engineer, Applied Artificial Intelligence (AI)

Software Engineer, Applied Artificial Intelligence (AI)

Lead Software Engineer - MLOps Platform

Lead Software Engineer (Machine Learning)

Data Scientist – Content Engineering

Digital and Technology Solutions Apprenticeship - Artificial Intelligence Software Engineering

PolyAI is a leader in automating customer service through innovative voice technology. Our voice assistants empower businesses to deliver exceptional customer service at every interaction.

We are seeking a talented and hands-on Software Engineer with a strong data science background to join our team.

In this role, you will work on building software to enhance the visibility and configurability of large language models (LLMs). You will be responsible for rapidly developing tools and platforms to evaluate, iterate, and productionalize models, ensuring their reliability and accuracy.

We are looking for the right candidate, and therefore are flexible on the levelling for this position ranging from mid- level to senior!

Responsibilities:

  • Must have at least 2 years of Python experience
  • Must have at least 2+ years working experience
  • Develop software that provides visibility into LLM models and offers configurability for tuning and evaluation.
  • Build and maintain evaluation datasets and tools, enabling the measurement of model performance across key metrics.
  • Take a hands-on approach to quickly prototype, test, and iterate on solutions, moving from proof-of-concept to production.
  • Employ a data-driven methodology to drive model accuracy, leveraging evaluation results to inform decisions.
  • Collaborate with cross-functional teams to integrate developed tools and ensure they meet production standards.
  • Formulate hypotheses, design experiments, and collect data to validate model assumptions, consistently striving for improved reliability.
  • Communicate findings and ranking metrics clearly to both technical and non-technical stakeholders.

Why Join Us:

Join a dynamic and innovative team at the forefront of LLM development. You will have the opportunity to work on challenging projects, rapidly build impactful solutions, and drive data-informed improvements that push the boundaries of what LLMs can achieve.

Requirements

  • Degree in Computer Science, Data Science, or related field, or equivalent experience.
  • Strong proficiency in Python
  • Experience developing evaluation suites, datasets, and data-driven tools for model reliability testing.
  • Ability to rapidly prototype and iterate on solutions while maintaining a focus on production-level quality.
  • Strong problem-solving skills and a creative mindset, with the ability to hypothesize and validate results through experimentation.
  • Familiarity with cloud platforms such as AWS, GCP, or Azure is a plus.

Benefits

Participation in the company’s employee share options plan

25 days holiday, plus bank holidays

Flexible working from home policy plus a one-off WFH allowance when you join

Work from outside of the UK for up to 6 months each year

Enhanced parental leave

Yearly learning budget

Bike2Work scheme

Annual learning and development allowance

One-off WFH allowance when you join

‍ ‍Company-funded fertility and family-forming programmes

Menopause care programme with Maven

Private healthcare and dental cover, discounts on gym members and relaxation apps, and access to a range of mental health programs

Equal Opportunity Statement:

PolyAI is proud to be an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.

All employment decisions at PolyAI will be based on the business needs without attention to ethnicity, religion, sexual orientation, gender identity, family or parental status, national origin, neurodiversity status or disability status.

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

How Many AI Tools Do You Need to Know to Get an AI Job?

If you are job hunting in AI right now it can feel like you are drowning in tools. Every week there is a new framework, a new “must-learn” platform or a new productivity app that everyone on LinkedIn seems to be using. The result is predictable: job seekers panic-learn a long list of tools without actually getting better at delivering outcomes. Here is the truth most hiring managers will quietly agree with. They do not hire you because you know 27 tools. They hire you because you can solve a problem, communicate trade-offs, ship something reliable and improve it with feedback. Tools matter, but only in service of outcomes. So how many AI tools do you actually need to know? For most AI job seekers: fewer than you think. You need a tight core toolkit plus a role-specific layer. Everything else is optional. This guide breaks it down clearly, gives you a simple framework to choose what to learn and shows you how to present your toolset on your CV, portfolio and interviews.

What Hiring Managers Look for First in AI Job Applications (UK Guide)

Hiring managers do not start by reading your CV line-by-line. They scan for signals. In AI roles especially, they are looking for proof that you can ship, learn fast, communicate clearly & work safely with data and systems. The best applications make those signals obvious in the first 10–20 seconds. This guide breaks down what hiring managers typically look for first in AI applications in the UK market, how to present it on your CV, LinkedIn & portfolio, and the most common reasons strong candidates get overlooked. Use it as a checklist to tighten your application before you click apply.

The Skills Gap in AI Jobs: What Universities Aren’t Teaching

Artificial intelligence is no longer a future concept. It is already reshaping how businesses operate, how decisions are made, and how entire industries compete. From finance and healthcare to retail, manufacturing, defence, and climate science, AI is embedded in critical systems across the UK economy. Yet despite unprecedented demand for AI talent, employers continue to report severe recruitment challenges. Vacancies remain open for months. Salaries rise year on year. Candidates with impressive academic credentials often fail technical interviews. At the heart of this disconnect lies a growing and uncomfortable truth: Universities are not fully preparing graduates for real-world AI jobs. This article explores the AI skills gap in depth—what is missing from many university programmes, why the gap persists, what employers actually want, and how jobseekers can bridge the divide to build a successful career in artificial intelligence.