Be at the heart of actionFly remote-controlled drones into enemy territory to gather vital information.

Apply Now

Machine Learning Engineer (Reinforcement Learning) London, UK

AgileRL Ltd
City of London
4 days ago
Create job alert
Machine Learning Engineer (Reinforcement Learning)

We are seeking a talented and experienced Machine Learning Engineer with a background in Reinforcement Learning to join our team. This engineer will contribute to the further development of Arena, a web-based software platform for reinforcement learning training and RLOps, and our open-source reinforcement learning library.


Responsibilities

  • Collaborate with the team to understand requirements and design new features of the Arena platform and open-source framework.
  • Develop scalable and reliable infrastructure to support reinforcement learning model training, LLM finetuning, model deployment, and management.
  • Integrate existing machine learning frameworks and libraries into the platform and open-source framework, providing a range of algorithms, environments, and tools for reinforcement learning model development.
  • Stay up-to-date with the latest advancements in AI, MLOps, reinforcement learning algorithms, tools, and techniques, and incorporate them into the platform as appropriate.
  • Provide technical guidance and support to internal users and external customers using the Arena platform and open-source framework.

Requirements

  • Master’s or Ph.D. degree in Computer Science, Engineering, or a related field, or 3+ years of relevant industry experience.
  • Solid understanding of reinforcement learning algorithms and concepts, with hands‑on experience in building and training reinforcement learning models.
  • Strong programming skills, with experience using reinforcement learning and ML frameworks and libraries (e.g. PyTorch, TensorFlow, Ray, Gym, RLLib, SB3, TRL), and MLOps tools.
  • Solid understanding of hyperparameter optimisation techniques and strategies.
  • Experience in building machine learning platforms or tooling for industrial or enterprise settings.
  • Proficiency in data management techniques, including storage, retrieval, and pre‑processing of large‑scale datasets.
  • Familiarity with model deployment and management, including the development of APIs, deployment pipelines, and performance optimisation.
  • Experience in designing and developing cloud‑based infrastructure for distributed computing and scalable data processing.
  • Deep understanding of software engineering and machine learning principles and best practices.
  • Strong problem‑solving and communication skills, and the ability to work independently as well as in a team environment.

Compensation

  • Competitive salary + significant stock options.
  • 30 days of holiday, plus bank holidays, per year.
  • Flexible working from home and 6-month remote working policies.
  • Enhanced parental leave.
  • Learning budget of £500 per calendar year for books, training courses and conferences.
  • Company pension scheme.
  • Regular team socials and quarterly all‑company parties.
  • Bike2Work scheme.

Join the fast‑growing AgileRL team and play a key role in the development of cutting‑edge reinforcement learning tooling and infrastructure.


Apply below


#J-18808-Ljbffr

Related Jobs

View all jobs

Machine Learning Engineer

Machine Learning Engineer

Machine Learning Engineer (Databricks)

Machine Learning Engineer

Machine Learning Engineer

Machine Learning Engineer

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

AI Hiring Trends 2026: What to Watch Out For (For Job Seekers & Recruiters)

As we head into 2026, the AI hiring market in the UK is going through one of its biggest shake-ups yet. Economic conditions are still tight, some employers are cutting headcount, & AI itself is automating whole chunks of work. At the same time, demand for strong AI talent is still rising, salaries for in-demand skills remain high, & new roles are emerging around AI safety, governance & automation. Whether you are an AI job seeker planning your next move or a recruiter trying to build teams in a volatile market, understanding the key AI hiring trends for 2026 will help you stay ahead. This guide breaks down the most important trends to watch, what they mean in practice, & how to adapt – with practical actions for both candidates & hiring teams.

How to Write an AI CV that Beats ATS (UK examples)

Writing an AI CV for the UK market is about clarity, credibility, and alignment. Recruiters spend seconds scanning the top third of your CV, while Applicant Tracking Systems (ATS) check for relevant skills & recent impact. Your goal is to make both happy without gimmicks: plain structure, sharp evidence, and links that prove you can ship to production. This guide shows you exactly how to do that. You’ll get a clean CV anatomy, a phrase bank for measurable bullets, GitHub & portfolio tips, and three copy-ready UK examples (junior, mid, research). Paste the structure, replace the details, and tailor to each job ad.

AI Recruitment Trends 2025 (UK): What Job Seekers Must Know About Today’s Hiring Process

Summary: UK AI hiring has shifted from titles & puzzle rounds to skills, portfolios, evals, safety, governance & measurable business impact. This guide explains what’s changed, what to expect in interviews, and how to prepare—especially for LLM application, MLOps/platform, data science, AI product & safety roles. Who this is for: AI/ML engineers, LLM engineers, data scientists, MLOps/platform engineers, AI product managers, applied researchers & safety/governance specialists targeting roles in the UK.