National AI Awards 2025Discover AI's trailblazers! Join us to celebrate innovation and nominate industry leaders.

Nominate & Attend

Lead of Machine Learning Systems, Scale and Performance

InstaDeep
London
4 days ago
Create job alert

Lead of Machine Learning Systems, Scale and Performance

Join to apply for theLead of Machine Learning Systems, Scale and Performancerole atInstaDeep

Continue with Google Continue with Google

Lead of Machine Learning Systems, Scale and Performance

5 months ago Be among the first 25 applicants

Join to apply for theLead of Machine Learning Systems, Scale and Performancerole atInstaDeep

InstaDeep, founded in 2014, is a pioneering AI company at the forefront of innovation. With strategic offices in major cities worldwide, including London, Paris, Berlin, Tunis, Kigali, Cape Town, Boston, and San Francisco, InstaDeep collaborates with giants like Google DeepMind and prestigious educational institutions like MIT, Stanford, Oxford, UCL, and Imperial College London. We are a Google Cloud Partner and a select NVIDIA Elite Service Delivery Partner. We have been listed among notable players in AI, fast-growing companies, and Europe's 1000 fastest-growing companies in 2022 by Statista and the Financial Times. Our recent acquisition by BioNTech has further solidified our commitment to leading the industry.

Join us to be a part of the AI revolution!

The Team

Efficiently training machine learning algorithms at scale requires solving novel system problems. Our team leads the design and implementation of high-performance solutions to seamlessly scale our AI systems, including our latest foundation models in biology and beyond. We optimise throughput, scalability, and robustness in some of the largest distributed ML systems, making ambitious research ideas a practical reality.

The Role

We're looking for a Lead Machine Learning Engineer to take charge of tackling performance bottlenecks and lead the development of solutions that scale machine learning to the next level. In this role, you’ll collaborate with a team of software and performance engineers to build systems that enable the next generation of our research. Strong candidates will have demonstrated expertise in managing and executing complex ML system solutions, coupled with a drive to optimise performance and scalability in state-of-the-art workloads.

Responsibilities

  • Technical Leadership: Define the long-term technical roadmap and drive the development of scalable, high-performance ML systems.
  • Algorithm Optimisation: Optimise state-of-the-art algorithms and architectures from the latest deep learning research for compute efficiency and performance.
  • System Scaling: Design strategies for scaling machine learning models across diverse hardware platforms (GPU/TPU) and optimising system performance under heavy load.
  • Low-Level Optimisation: Write efficient Python, C/C++, XLA, Pallas, Triton, or CUDA code to achieve performance breakthroughs.
  • ML Systems Design: Architect robust distributed systems for training, deployment, and monitoring, ensuring computational efficiency and scalability.
  • Data Pipeline Automation: Develop automated pipelines for data processing, model training, validation, and deployment, enabling efficient handling of large datasets.
  • Team Collaboration: Partner with research, applied, and product teams to build a cohesive software stack supporting key projects.
  • Mentorship: Guide and mentor the ML engineering team, fostering best practices in coding, testing, and documentation.

Required Skills

  • Expertise with Python and/or C/C++
  • Understanding of Linux systems, performance analysis tools, and hardware optimisation techniques.
  • Development with machine learning frameworks (JAX, Tensorflow, and/or PyTorch)
  • Passion for profiling, identifying bottlenecks, and delivering efficient solutions.
  • Fundamentals of modern Deep Learning

Desired Skills

  • Track record of successfully scaling ML models.
  • Experience writing custom CUDA kernels or XLA operations.
  • Understanding of GPU/TPU architectures and their implications for efficient ML systems.

Representative projects

  • Profile algorithm, identifying opportunities for custom XLA/CUDA kernels.
  • Implement SOTA architectures (MAMBA, Griffin, Hyena) to research and applied projects.
  • Adapt algorithms for large-scale distributed architectures across HPC clusters.

What We Offer

  • A chance to lead and grow a team of talented engineers in solving some of AI’s most challenging system problems.
  • Hands-on experience optimising large-scale distributed ML systems that underpin industry-leading research;
  • A front-row seat to the evolution of AI, with opportunities to shape its direction through technical innovation and leadership.

TLDR: Lead a team of engineers to design and implement innovative engineering solutions for scaling ML systems, enabling InstaDeep’s most ambitious AI research.

Our commitment to our people

We empower individuals to celebrate their uniqueness here at InstaDeep. Our team comes from all walks of life, and we’re proud to continue encouraging and supporting applicants from underrepresented groups across the globe. Our commitment to creating an authentic environment comes from our ability to learn and grow from our diversity, and how better to experience this than by joining our team? We operate on a hybrid work model with guidance to work at the office 3 days per week to encourage close collaboration and innovation. We are continuing to review the situation with the well-being of InstaDeepers at the forefront of our minds.

Right to work:Please note that you will require the legal right to work in the location you are applying for.Seniority level

  • Seniority levelMid-Senior level

Employment type

  • Employment typeFull-time

Job function

  • Job functionEngineering and Information Technology
  • IndustriesSoftware Development

Referrals increase your chances of interviewing at InstaDeep by 2x

Sign in to set job alerts for “Machine Learning Specialist” roles.

Continue with Google Continue with Google

Continue with Google Continue with Google

London, England, United Kingdom 2 weeks ago

London, England, United Kingdom 1 month ago

London, England, United Kingdom 13 hours ago

London, England, United Kingdom 5 months ago

London, England, United Kingdom 1 month ago

London, England, United Kingdom 22 hours ago

Machine Learning Engineer - Search and Recommendation

London, England, United Kingdom 1 month ago

London, England, United Kingdom 1 week ago

London, England, United Kingdom 1 day ago

Greater London, England, United Kingdom 3 weeks ago

London, England, United Kingdom 2 months ago

London, England, United Kingdom 1 week ago

London, England, United Kingdom 2 months ago

London, England, United Kingdom 1 week ago

London, England, United Kingdom 5 days ago

London, England, United Kingdom 3 weeks ago

London, England, United Kingdom 3 days ago

Greater London, England, United Kingdom 2 weeks ago

London, England, United Kingdom 2 weeks ago

London, England, United Kingdom 2 weeks ago

Machine Learning Software Engineer, Research

London, England, United Kingdom 3 months ago

London, England, United Kingdom 2 days ago

London, England, United Kingdom 2 months ago

London, England, United Kingdom 20 hours ago

London, England, United Kingdom 4 days ago

Machine Learning Scientists and Engineers: AI for Quantum

London, England, United Kingdom 2 months ago

Machine Learning Engineer - Recommendations & Reinforcement Learning

London, England, United Kingdom 2 days ago

London, England, United Kingdom 1 week ago

Senior Machine Learning Engineer, Pricing

London, England, United Kingdom 2 weeks ago

London, England, United Kingdom 1 month ago

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.


#J-18808-Ljbffr

Related Jobs

View all jobs

Head of Machine Learning

Head of Machine Learning

Head of Machine Learning

Head of Machine Learning

Head of Machine Learning

Head of Machine Learning

National AI Awards 2025

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

How to Present AI Models to Non-Technical Audiences: A Public Speaking Guide for Job Seekers

In today’s competitive job market, AI professionals are expected to do more than just build brilliant algorithms—they must also explain them clearly to stakeholders who may have no technical background. Whether you're applying for a role as a machine learning engineer, data scientist, or AI consultant, your ability to articulate complex models in simple terms is fast becoming one of the most valued soft skills in interviews and on the job. This guide will help you master the art of public speaking for AI roles, offering tips on structuring presentations, designing effective slides, and using storytelling to make your work resonate with any audience.

AI Jobs UK 2025: 50 Companies Hiring Now

Bookmark this guide – we refresh it every quarter so you always know who’s really scaling their artificial‑intelligence teams. Artificial intelligence hiring has roared back in 2025. The UK’s boosted National AI Strategy funding, record‑breaking private investment (£18.1 billion so far) & a fresh wave of generative‑AI product launches mean employers are jockeying for data scientists, ML engineers, MLOps specialists, AI product managers, prompt engineers & applied researchers. Below are 50 organisations that have advertised UK‑based AI vacancies in the past eight weeks or formally announced growth plans. They’re grouped into five easy‑scan categories so you can jump straight to the kind of employer – & culture – that suits you. For each company you’ll find: Main UK hub Example live or recent vacancy Why it’s worth a look (tech stack, culture, mission) Use the internal links to browse current vacancies on ArtificialIntelligenceJobs.co.uk – or set up a free job alert so fresh roles land in your inbox.

Return-to-Work Pathways: Relaunch Your AI Career with Returnships, Flexible & Hybrid Roles

Stepping back into the workplace after a career break can feel like embarking on a whole new journey—especially in a cutting-edge field such as artificial intelligence (AI). For parents and carers, the challenge isn’t just refreshing your technical know-how but also securing a role that respects your family commitments. Fortunately, the UK’s tech sector now boasts a wealth of return-to-work programmes—from formal returnships to flexible and hybrid opportunities. These pathways are designed to bridge the gap, equipping you with refreshed skills, confidence and a supportive network. In this comprehensive guide, you’ll discover how to: Understand the booming demand for AI talent in the UK Leverage transferable skills honed during your break Overcome common re-entry challenges Build your AI skillset with targeted training Tap into returnship and re-entry programmes Find flexible, hybrid and full-time AI roles that suit your lifestyle Balance professional growth with caring responsibilities Master applications, interviews and networking Whether you’re returning after maternity leave, eldercare duties or another life chapter, this article will equip you with practical steps, resources and insider tips.