Senior Machine Learning Engineer, Scaling and Performance

InstaDeep Ltd
London
1 month ago

Innovation is at the heart of what we do. We work as a cohesive team that collectively develops real-life decision-making and technology products across various industries. We are always on the lookout for talented minds to join our dynamic team and contribute their unique insights. Be part of a stimulating and collaborative environment where your ideas can make an impact and ignite transformative change worldwide.

InstaDeep, founded in 2014, is a pioneering AI company at the forefront of innovation. With strategic offices in major cities worldwide, including London, Paris, Berlin, Tunis, Kigali, Cape Town, Boston, and San Francisco, InstaDeep collaborates with giants like Google DeepMind and prestigious educational institutions like MIT, Stanford, Oxford, UCL, and Imperial College London. We are a Google Cloud Partner and a select NVIDIA Elite Service Delivery Partner. We have been listed among notable players in AI, fast-growing companies, and Europe's 1000 fastest-growing companies in 2022 by Statista and the Financial Times. Our recent acquisition by BioNTech has further solidified our commitment to leading the industry.

Join us to be a part of the AI revolution!

The Team:

Our team plays a pivotal role in enhancing the capabilities and efficiency of our advanced AI systems. We design solutions that enable our machine learning models to scale seamlessly and perform optimally in real-world applications and large-scale research. Collaborating across InstaDeep, we directly impact projects in diverse fields including Life Sciences, Logistics, Chip Design, and Quantum ML.

The Role:

We seek a highly skilled Machine Learning Engineer with a passion for tackling the challenges of large-scale ML development. You'll play a vital role in making our ambitious AI solutions a practical reality. If you thrive on system-level analysis, find joy in squeezing every ounce of performance from hardware, and love diving deep into algorithm optimisation, this is the position for you.

TLDR:

Train world-class billion-parameter models for some of the most exciting applications of ML in the industry – with minimum development time and maximum hardware utilisation.

Responsibilities

  • Scaling Expertise: Design and implement strategies to efficiently scale machine learning models across diverse hardware platforms (GPU/TPU).
  • Performance Optimisation: Analyse and profile ML systems under heavy load, pinpointing bottlenecks and implementing targeted optimisations.
  • Distributed Systems Architecture: Create robust distributed training and inference solutions for maximum computational efficiency.
  • Algorithmic Optimisation: Research and understand the latest deep learning literature to implement and optimise state-of-the-art algorithms and architectures, ensuring compute efficiency and performance.
  • Low-Level Mastery: Write high-quality Python, C/C++, XLA, Pallas, Triton, and/or CUDA code to achieve performance breakthroughs.

Required Skills

  • Understanding of Linux systems, performance analysis tools, and hardware optimisation techniques.
  • Experience with distributed training frameworks (Ray, Dask, PyTorch Lightning, etc.).
  • Expertise with Python and/or C/C++.
  • Development with machine learning frameworks (JAX, TensorFlow, PyTorch, etc.).
  • Passion for profiling, identifying bottlenecks, and delivering efficient solutions.

Highly Desirable

  • Track record of successfully scaling ML models.
  • Experience writing custom CUDA kernels or XLA operations.
  • Understanding of GPU/TPU architectures and their implications for efficient ML systems.
  • Fundamentals of modern deep learning.
  • Actively following ML trends and a desire to push boundaries.

Example Projects:

  • Profile algorithm traces, identifying opportunities for custom XLA operations and CUDA kernel development.
  • Implement and apply SOTA architectures (Mamba, Griffin, Hyena) to research and applied projects.
  • Adapt algorithms for large-scale distributed architectures across HPC clusters.
  • Employ memory-efficient techniques within models for increased parameter counts and longer context lengths.

What We Offer:

  • Real-World Impact: Directly contribute to the performance and reach of our AI solutions.
  • Cutting-Edge Challenges: Tackle complex problems at the forefront of machine learning and large-scale system design.
  • Growth-Oriented Environment: Expand your expertise in a team of talented engineers dedicated to advancing ML scalability.

* Important: All applicants must submit their CV/Resume and cover letter in English. *

Our commitment to our people

We empower individuals to celebrate their uniqueness here at InstaDeep. Our team comes from all walks of life, and we’re proud to continue encouraging and supporting applicants from underrepresented groups across the globe. Our commitment to creating an authentic environment comes from our ability to learn and grow from our diversity, and how better to experience this than by joining our team? We operate on a hybrid work model with guidance to work at the office 3 days per week to encourage close collaboration and innovation. We are continuing to review the situation with the well-being of InstaDeepers at the forefront of our minds.

Right to work: Please note that you will require the legal right to work in the location you are applying for.


