Platform engineer MLOps UK

Writer
London
1 month ago
Applications closed

Related Jobs

View all jobs

Staff Software Engineer, MLOps (Remote within UK)

Lead Engineer - MLOps (Apply in 3 Minutes) ...

Lead Engineer - MLOps ...

Software Engineer (MLOps / LLMOps)

Principal MLOps Engineer - Chase UK

▷ (Apply Now) Principal MLOps Engineer - Chase UK ...

About this role

As a Platform engineer MLOps you will be critical to deploying and managing cuttingedge infrastructure crucial for AI/ML operations and you will collaborate with AI/ML engineers and researchers to develop a robust CI/CD pipeline that supports safe and reproducible experiments. Your expertise will also extend to setting up and maintaining monitoring logging and alerting systems to oversee extensive training runs and clientfacing APIs. You will ensure that training environments are optimally available and efficiently managed across multiple clusters enhancing our containerization and orchestration systems with advanced tools like Docker and Kubernetes.

This role demands a proactive approach to maintaining large Kubernetes clusters optimizing system performance and providing operational support for our suite of software solutions. If you are driven by challenges and motivated by the continuous pursuit of innovation this role offers the opportunity to make a significant impact in a dynamic fastpaced environment.

Your responsibilities:

  • Work closely with AI/ML engineers and researchers to design and deploy a CI/CD pipeline that ensures safe and reproducible experiments.

  • Set up and manage monitoring logging and alerting systems for extensive training runs and clientfacing APIs.

  • Ensure training environments are consistently available and prepared across multiple clusters.

  • Develop and manage containerization and orchestration systems utilizing tools such as Docker and Kubernetes.

  • Operate and oversee large Kubernetes clusters with GPU workloads.

  • Improve reliability quality and timetomarket of our suite of software solutions

  • Measure and optimize system performance with an eye toward pushing our capabilities forward getting ahead of customer needs and innovating for continual improvement

  • Provide primary operational support and engineering for multiple largescale distributed software applications

Is this you

  • You have professional experience with:

    • Model training

    • Huggingface Transformers

    • Pytorch

    • vLLM

    • TensorRT

    • Infrastructure as code tools like Terraform

    • Scripting languages such as Python or Bash

    • Cloud platforms such as Google Cloud AWS or Azure

    • Git and GitHub workflows

    • Tracing and Monitoring

  • Familiar with highperformance largescale ML systems

  • You have a knack for troubleshooting complex systems and enjoy solving challenging problems

  • Proactive in identifying problems performance bottlenecks and areas for improvement

  • Take pride in building and operating scalable reliable secure systems

  • Are comfortable with ambiguity and rapid change

Preferred skills and experience:

  • Familiar with monitoring tools such as Prometheus Grafana or similar

  • 5 years building core infrastructure

  • Experience running inference clusters at scale

  • Experience operating orchestration systems such as Kubernetes at scale

    Benefits & perks (UK fulltime employees):

    • Generous PTO plus company holidays

    • Comprehensive medical and dental insurance

    • Paid parental leave for all parents 12 weeks)

    • Fertility and family planning support

    • Earlydetection cancer testingthrough Galleri

    • Competitive pension scheme and company contribution

    • Annual worklife stipends for:

      • Home office setup cell phone internet

      • Wellness stipend for gym massage/chiropractor personal training etc.

      • Learning and development stipend

    • Companywide offsites and team offsites

    • Competitive compensation and company stock options

    #LIRemote


Key Skills
ASP.NET,Health Education,Fashion Designing,Fiber,Investigation
Employment Type :Full-Time
Experience:years
Vacancy:1

Get the latest insights and jobs direct. Sign up for our newsletter.

By subscribing you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

How to Advertise AI Jobs and List AI Vacancies: Advanced Recruitment Strategies for 2025

In a landscape where artificial intelligence (AI) is rapidly transforming industries—from healthcare and finance to manufacturing and creative fields—employers are in stiff competition to secure the best AI talent. Whether you’re a start-up looking for your first machine learning engineer or a global enterprise planning an AI research lab, knowing how to advertise AI jobs effectively has never been more critical. Below, you’ll find in-depth strategies for crafting compelling AI job adverts, optimising your recruitment funnel, and showcasing your organisation as an employer of choice for top AI specialists. We’ll also explore the importance of salary transparency, the best channels for promoting your AI vacancies, and advanced techniques for nurturing a culture of innovation.

AI Training Jobs: Your Comprehensive Guide to Launching a High-Potential Career

Artificial Intelligence (AI) has evolved from a futuristic concept to a core component of modern business strategy. As organisations increasingly embrace AI-driven systems to stay competitive, the demand for qualified professionals who can develop, implement, and train AI models has skyrocketed. In the UK—and indeed worldwide—there is a pressing need for skilled experts who understand the nuances of AI, from algorithm design to ethical considerations. For anyone seeking to enter this exciting field or pivot into a role focusing on AI training, the opportunities are abundant. This in-depth blog post will explore everything you need to know about AI training jobs, the essential skills you’ll need, the current employment landscape in the UK, and how to future-proof your career in AI.

Rural-Remote AI Jobs: A Breath of Fresh Air in the UK Tech Scene

A New Horizon for AI Professionals For years, conversations around tech careers in the UK have hinged on a central theme: to succeed in artificial intelligence (AI), you must be in or around London (or other big metropolitan areas like Manchester, Bristol, or Edinburgh). But times are changing. Technological leaps and the rise of flexible working are paving the way for AI professionals to live and work well beyond the capital. From the rugged coastlines of Cornwall and Pembrokeshire to the rolling hills of the Yorkshire Dales, we’re witnessing an exciting trend of AI remote countryside roles that allow you to work at the forefront of tech innovation—all while enjoying the tranquillity of rural or seaside living. At ArtificialIntelligenceJobs.co.uk, we’re seeing a marked increase in job postings and applications for these sorts of positions. A growing segment of job seekers is actively searching for “tech jobs by the sea” or “AI remote countryside,” driven by a desire for better work-life balance, lower living costs, and a healthier lifestyle. And it’s not just employees who stand to benefit; employers eager to attract top-tier AI talent are discovering that offering remote or flexible roles widens their candidate pool and enhances diversity. If you’re enticed by the idea of logging off from a day of coding neural networks and taking a stroll along a coastal path—or stepping outside your converted barn in Northumberland to soak in some fresh country air—this article is for you. Below, we’ll explore the benefits and challenges of rural-remote AI jobs, the specific roles best suited for remote work, and how to position yourself for success in this rapidly evolving sector.