Machine Learning Engineer, Platform

AION
London
2 weeks ago

About AION

AION is building an interoperable, decentralized AI cloud platform for high-performance computing (HPC). Purpose-built for bare-metal performance, AION democratizes access to compute and provides managed services. It aims to be an end-to-end AI lifecycle platform, taking organizations from data to deployed models through its forward-deployed engineering approach.

AI is transforming businesses around the world, and demand for compute is surging. AION strives to be the gateway for dynamic compute workloads by building integration bridges with diverse data centers around the world and re-inventing the compute stack via its state-of-the-art serverless technology. Enterprises are finding it hard to balance AI adoption with security. At AION, we take enterprise security and compliance very seriously and are re-thinking every piece of infrastructure, from hardware and network packets to API interfaces.

Led by high-pedigree founders with previous exits, AION is well funded by major VCs and has strategic global partnerships. Headquartered in the US with a global presence, the company is building its initial core team in India and the UK.

Who You Are

You're a hands-on ML engineer with 4-6 years of experience building and fine-tuning large language models (LLMs) and transformer-based models. You're execution-focused and thrive on solving challenging problems at the intersection of machine learning research and production systems.

You're comfortable working across the ML development lifecycle—from data preparation and model fine-tuning to evaluation and optimization. You understand both what makes a model perform well and how to systematically improve model quality through experimentation. Experience with LLM fine-tuning (LoRA, QLoRA), RLHF pipelines, and comprehensive model evaluation is highly desirable. You bring strong ownership, initiative, and the drive to build production-ready ML models that impact thousands of developers globally.

Requirements

What You'll Do

ML Model Development & Optimization

  • Design and implement end-to-end LLMOps pipelines for model training, fine-tuning, and evaluation
  • Fine-tune and customize LLMs (Llama, Mistral, Gemma, etc.) using full fine-tuning and PEFT techniques (LoRA, QLoRA) with tools like Unsloth, Axolotl, and HuggingFace Transformers
  • Implement RLHF (Reinforcement Learning from Human Feedback) pipelines for model alignment and preference optimization
  • Design experiments for automated hyperparameter tuning, training strategies, and model selection
  • Prepare and validate training datasets—ensuring data quality, preprocessing, and format correctness
  • Build comprehensive model evaluation systems with standard and custom metrics (e.g., BLEU, ROUGE, perplexity, accuracy) and develop synthetic data generation pipelines
  • Optimize model accuracy, token efficiency, and training performance through systematic experimentation
  • Design and maintain prompt engineering workflows with version control systems
  • Deploy models using vLLM with multi-adapter LoRA serving, hot-swapping, and basic optimizations (speculative decoding, continuous batching, KV cache management)

ML Operations & Technical Leadership

  • Set up ML-specific monitoring for model quality, drift detection, and performance tracking with automated retraining triggers
  • Manage model versioning, artifact storage, lineage tracking, and reproducibility using experiment tracking tools
  • Debug production model issues and optimize cost-performance trade-offs for training and inference
  • Partner with infrastructure engineers on ML-specific compute requirements and deployment pipelines
  • Document model development processes and share knowledge through internal tech talks

Technical Skills & Experience

If you meet some of these requirements and feel comfortable catching up on others, we definitely recommend you
