Machine Learning Infrastructure Engineer [UAE Based]

AI71
London
7 months ago
Applications closed

Job Title: ML Infrastructure Senior Engineer

Location: Abu Dhabi, United Arab Emirates [Full relocation package provided]



Job Overview

We are seeking a skilled ML Infrastructure Engineer to join our growing AI/ML platform team. This role is ideal for someone who is passionate about large-scale machine learning systems and has hands-on experience deploying LLMs/SLMs using advanced inference engines like vLLM. You will play a critical role in designing, deploying, optimizing, and managing ML models and the infrastructure around them, for inference, fine-tuning, and continued pre-training.


Key Responsibilities

· Deploy large or small language models (LLMs/SLMs) using inference engines such as vLLM or Triton.

· Collaborate with research and data science teams to fine-tune models or build automated fine-tuning pipelines.

· Extend inference-level capabilities by integrating advanced features such as multi-modality, real-time inference, model quantization, and tool-calling.

· Evaluate and recommend optimal hardware configurations (GPU, CPU, RAM) based on model size and workload patterns.

· Build, test, and optimize LLM inference for consistent model deployment.

· Implement and maintain infrastructure-as-code to manage scalable, secure, and elastic cloud-based ML environments.

· Ensure seamless orchestration of the MLOps lifecycle, including experiment tracking, model registry, deployment automation, and monitoring.

· Manage the ML model lifecycle on AWS (preferred) or other cloud platforms.

· Understand LLM architecture fundamentals to design efficient scalability strategies for both inference and fine-tuning processes.


Required Skills


Core Skills:

· Proven experience deploying LLMs or SLMs using inference engines like vLLM, TGI, or similar.

· Experience in fine-tuning language models or creating automated pipelines for model training and evaluation.

· Deep understanding of LLM architecture fundamentals (e.g., attention mechanisms, transformer layers) and how they influence infrastructure scalability and optimization.

· Strong understanding of hardware-resource alignment for ML inference and training.

Technical Proficiency:

· Programming experience in Python and C/C++, especially for inference optimization.

· Solid understanding of the end-to-end MLOps lifecycle and related tools.

· Experience with containerization, image building, and deployment (e.g., Docker; Kubernetes experience is optional).

Cloud & Infrastructure:

· Hands-on experience with AWS services for ML workloads (SageMaker, EC2, EKS, etc.) or equivalent services in Azure/GCP.

· Ability to manage cloud infrastructure to ensure high availability, scalability, and cost efficiency.


Nice-to-Have

· Experience with ML orchestration platforms like MLflow, SageMaker Pipelines, Kubeflow, or similar.

· Familiarity with model quantization, pruning, or other performance optimization techniques.

· Exposure to distributed or memory-efficient training tools like Unsloth, DeepSpeed, Accelerate, or FSDP.
