Machine Learning Infrastructure Engineer [UAE Based]

AI71
London
10 months ago
Applications closed


Job Title: ML Infrastructure Senior Engineer

Location: Abu Dhabi, United Arab Emirates [Full relocation package provided]



Job Overview

We are seeking a skilled ML Infrastructure Engineer to join our growing AI/ML platform team. This role is ideal for someone who is passionate about large-scale machine learning systems and has hands-on experience deploying LLMs/SLMs using advanced inference engines like vLLM. You will play a critical role in designing, deploying, optimizing, and managing ML models and the infrastructure around them for inference, fine-tuning, and continued pre-training.


Key Responsibilities

· Deploy large and small language models (LLMs/SLMs) using inference engines (e.g., vLLM, Triton).

· Collaborate with research and data science teams to fine-tune models or build automated fine-tuning pipelines.

· Extend inference capabilities by integrating advanced features such as multimodality, real-time inference, model quantization, and tool calling.

· Evaluate and recommend optimal hardware configurations (GPU, CPU, RAM) based on model size and workload patterns.

· Build, test, and optimize LLM inference for consistent model deployment.

· Implement and maintain infrastructure-as-code to manage scalable, secure, and elastic cloud-based ML environments.

· Ensure seamless orchestration of the MLOps lifecycle, including experiment tracking, model registry, deployment automation, and monitoring.

· Manage ML model lifecycle on AWS (preferred) or other cloud platforms.

· Understand LLM architecture fundamentals to design efficient scalability strategies for both inference and fine-tuning processes.


Required Skills


Core Skills:

· Proven experience deploying LLMs or SLMs using inference engines like vLLM, TGI, or similar.

· Experience in fine-tuning language models or creating automated pipelines for model training and evaluation.

· Deep understanding of LLM architecture fundamentals (e.g., attention mechanisms, transformer layers) and how they influence infrastructure scalability and optimization.

· Strong understanding of hardware-resource alignment for ML inference and training.

Technical Proficiency:

· Programming experience in Python and C/C++, especially for inference optimization.

· Solid understanding of the end-to-end MLOps lifecycle and related tools.

· Experience with containerization, image building, and deployment (e.g., Docker; Kubernetes optional).

Cloud & Infrastructure:

· Hands-on experience with AWS services for ML workloads (SageMaker, EC2, EKS, etc.) or equivalent services in Azure/GCP.

· Ability to manage cloud infrastructure to ensure high availability, scalability, and cost efficiency.


Nice-to-Have

· Experience with ML orchestration platforms like MLflow, SageMaker Pipelines, Kubeflow, or similar.

· Familiarity with model quantization, pruning, or other performance optimization techniques.

· Exposure to distributed training frameworks like Unsloth, DeepSpeed, Accelerate, or FSDP.
