An exceptional opportunity to join an innovative, high-growth organisation shaping the future of AI-powered automation and digital interaction.
We’re seeking a
Machine Learning Engineer with full-stack development experience
to work on cutting-edge projects involving
Generative AI ,
Retrieval-Augmented Generation (RAG) , and
multi-agent reasoning frameworks .
This is a hands-on, end-to-end engineering role with impact across the full ML lifecycle – from experimentation to deployment.
Conversational AI & Reasoning:
Design, fine-tune, and deploy advanced LLMs with agentic capabilities
RAG Pipelines:
Build and optimise scalable pipelines for structured and unstructured data retrieval
LLM Training & Fine-Tuning:
Use methods like LoRA, QLoRA, SFT, PEFT, and RLHF
Inference & Acceleration:
Serve models using vLLM, DeepSpeed, Triton, TensorRT
Multi-Agent Orchestration:
Work with LangChain, AutoGen, CrewAI, DSPy and similar tools
Cloud & MLOps (AWS):
Deploy with SageMaker, Bedrock, Lambda, S3, ECS, EKS
Full-Stack Integration:
Build APIs (FastAPI, Flask) and integrate with React, TypeScript, Node.js
Vector Search:
Use FAISS, Weaviate, Pinecone, ChromaDB, OpenSearch
Required skills & experience:
3–5+ years of experience in ML engineering and software development
Deep Python proficiency, with PyTorch, TensorFlow or Hugging Face
Proven experience with LLMs, RAG, and deploying cloud-native AI on AWS
Strong full-stack skills (React, TypeScript, Node.js) and API development
Familiarity with vector databases and multi-agent frameworks
Apply now to join this high growth and award-winning organisation with the opportunity to be part of building the future of AI driven projects and solutions. The role offers a highly competitive salary and benefits package and will be office based in London.