A global investment firm is hiring a Lead Machine Learning Engineer. Hybrid position. Paying up to £1600 per day. Outside IR35
Experience and skills required for the Senior Machine Learning Engineer (AI), Quantization, PEFT, DeepSpeed, ONNX, TensorRT, PyTorch, multi-LoRa, LoRA Exchange, TitanML ️
Strong experience working with inference servers like multi-LoRa, LoRA Exchange, TitanML ️
Experience with HuggingFace
Retrieval-augmented generation, embedding pipelines
Previous industry exposure to multi-node HPC clusters ️
Experience with Large language models - OpenAI, Mistral, Claude, LLaMA
Large language model for GPU usage, scaling, and data movement
Quantization, PEFT, DeepSpeed, ONNX, TensorRT
Deep Learning with PyTorch
Role and responsibilities for the Senior Machine Learning Engineer (AI), Quantization, PEFT, DeepSpeed, ONNX, TensorRT, PyTorch, multi-LoRa, LoRA Exchange, TitanML ️
Build and fine-tune NLP and vision models using frameworks like PyTorch and HuggingFace
Develop retrieval-augmented generation (RAG) pipelines and custom embeddings
Ship internal tools with light front end/back end components
Work closely with product and data leads to select and integrate third-party AI services
Own infrastructure decisions and help define internal ML engineering best practices
Package for the for the
Senior Machine Learning Engineer, Quantization, PEFT, DeepSpeed, ONNX, TensorRT, PyTorch, multi-LoRa, LoRA Exchange, TitanML ️
Hybrid
Outside IR35
Paying between £1300 to £1600 per day, depending on experience
TPBN1_UKTJ