Staff ML Performance Engineer (Inference Optimisation)
As a Staff ML Performance Engineer, you will focus on optimizing machine learning inference for edge devices and GPUs, working on large transformer-based models. You will collaborate with model developers, profile and optimize the full inference stack, and contribute to technical roadmaps and tooling.