Dataflow Development Engineer - LPU Hardware DataFlow

3 months ago
Applications closed

Related Jobs

View all jobs
Spotlight

Senior ML Compiler Engineer

Fractile Bristol, United Kingdom
Spotlight

Forward Deployed Engineer

SolveAI London, United Kingdom
Hybrid

Senior Machine Learning Applications and Compiler Engineer, LPX

NVIDIA Cambridge, United Kingdom
Hybrid

Senior Machine Learning Applications and Compiler Engineer, LPX

Hybrid

Senior Frontend Software Engineer - Simulation Workbench

PhysicsX London, United Kingdom
On-site

Core Services - Software Engineer (Frontend)

PhysicsX London, United Kingdom
On-site

Associate Director, Data Engineering

Relation Therapeutics London, United Kingdom
On-site

AI Deployment Engineer

Artis Recruitment Wc1A1Ap, WC1A 1AP, United Kingdom
£100,000 – £125,000 pa Hybrid
Posted
16 Mar 2026 (3 months ago)

NVIDIA is known as the "AI Computing Company." Our GPUs power modern Deep Learning software frameworks, accelerated analytics, data centers, and autonomous vehicles. We are looking for a Dataflow Development Engineer - LPU Hardware to join our team and develop, build, and improve dataflow systems at the hardware–software boundary. You will work on FPGA accelerator dataflow: implementing and tuning dataflow pipelines, creating host-side drivers and runtimes that collaborate with programmable logic, and jointly inventing hardware and software for deterministic, low-latency execution.

Dataflow development engineers at NVIDIA connect FPGA and custom hardware with our software systems. You will implement dataflow graphs and streaming pipelines in hardware. You will build efficient host–device interfaces (PCIe, DMA, VFIO) and collaborate with compiler and architecture teams to map high-level dataflow onto FPGA and accelerator fabrics. Your work directly affects latency, efficiency, and resource usage for inference at scale. The ideal candidate has a proven hardware approach, including experience with FPGA development, HDL, or hardware/software co-design. They can analyze timing, resource usage, and data movement. We seek engineers comfortable working from RTL to runtime. They consider pipelines and hardware performance and enjoy implementing dataflow architectures in silicon and programmable logic.

What you'll be doing:

  • Build and implement dataflow pipelines and streaming architectures in FPGA or programmable logic.

  • Develop host-side software, drivers, and runtimes that collaborate with FPGA and accelerator hardware (e.g. PCIe, DMA, VFIO).

  • Partner with compiler and hardware groups to allocate dataflow graphs onto hardware resources; improve latency, processing efficiency, and area/utilization.

  • Build and maintain hardware–software co-design flows: from high-level dataflow specs to synthesis, place-and-route, and validation.

  • Build tooling and methodologies for debugging, profiling, and validating dataflow behavior in hardware; participate in design reviews and cross-team alignment across EMEA and globally.

What we need to see:

  • BS or higher degree or equivalent experience in CS/EE/CE with more than 5 years in FPGA development, hardware dataflow, or hardware/software co-design.

  • Hands-on experience with RTL/HDL (Verilog, VHDL) or high-level synthesis (HLS); ability to build and debug dataflow-style pipelines in hardware.

  • Solid programming abilities in C/C++ for host drivers, runtimes, or tooling; familiarity with hardware interfaces (e.g. PCIe, DMA, memory-mapped I/O).

  • Proven understanding of dataflow and streaming concepts: pipelining, backpressure, buffering, and resource/area trade-offs.

  • Familiarity with FPGA toolchains (synthesis, P&R, timing closure) and with Linux, scripting, and version control.

  • Excellent communication in English; ability to work with distributed teams.

Ways to stand out from the crowd:

  • Experience working with FPGA dataflow for machine learning inference, networking, or high-throughput streaming (e.g. Xilinx/AMD, Intel FPGA).

  • VFIO, SR-IOV, or other pass through/virtualization for accelerators; low-level driver or BSP development.

  • ASIC or custom-silicon dataflow build; RTL develop for dataflow or network-on-chip (NoC).

  • Background in compiler backends or HLS that targets FPGAs; MLIR or IR-level optimization for hardware mapping.

  • Experience with multi-FPGA or FPGA–GPU systems; distributed dataflow across programmable logic and accelerators.

Join our team of world-class engineers and be part of the groundbreaking work we do at NVIDIA. We are committed to encouraging a collaborative and inclusive environment, where every team member has the opportunity to thrive and make a significant impact!

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Where to Advertise AI Jobs in the UK (2026 Guide)

Where to advertise AI jobs UK in 2026: the specialist boards and communities that reach AI engineers, ML scientists and applied research talent in the UK. The candidate pool is small, highly informed and in demand across multiple sectors simultaneously. General job boards reach a broad audience but lack the specificity that AI professionals expect — and the filtering mechanisms they rely on. Specialist platforms, direct outreach and academic channels each serve a different part of the market. This guide, published by ArtificialIntelligenceJobs.co.uk, covers where to advertise AI roles in the UK in 2026, how the main platforms compare, what employers should expect to pay, and what the data says about time-to-hire across different role types.