Senior Systems Software Engineer, Kubernetes Scale - DGX Cloud
This role involves driving performance and scalability of the NVIDIA DGX Cloud software stack, focusing on Kubernetes and NVIDIA AI infrastructure. The engineer will diagnose complex distributed systems issues, develop automated testing frameworks, and optimize large-scale AI workloads from orchestration to hardware. Collaboration with AI teams and open-source communities is key to advancing real-world AI deployment at scale.