Senior Software Engineer, RL Post-Training Frameworks
In this role, the engineer designs and builds scalable reinforcement learning (RL) post-training infrastructure that supports the full lifecycle of training-inference-rollout loops across heterogeneous hardware. The engineer contributes to open-source RL frameworks, optimizes distributed systems for performance and resilience, and collaborates with research and hardware teams to advance AI capabilities. The work requires deep integration with PyTorch, Kubernetes, and high-performance computing environments to support training of next-generation AI models.