Jobs

SRE - High-Performance Compute


Job details
  • Alexander Ash Consulting
  • London
  • 3 weeks ago

Consultant - High-Performance Computing - Global Quant Firm

Contract (transition to perm) - London, UK - Competitive

 

We are seeking a highly skilled and motivated Consultant to join a leading quant technology firm specializing in leveraging innovative data science technology to deliver valuable insights and solutions.

 

The primary focus of this role is High-Performance Computing (HPC). Candidates should have strong expertise in UNIX/Linux systems, programming (Bash, Python, C++), GPU optimization, and high-performance storage solutions. Experience working in ultra-low-latency environments could be beneficial.

 

Key Responsibilities:

System Design and Optimization:

  • Architect, deploy, and manage UNIX/Linux systems tailored for high-performance and high-frequency operations.
  • Implement kernel-level enhancements, such as real-time or low-latency kernels, CPU pinning, and HugePages, to boost system performance.
  • Optimize system settings to reduce latency, including IRQ balancing, process affinity, and memory management.

GPU Performance Tuning:

  • Enhance GPU performance to accelerate quantitative models and simulations, ensuring high throughput and low latency for crucial financial computations.
  • Conduct thorough tuning of GPU resources, focusing on memory management, parallel processing, and kernel optimization to maximize efficiency in high-frequency trading and complex data analysis.

HPC Programming and Optimization:

  • Develop and optimize system-level code using C++ and other languages to support HPC needs.
  • Apply advanced compiler optimizations and profiling tools, such as Intel VTune and perf, to identify and resolve performance issues.

Networking and Infrastructure:

  • Employ high-performance, low-latency network interfaces like Solarflare or Mellanox and apply kernel bypass techniques using DPDK or PF_RING.
  • Maintain precise time synchronization using Time-Sensitive Networking (TSN) and protocols like Precision Time Protocol (PTP) and GPS-based NTP servers.
  • Utilize and manage network monitoring tools like Corvil to track and reduce latency.

Sign up for our newsletter

The latest news, articles, and resources, sent to your inbox weekly.

Similar Jobs

SRE and Service Manager

What you’ll be doingLeadership: Lead a team of BT and partner engineers & analysts responsible for ensuring reliable, efficient and secure systems and delivery of end-to-end service journeys to internal customers.Strategy Development: Develop and implement strategies for effective and proactive monitoring and observability.GenAI and AIOps: Leverage GenAI and AIOps to...

BT Group Leeds

DevOps Engineer - SRE - London- Thriving Sports Betting Consultancy

Summary:A globally leading sports betting consultancy, focussing on vast amounts of data analytics and predictive modelling with machine learning, are looking for a DevOps engineer or SRE to join their agile, London-based team.The successful DevOps engineer or SRE will be architecting cloud infrastructure, and developing an AWS environment with Terraform,...

Oxford Knight London

DevOps Engineer – SRE – London

Summary:A globally leading sports betting consultancy, focussing on vast amounts of data analytics and predictive modelling with machine learning, are looking for a DevOps engineer or SRE to join their agile, London-based team.The successful DevOps engineer or SRE will be architecting cloud infrastructure, and developing an AWS environment with Terraform,...

Oxford Knight London

Site Reliability Engineer (SRE)

Job DescriptionThere's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.As a Site Reliability Engineer III at JPMorgan Chase within the Corporate and Investment Banking team for Management...

JP Morgan Chase Bank, National Association Glasgow

Site Reliability Engineering Manager

What you will be doing :As a Site Reliability Engineering (SRE) Manager, you willTake ownership of your team, being responsible for current team members’ growth and development, plus hiring and onboarding new team membersCreate a positive environment where your team members thrive to deliver the best outcomes and innovationsBe a...

ComplyAdvantage London

Senior DevOps Engineer

Posted byAssociate Delivery ConsultantSenior DevOps/Site Reliability Engineer - Global Quantitative Investment ManagementContract - Global Offices -petitiveWe are seeking a highly skilled and motivated Senior Site Reliability Engineers (SRE) and DevOps Engineers to join a leading quantitative technology firm specializing in leveraging innovative data science research and cutting-edge technology to deliver...

Alexander Ash Consulting London