Site Reliability Engineer

Thought Machine

London, United Kingdom

4 months ago

Applications closed

Related Jobs

View all jobs

Spotlight

Business Development Manager – New Business Sales

SenseAI Liverpool, Merseyside, United Kingdom

Hybrid

Spotlight

Director of Technology - AI

Digital Catapult Bristol, County Of Bristol, United Kingdom

£110,000 – £130,000 pa Hybrid

Site Reliability Engineer

Darktrace Cambridge, CB2 3BJ, United Kingdom

Hybrid

Senior Site Reliability Engineer - JA London

Spectrum IT Recruitment London, United Kingdom

£60,000 – £65,000 pa

Senior Site Reliability Engineer

Thought Machine London, United Kingdom

Hybrid

Senior Cloud Site Reliability Engineer

Wayve London, United Kingdom

On-site

Software Engineer, GPU Infrastructure- ChatGPT Engineering

OpenAI London, United Kingdom

Hybrid

Director of Engineering (ML Platform), London

Isomorphic Labs London, United Kingdom

On-site

Job Type: Permanent
Work Pattern: Full-time
Work Location: On-site
Seniority: Mid
Education: Degree
Posted: 19 Mar 2026 (4 months ago)

Benefits

Employee share package High Glassdoor rating Fantastic workplace culture

Save job

Create job alert

Applications closed

Job Type: Permanent
Work Pattern: Full-time
Work Location: On-site
Seniority: Mid
Education: Degree
Posted: 19 Mar 2026 (4 months ago)

Benefits

Employee share package High Glassdoor rating Fantastic workplace culture

Save job

Create job alert

Applications closed

Thought Machine’s mission is bold – to properly and permanently rid the world’s banks of legacy technology. To achieve this, we have developed the foundations of modern banking through core and payments technology which run natively in the cloud. What we are attempting is hard and means we need great people working together to build great technology.

We have grown rapidly in the past few years – growing our team to more than 550 individuals across offices in London, New York, Singapore and Sydney. We have raised more than $500m in funding and are now valued at $2.7bn. Our investors include Molten Ventures, Eurazeo, Intesa Sanpaolo, Temasek, Nyca Partners, JPMorgan Chase Strategic Investments, Standard Chartered Ventures, and more.

We have created a culture that enables our team to produce the best work in the industry while ensuring we have fun along the way. We're regularly cited as having a fantastic workplace culture and have been recognised by Sifted magazine as having one of the highest Glassdoor ratings for a UK fintech company and the industry's most generous employee share package. Named one of the world’s most innovative fintechs byGlobal Finance Magazine, we were also recognised by theFinancial Times as one of Europe’s fastest-growing companies for two consecutive years—and a UK Best Employer for 2026.

Thought Machine’s Site Reliability Engineers are the guardians of mission-critical systems for the world's most influential financial institutions. As a member of our elite, globally distributed team, you'll be entrusted with running and maintaining the robust production infrastructure that powers our customers' cutting-edge Core Banking and Payments platforms. This is an opportunity to make a tangible impact on the global financial landscape while collaborating with brilliant minds to solve complex engineering challenges.

This role will be part of the Site Reliability Engineering team at Thought Machine HQ in London. The team is deeply involved in tackling the technical challenges of executing Thought Machine’s growth ambitions - expect to be working with senior stakeholders in the organisation, our customers, and working on programmes and initiatives that are critical to the success of the company.

As an SRE at Thought Machine, you will be responsible for:

Supporting the product engineering teams in building highly fault-tolerant, scalable applications by participating in design discussions, engaging in RFCs and code reviews.
Contributing to the execution of department strategies such as implementing disaster recovery, backup, redundancy, and capacity planning activities.
Participating in a global on-call rotation responsible for identifying and fixing bottlenecks in SaaS customer environments.
Regular maintenance of production systems that host Vault products.
Contributing to the evolution of our SaaS products by building features that foster exceptional reliability and an unparalleled user experience.
Implementing and testing DR strategies to ensure the highest level of resilience and fault tolerance of the platform.
Maintaining high-quality written documentation of assets, processes and runbooks that are used by the team in their day-to-day operations.
Collaborating effectively with team members, actively participating in knowledge sharing, and continuously growing your own technical understanding of Vault Products.

What we’re looking for:

You have experience successfully delivering engineering tasks and projects with a focus on reliability and scalability.
You possess a good understanding of design patterns relevant to hosting and networking architectures.
You proactively champion product development, driven by a desire to build truly exceptional products, not just solve immediate challenges.
You have a strong background working in either Python, Golang or Java, having used one of these programming languages to build production level software.
You have experience working with Kubernetes or other container orchestration systems.
You have experience with automation/configuration management, e.g. Terraform, Puppet, Chef, Ansible.
You have a good understanding of one or more of the following areas: Database Administration, Networking, Observability Tools (such as Prometheus, Jaeger) or automation infrastructure.
You have solid experience working with either GCP or AWS.

Benefits:

Highly competitive salary
Pension plan (match up to 5%)
Life insurance - three times annual salary
Competitive maternity (six months fully paid) and paternity leave (four weeks fully paid)
Shared parental leave (matched to our maternity leave for the same point in time)
25 days holiday and bank holidays
Flexible working hours
Cycle-to-work scheme
Electric car scheme
Season ticket loan
Access to outstanding learning materials and courses
Sports and hobby clubs, subsidised by Thought Machine
All the latest tech you need
Start the day properly with fresh fruit and cereals
Huge range of healthy (and not-so-healthy) snacks, smoothies and drinks
A talented and experienced team as your colleagues
An environment where we encourage learning and progress
Two charity days a year
Weekly food pop-up

We actively hire candidates who demonstrate technical excellence in their field and welcome people of all ages and backgrounds, providing everyone with equal access to professional development. You are encouraged to apply even if your experience doesn't accurately match the job description. We also encourage applications from those with different abilities, including candidates with ADHD, autism, dyslexia or dyspraxia.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Jul 6, 2026

Jobs

How Hard Is It to Get an Artificial Intelligence Job in the UK? Competition, Success Rates & Hiring Timelines (2026)

Artificial intelligence jobs in the UK are competitive but winnable. See applicants-per-vacancy, interview odds, salaries and hiring timelines for 2026.

Jun 22, 2026

Jobs

What Is an AI Forward Deployed Engineer? The Fastest-Growing Job in AI for 2026

If you have been watching AI job boards over the past year, one title keeps surfacing again and again: the forward deployed engineer, or FDE. It has gone from a niche term known mainly to Palantir alumni to arguably the hottest role in the entire AI hiring market. Job postings for forward deployed engineers have exploded, salaries have climbed past levels most software engineers will ever see, and the biggest names in AI — OpenAI, Anthropic, Google, Salesforce, Databricks and Palantir — are all competing for the same small pool of talent. So what exactly is an AI forward deployed engineer, why has demand surged so dramatically, and how do you position yourself to land one of these roles? This guide breaks it all down for AI engineers, software engineers and data scientists looking at their next move.

Jun 22, 2026

Jobs

Artificial Intelligence Jobs in the UK (2026): Contractor Day Rates, IR35 Status & Freelance Demand

Artificial intelligence jobs in the UK on contract: 2026 day rates by seniority, IR35 status, umbrella vs limited take-home and where demand sits.