What you will be doing :
As a Site Reliability Engineering (SRE) Manager, you will
- Take ownership of your team, being responsible for current team members’ growth and development, plus hiring and onboarding new team members
- Create a positive environment where your team members thrive to deliver the best outcomes and innovations
- Be a role model for your team, mentoring and coaching them, whilst having a learning mindset yourself, being open to new ideas and technologies
- Within the context of our broader technology vision, set the direction for your team and take accountability for tech decisions
- Use your specific experience working with cloud systems to input into technical decision-making
- Work with other stakeholders across engineering to ensure the systems and services your team provides meet the needs of your internal customers
- Collaborate, both within your team and across the tribe to ensure your team’s implementation meets industry standards
The role reports to the Director of Infrastructure. You’ll be managing a team of DevOps Engineers focused on the provision and support of our DevOps and Observability suites for use across the organisation. These include our Grafana, Gitlab and Kubernetes-based CI/CD pipelines. Our CI/CD pipelines are built using Gitlab actions and are leveraging the latest technologies allowing us to provide great abstraction to the development teams supporting their custom deployment needs on Kubernetes.
Our tech stack:
ComplyAdvantage is fully cloud-based, with a modern kubernetes-focused tech stack. All compute workloads run in Kubernetes, with clusters in multiple regions to support the needs of our global client base. Our production services are multi-cloud by design and are currently hosted in AWS and GCP.
We make heavy use of Terraform and Helm to define our infrastructure and services, and lean heavily on GitOps paradigms - production and non-production environments are defined in git and changes to these environments (both cloud infrastructure and Kubernetes applications) are managed via git.
ArgoCD is our tool of choice for controlling our deployments, and paired with our istio mesh, allows us for advanced deployment patterns used by our development teams such a progressive rollouts. Our observability stack consists of Grafana Cloud, along with some on-prem Mimir, amongst others. We focus on Open Telemetry for application metrics, with SLO and metric driven alerting at all levels, from Cloud infra through to application performance.
Across the wider Technology team, teams build and release containerised applications to support the wide array of activities that our teams are engaged in - from developing low latency client-facing APIs, to machine learning models and data processing pipelines.
About you:
As an Site Reliability Engineering (SRE) Manager, you will
- Have experience of managing and growing high performing engineering teams
- Have experience with Kubernetes and Terraform
- Have experience hosting microservices-based architectures
- Have experience of working with cloud native architectures (AWS and GCP are preferred)
- Have good communication and writing skills including experience writing technical documentation
Nice to haves:
- Experience of working in a start-up/ scale-up environment
- Have experience managing observability platforms, whether self-hosted or third party - eg Grafana stack, Datadog, NewRelic
- Have experience managing pipeline tools, whether self-hosted or third party - eg CircleCI, ArgoCD, Harness, etc
Benefits
- Equity participation in our innovative mission to combat financial crime
- Unlimited Time Off Policy to promote work-life balance and well-being
- We embrace a hybrid approach that requires employees to be in the office for two days a week. We strongly believe that this approach fosters collaboration and enables the building of meaningful relationships
- Opportunities for collaboration and career development with smart, like-minded professionals
- Annual learning budget to support professional growth
- A home office budget to support working from home
- Enhanced parental leave and childcare benefits
- Life insurance and medical coverage through Vitality, including pre-existing conditions
- Pension contribution through The People's Pension
About us:
ComplyAdvantage is the financial industry’s leading source of AI-driven financial crime risk data and detection technology. Our mission is to neutralise the risk of money laundering, terrorist financing, corruption, and other financial crime.
More than 1000 companies rely on us to understand the risk of who they’re doing business with through the world’s only global, real-time database of people and companies. Our solutions identify thousands of risk events daily from millions of structured and unstructured data points.
We have five global hubs in New York, London, Singapore, Lisbon and Cluj-Napoca and are backed by Goldman Sachs, Ontario Teachers, Index Ventures, and Balderton Capital.
Since 2014, we have raised over $100 million in funding, and in 2022 alone grew by over 40% to over 500 people globally. Over the next 12 months, as our revenue increases, we plan to increase to 600.
At ComplyAdvantage diversity fuels our rocket ship and our commitment to inclusion across race, gender, age, religion, identity and experience drives us forward every day. We encourage everyone to apply and aspire to consider every application fairly.
We will handle your information in accordance with our Privacy Policy. For further information, please click