Senior Data Platform Engineer


Job details
  • Causaly
  • London
  • 1 month ago
Applications closed

About us 

Founded in 2018, Causaly accelerates how humans acquire knowledge and develop insights in Biomedicine. Our production-grade generative AI platform for research insights and knowledge automation enables thousands of scientists to discover evidence from millions of academic publications, clinical trials, regulatory documents, patents and other data sources… in minutes. 

We work with some of the world's largest biopharma companies and institutions on use cases spanning Drug Discovery, Safety and Competitive Intelligence. You can read more about how we accelerate knowledge acquisition and improve decision making in our posts on the Causaly blog.

We are backed by top VCs including ICONIQ, Index Ventures, Pentech and Marathon. 

About the role: 

We are looking for a Senior Data Engineer with experience in data pipelines, backend architectures, ETL, cloud and related fields. You will join and help grow our established Data & Semantic Technologies team, which is responsible for designing and building the highly scalable and flexible data backend that Causaly needs to make its vision a reality. You will work on incremental data pipelines that support both batch and targeted updates, grow and maintain massive knowledge graphs and ontologies, and feed our constantly growing data warehouse, among other things. You will enable and empower the Applied AI and Application teams, and be responsible for linking their outcomes to create true business value.

We are looking for innovative engineers who are capable, talented, engaged and passionate about creating industry-strength architectures and solutions that unleash the value of data. We are a multi-disciplinary team working in a fast-paced and collaborative environment, and we value honest opinions and open debate. Do you have a true problem-solving mindset and a hands-on attitude? Are you keen to design and build innovative solutions that leverage the value of data, passionate and creative in your work, happy to share ideas with your team and able to pick the right tool for the job? Then you should become part of our journey!

What you can expect to work on:

  • Gather and understand data based on business requirements.
  • Import large datasets (millions of records) from various formats (e.g. CSV, XML, SQL, JSON) into BigQuery, then process them further there and combine them with external data sources.
  • Implement and maintain highly performant data pipelines with the industry’s best practices and technologies for scalability, fault tolerance and reliability.
  • Build the necessary tools for monitoring, auditing, exporting and gleaning insights from our data pipelines.
  • Work directly with a multitude of technical, product and business stakeholders.
  • Manage and maintain backend data processes related to data delivery, curation and machine learning operations.
  • Help to build a strong data-engineering function, mentor and guide other engineers, shape our technology strategy and innovate on our data backbone.

Requirements

Minimum Requirements

Successful candidates will have:

  • Master’s degree in Computer Science, Mathematics or a related technical field
  • 5+ years of experience in backend data processing and data pipelines
  • Excellent knowledge of Python and related libraries for working with data (e.g. pandas, Airflow)
  • Excellent SQL and database skills
  • Solid understanding of modern software development practices (testing, version control, documentation, etc.)
  • A product and user-centric mindset
  • Excellent problem-solving, ownership and organizational skills, with high attention to detail and quality

Preferred Qualifications

Any experience of the following will be considered a plus:

  • NoSQL and big data technologies (e.g. Spark, Hadoop)
  • Full-text search databases (e.g., ElasticSearch)
  • Knowledge graphs and graph databases (e.g., Neo4J)
  • MLOps / DataOps in production
  • Terraform, Kubernetes and/or Docker containers

Benefits

  • Competitive compensation package
  • Private medical insurance (underwritten on a medical health disregarded basis)
  • Life insurance (4 x salary)
  • Individual training/development budget through Learnerbly
  • Individual wellbeing budget through Juno
  • 25 days holiday plus public holidays and 1 day birthday leave per year
  • Hybrid working (home + office)
  • Potential to have real impact and accelerated career growth as an early member of a multinational team that's building a transformative knowledge product

Be yourself at Causaly... Difference is valued. Everyone belongs.

Diversity. Equity. Inclusion. They are more than words at Causaly. It's how we work together. It's how we build teams. It's how we grow leaders. It's what we nurture and celebrate. It's what helps us innovate. It's what helps us connect with the customers and communities we serve.

We are on a mission to accelerate scientific breakthroughs for ALL humankind and we are proud to be an equal opportunity employer. We welcome applications from all backgrounds and fairly consider qualified candidates without regard to race, ethnic or national origin, gender, gender identity or expression, sexual orientation, disability, neurodiversity, genetics, age, religion or belief, marital/civil partnership status, domestic / family status, veteran status or any other difference.
