Job summary
Data Scientist with a strong engineering background and experience with real-time data processing. 3-4 days on site in Paddington.
Key skills required for this role
Data Science - quantitative finance or trading research - Data Engineer
Important
Senior Data Scientist
Job description
About the Role
We are seeking a highly skilled Senior Data Scientist to design and implement a robust data pipeline for an AI/LLM-driven trading platform. The ideal candidate will be responsible for collecting, extracting, cleaning, normalizing, and structuring large-scale data from sources (e.g., financial articles, books, and research papers). This role requires deep expertise in data engineering, NLP, and financial market data to ensure high-quality datasets for machine learning and trading strategy development.
Key Responsibilities
Data Pipeline Development: Build and maintain a scalable and efficient data pipeline that processes unstructured data for AI/LLM models.
Unstructured Data Processing: Extract and preprocess text from books, articles, and research reports to be utilized in AI/LLM-based analytics.
ETL & Data Engineering: Develop ETL processes to ingest and transform data from multiple sources into a unified format.
Data Quality Assurance: Implement validation and anomaly detection mechanisms to ensure data integrity and accuracy.
Collaboration: Work closely with quant researchers, AI/ML engineers, and trading teams to ensure seamless data flow and accessibility.
Automation & Optimization: Automate data ingestion, transformation, and storage processes for efficiency and scalability.
Requirements
Strong academic background in Data Science, Machine Learning, Artificial Intelligence, or a related field.
Experience in building and optimising data pipelines and workflows.
Proficiency in Python, SQL, and ML libraries (TensorFlow, PyTorch, scikit-learn, etc.) and at least one Object-Oriented Programming (OOP) language (e.g., Rust, C++, C#, …).
Hands-on experience with LLMs, NLP techniques, and AI-driven data processing.
Experience working with financial datasets and an understanding of market data structures is preferred.
Knowledge of APIs, cloud computing (AWS, GCP, Azure), and database management.
Strong problem-solving skills with the ability to work on complex data-driven projects.
Nice to Have
Background in quantitative finance or trading research.
Experience in real-time data processing and streaming architectures.
Exposure to cryptocurrency markets.
Matchtech is a STEM Recruitment Specialist, with over 40 years’ experience