SoundCloud empowers artists and fans to connect and share through music. Founded in 2007, SoundCloud is an artist-first platform empowering artists to build and grow their careers by providing them with the most progressive tools, services, and resources. With over 400+ million tracks from 40+ million artists, the future of music is SoundCloud.
We are looking for a Senior Data Engineer to join the Soundcloud Search team (part of the Music Discovery group), with a mission to help users find and play what they’re looking for and enable them to explore further to discover music that exists nowhere else and connect directly with the artists that make it. You will be crucial in shaping the data infrastructure that powers our search systems, ensuring high-quality, scalable, and real-time data flows. Collaborating with a multidisciplinary team, you will contribute to building robust data pipelines that power all our feedback cycles, from feeding our Machine Learning algorithms to enhancing the search performance and user experience.
About the role:
As a Data Engineer, you will work on building and optimizing high-end data pipelines that drive key performance indicators (KPIs) and train our ML models. You will also be working on the real-time processing of the data pipelines that power search features for millions of users. You’ll work with a range of technologies, including Python and Scala, BigQuery and BigTable, Spark and Dataflow, and collaborate with other teams to ensure data is structured and prepared for both search systems and machine learning models.
About you:
You have a strong background in data engineering (at least 5 years), with strong foundations on algorithms and data structures. You have experience designing and optimizing data pipelines, data architecture and modeling, and ETL processes for large-scale (distributed) systems. You are comfortable working with millions of data points regularly and are passionate about ensuring data quality and consistency, which are key to improving the overall search experience. You are proficient in SQL and Python, ideally also experienced in Scala or Java. You also have experience training, prototyping, and deploying machine learning models. Experience in Search or Recommender Systems is a plus.
You are proactive, detail-oriented, and committed to learning. You thrive in a collaborative, Agile environment, working closely with cross-functional teams to tackle complex data challenges. You are a problem-solver at heart, driven by a passion for data and how it can enhance user experiences. While independent in your work, you excel in a team setting, contributing to a culture of excellence and shared success.