Data Engineer in Madrid or Remote


Share offer

Job Description

We are on a mission to make science open so everyone can live healthy lives on a healthy planet

Who we are

Frontiers is an award-winning open science platform and leading open access scholarly publisher.

We are one of the largest and most cited publishers globally. To date, our 200,000 freely available research articles have received more than 1 billion views and downloads and 2 million citations. Our journals span science, health, humanities and social sciences, engineering, and sustainability. And we continue to expand into new academic disciplines so more researchers can publish open access.

Be part of the publishing revolution and help us transform the way research is published, evaluated, and communicated to the world.

We are looking for a Data Engineer to help us improve the quality, scalability and resilience of our data-intensive applications in the company.

The ideal candidate will have hands-on experience designing data-intensive applications and big data solutions, and we also expect a good knowledge of all types of data storage: structured files, relational databases, documental databases, graph databases, etc.

The Role

  • Understand the functional requirements for defining the best data models and data flows between our applications, services, data storages and synchronization mechanisms

  • Integrate, transform, and consolidate data from various structured and unstructured data systems into structures that are suitable for building analytics solutions

  • Ensure that our data applications/processes are scalable, reliable, secure, extensible, traceable, available and manageable.

  • Design, implement, monitor, and optimize our data platforms

  • Support the different SW development teams in the modelling, design, construction, evolution, and decommission of their data-intensive applications and data models

  • Understand and promote the best Data frameworks and solutions, technical standards and key technologies, to effectively support existing and future business requirements

Key Responsibilities

  • Work closely with IT Architects to provide overall consistent and reliable data solutions for all the applications ecosystem

  • Create a partnership with Scrum teams and POs, understanding the application and business requirements, and helping them understand the data through exploration, building and maintaining secure and compliant data processing pipelines

  • Collaborate closely with Machine Learning and Data Science Team to improve the performance of our ML pipelines.

  • Create models and prototypes that validate your ideas, before bringing them to the development team

  • Create and maintain up to date the documents describing the Data strategy of your applications domain, as well as all relevant guidelines and standards

Tech Stack & Key Requirements

  • Python

  • Spark and PySpark

  • Excellent understanding of parallel processing and data architecture patterns

  • Solid knowledge on DataBricks,

  • ElasticSearch and DeltaLake are a nice-to-have

  • Experience on building Data Lakes

  • Expertise in data processing: data ingest and transformation, batch processing, streaming data processing, distributed processing, monitoring, optimization, logging

  • Experienced in troubleshooting data processing and data storage

  • Knowledge of data security standards

  • Knowledge of serving layer design: star schema, dimensions, incremental loading, stores

  • Knowledge of physical data storage structures: compression, partitioning, sharding, redundancy, distributions, archiving


With more than 50 nationalities represented in our global team, you will work regularly with teammates in other countries, and with our community of researchers, editors, and authors from around the globe.

Our mission to create solutions for healthy lives also extends to the working environment we provide for our employees.

This includes:

100% remote working

Employees now have the flexibility to choose where they want to work, with remote working available on a part- or full-time basis.

Learning and development

All employees have access to LinkedIn Learning (and Pluralsight for our technology team), an annual personal learning budget, and dedicated L&D time.


We offer free online yoga classes, an employee assistance plan, access to the Headspace app, and four wellbeing days on top of your annual leave allowance.

Volunteering opportunities

Employees can dedicate three days each year to volunteer for a personal cause or through our volunteering partner platform, Alaya.

Frontiers actively embraces diversity and is a safe and welcoming workplace. Recruitment is free from discrimination – including based on race, national or ethnic origin, age, religion, disability, sex, gender identity or sexual orientation. With over 600 employees from more than 50 different nations, our diversity creates vibrant teams and constantly challenges us to appreciate multiple perspectives.


About Frontiers


Other data engineer jobs that might interest you...