Data Engineer en Madrid o en remoto

Frontiers

Categoría

Ingeniero de Datos

Industria

Industria Search Engine

Lugar de trabajo

En remoto

Horas

Full-Time

Prácticas

false

Habilidades

Spark Python Pyspark

Comparte la oferta

Descripción de la oferta

We are on a mission to make science open so everyone can live healthy lives on a healthy planet

Who we are

Frontiers is an award-winning open science platform and leading open access scholarly publisher.

We are one of the largest and most cited publishers globally. To date, our 200,000 freely available research articles have received more than 1 billion views and downloads and 2 million citations. Our journals span science, health, humanities and social sciences, engineering, and sustainability. And we continue to expand into new academic disciplines so more researchers can publish open access.

Be part of the publishing revolution and help us transform the way research is published, evaluated, and communicated to the world.

We are looking for a Data Engineer to help us improve the quality, scalability and resilience of our data-intensive applications in the company.

The ideal candidate will have hands-on experience designing data-intensive applications and big data solutions, and we also expect a good knowledge of all types of data storage: structured files, relational databases, documental databases, graph databases, etc.

The Role

Understand the functional requirements for defining the best data models and data flows between our applications, services, data storages and synchronization mechanisms
Integrate, transform, and consolidate data from various structured and unstructured data systems into structures that are suitable for building analytics solutions
Ensure that our data applications/processes are scalable, reliable, secure, extensible, traceable, available and manageable.
Design, implement, monitor, and optimize our data platforms
Support the different SW development teams in the modelling, design, construction, evolution, and decommission of their data-intensive applications and data models
Understand and promote the best Data frameworks and solutions, technical standards and key technologies, to effectively support existing and future business requirements

Key Responsibilities

Work closely with IT Architects to provide overall consistent and reliable data solutions for all the applications ecosystem
Create a partnership with Scrum teams and POs, understanding the application and business requirements, and helping them understand the data through exploration, building and maintaining secure and compliant data processing pipelines
Collaborate closely with Machine Learning and Data Science Team to improve the performance of our ML pipelines.
Create models and prototypes that validate your ideas, before bringing them to the development team
Create and maintain up to date the documents describing the Data strategy of your applications domain, as well as all relevant guidelines and standards

Tech Stack & Key Requirements

Python
Spark and PySpark
Excellent understanding of parallel processing and data architecture patterns
Solid knowledge on DataBricks,
ElasticSearch and DeltaLake are a nice-to-have
Experience on building Data Lakes
Expertise in data processing: data ingest and transformation, batch processing, streaming data processing, distributed processing, monitoring, optimization, logging
Experienced in troubleshooting data processing and data storage
Knowledge of data security standards
Knowledge of serving layer design: star schema, dimensions, incremental loading, stores
Knowledge of physical data storage structures: compression, partitioning, sharding, redundancy, distributions, archiving

Benefits

With more than 50 nationalities represented in our global team, you will work regularly with teammates in other countries, and with our community of researchers, editors, and authors from around the globe.

Our mission to create solutions for healthy lives also extends to the working environment we provide for our employees.

This includes:

100% remote working

Employees now have the flexibility to choose where they want to work, with remote working available on a part- or full-time basis.

Learning and development

All employees have access to LinkedIn Learning (and Pluralsight for our technology team), an annual personal learning budget, and dedicated L&D time.

Wellbeing

We offer free online yoga classes, an employee assistance plan, access to the Headspace app, and four wellbeing days on top of your annual leave allowance.

Volunteering opportunities

Employees can dedicate three days each year to volunteer for a personal cause or through our volunteering partner platform, Alaya.

Frontiers actively embraces diversity and is a safe and welcoming workplace. Recruitment is free from discrimination – including based on race, national or ethnic origin, age, religion, disability, sex, gender identity or sexual orientation. With over 600 employees from more than 50 different nations, our diversity creates vibrant teams and constantly challenges us to appreciate multiple perspectives.

Leer la descripción completa

Acerca de Frontiers

Sitio web
http://www.frontiersin.org
Industria
Search Engine
Sede central
Lausane, Switzerland
Tamaño de la compañía
200-500
Fundada
2007

Data Engineer en Madrid o en remoto

Frontiers

Descripción de la oferta

Leer la descripción completa

Acerca de Frontiers

Otras ofertas de ingeniero de datos que podrían interesarte...

Big Data Architect en Solera

Data Engineer en Plain Concepts

Data Engineer en Lastminute