Filled
This offer is not available anymore

Site Reliability Engineer in Madrid or Remote

Carto

Category
Workplace
Remote
Hours
Full-Time
Internship
No
Share offer

Job Description

Everything happens somewhere - which is why spatial analytics is fundamental to companies trying to understand the “where” and the “why” of their business. CARTO is the leading cloud-native location intelligence platform, trusted by data scientists, data analysts, and developers from companies such as Vodafone, IKEA, Decathlon, Coca-Cola, and Mastercard to provide geospatial insights for use cases such as site selection, geomarketing, route optimization, network planning and much more.

With an exceptionally diverse team of 190 people spread across the US and Europe, CARTO (backed by Accel Partners, Salesforce Ventures, Earlybird Ventures, and Kibo Ventures, among others) is changing the way companies analyze location data - making it simple to do this straight out of modern, cloud data warehouses. Redefining its category, the company has grown rapidly in recent years providing a compelling alternative to legacy GIS software.

To continue this growth, the Infrastructure team is looking for an enthusiastic Site Reliability Engineer to join our 7-people infrastructure team and help us scale up a product that is constantly changing (new functionalities) with a 100% cloud-native infrastructure.

The team you will join: We are a 7-people team that includes the IT area owner and the Professional services one, who is responsible for projects that require custom developments not covered by our main product. We are a young but talented team, where we are learning from each other every day and with a perfect balance between those with more expertise (oncall, platform stabilization, debugging skills …) and those that were already born with Kubernetes, docker, cloud technologies and DevOps practices.

Location: The position is open to remote. In fact, we are mostly working remotely right now, but we do have offices in Madrid, and Seville in Spain, but any location in Spain would work!

You will

    • Write IaC (Infrastructure as a code) using Terraform, it should fit the three environments we work with (dev, staging, production).
    • Understand and contribute to CI / CD tasks (Google Cloud Build, Github Actions).
    • Develop new tools (Python, Node.js, Go …) that will help us to provide a better service to both our internal teams and our clients.
    • Develop HELM charts, to keep improving our Self Hosted version and make it more robust and compatible with cloud providers k8s services (GKE, AKS, EKS …)
    • Observability is a first-class citizen for us, so you will work with several monitoring services: grafana, prometheus, honeycomb, logstash, google cloud monitoring, pagerduty.
    • Provide support to other teams
    • Responsible for keeping the platform/infrastructure in a good shape: we have two levels of support (Oncall and working hours) and we need to check any kind of alert (critical, or not) throughout the day, precisely to avoid non-critical issues to become critical ones. Additionally, researching any issue until we get to the bottom is mandatory, no matter if it comes from a critical alert, or not.
    • Keep the infrastructure up to date: updating the different infrastructure pieces and components is a must, in order to avoid them becoming “legacy” parts, with the ultimate goal of aligning working methodologies and best practices (Terraform), even for those parts that are already “legacy”.
    • Proactively, perform any kind of improvement or refactor that will place our infrastructure in a better position.

Responsibilities

    • Keep growing the infrastructure in the cloud(s) in an agile and orderly manner, following best devops practices: IaC, metrics, monitoring…
    • Ensure the platform is in good shape and healthy by checking any kind of alert and researching any issue detected in the platform.
    • Gradually modernize the infrastructure by unifying working methodologies (past, present, future).
    • Perform any kind of improvement or refactoring that will help to have a more resilient infrastructure.
    • Provide support to other teams
    • Contribute and improve Kubernetes HELM chart for our Self Hosted version.

In one month…

    • You will get to know your buddies, who will help you during the Onboarding process and will be your reference point for any kind of question you may have over this period. The rest of the team too, as well as the day-to-day way of working.
    • You will complete your onboarding sessions thanks to the CARTO Academy material and to the trainings that you will attend either from the Infrastructure team or from any other team, like the Product one.
    • You will attend 4 sync meetings with the rest of the Infrastructure team, on a weekly basis with a 30 minutes duration, where you will learn the tasks that your teammates are working on and you will have the chance to receive or provide some advice about how to fix a complex issue discussed during the meeting.
    • You will work in operational day-to-day tasks, including production ones, that will help you get familiar with all the infrastructure pieces (monitoring, metrics, debugging issues …).
    • You will realize that there is still a knowledge gap about how the platform works and how all the pieces work together. It’s ok, there is no need to be worried because you will know who will be there to help you.

In 3 months…

    • You will deploy many Infrastructure changes through Terraform.
    • You will install and understand the Helm chart of our Self Hosted version and why not, contribute to it with some improvements.
    • You will start feeling more comfortable proposing changes and deploying them (no matter your expertise or seniority level, all the changes require a code review process).
    • You will keep asking questions to your buddies and the rest of the team and that is perfectly fine.
    • You will do your first response team shift: it is a two weeks shift (office hours) for fixing and helping other teams with issues related to the infrastructure (operational tasks).

In 6 months…

    • You will still ask questions (we all still ask questions), so please keep them coming. You will have a very good understanding of the platform and in case you are interested, you will have the chance to join the oncall shifts (24/7). This is totally optional.
    • You will enjoy getting your hands dirty to face new challenges. You will be able to debug, triage, and fix production issues.
    • You will work in mid/long term projects, where you could work shoulder to shoulder with other teams, or just with your teammates and sometimes, even by yourself.
    • You will feel comfortable working with other people at this stage and at the same time, confident enough to lead and execute projects / tasks by yourself.

How will you do this?

    • The Infrastructure team, as well as the rest of the Engineering team, plans the work/tasks with two-week iterations. For one iteration, at least, you will join the Response Team (operations) in order to have much better visibility of the platform and the most frequent issues that we need to help with. There is a knowledge base (wiki format) that will help you through it.
    • For another iteration, you will support the Professional Services team by helping them with Infrastructure tasks. This will help you get familiar with the custom solutions that CARTO clients demand and how they are deployed, from the Infrastructure point of view (GKE, Github Actions, Firebase, Cloud Functions …).
    • From this moment you will join the day-to-day Infrastructure team tasks. There is a kanban where we organize the tasks (using Shortcut, former clubhouse)..

You offer

    • Some experience like the one mentioned in the job description
    • Expertise with Terraform, or any other IaC tool
    • Familiar with cloud providers. CARTO mainly works with GCP, but any other expertise with other cloud providers is more than welcome
    • Familiar with cloud security terms: IAM, VPC, IAP…
    • Linux knowledge (containers, systems, signals…)
    • Debugging mindset and eager to fix issues
    • Experience with Kubernetes, HELM charts, and Docker
    • Development skills and with good database knowledge (PostgreSQL is a plus)
    • Very good communication skills, both conversation and written, as well as a good English level

We will be fascinated if…

    • You have experience with PostgreSQL and other warehouses like BigQuery, Redshift, or Snowflake
    • You have knowledge of any of the programming languages we use: NodeJS, Ruby, Python, Go
    • You have experience with CDN
    • You have faced a migration from legacy Infrastructure to “cloud native”
    • You have experience with maps and GIS

Interview process

    • Interview with HR
    • Interview with the Infrastructure Tech Lead
    • Technical challenge and technical interview with Infrastructure team members
    • Talk with the Director of Engineering and with the CTO

We offer

    • Competitive, results-based compensation
    • Access to our employee stock options plan
    • 24 days vacation (22 + Christmas Eve + New Years Eve)
    • 100% remote, with offices in Madrid and Seville.
    • Private medical insurance
    • Flexible work hours in a focused but casual environment
    • Education Stipend
    • Flexible compensation
    • English classes
    • A big vision: to help the world use location-based data to make better decisions. We believe that openness and sustainability are baked into this vision, and we share it with the world.
    • Contribute to a platform used by top companies around the world. Your work will have a direct impact on our users and clients.
To apply

Do you feel that you don't check all of the requirements? At CARTO we believe that professional development happens through teaching and learning from your peers and managers. Even if you’re uncertain whether you have the experience we’re looking for, please apply if this position sparks your curiosity.

Diversity of identity, perspective, and experience makes us stronger. We welcome you to apply to CARTO regardless of your background, age, gender, ethnicity, orientation, or ability.
 

About Carto

  • Saa S

  • Madrid, Spain

  • 50-200

  • 2012

.

Other devops jobs that might interest you...