Join the team. Make an impact.
At Holded, we believe that daily admin should never stop a great idea from becoming a success. That's why we create intuitive software to empower anyone who dares to start their own business. Long story short: we want to make business simple.
In order to create cutting-edge products that meet the needs of the sector, talent is essential. We are looking for passion, creativity, and commitment. In return, we offer the same.
Job requirements
The role
We are seeking a skilled and motivated Site Reliability Engineer (SRE) to join our team. As an SRE, you will be responsible for ensuring our SaaS platform's reliability, scalability, and performance.
You will work closely with our development and operations teams to design, implement, and maintain the infrastructure and tools necessary to support our growing customer base.
Read about other perks and benefits at jobs.holded.com
Key Responsibilities
- Infrastructure Management: Design, build, and maintain the infrastructure that supports our SaaS platform, ensuring high availability and scalability.
- Monitoring and Alerting: Develop and implement monitoring and alerting systems to quickly detect and respond to incidents. Create dashboards to visualize system performance and identify potential issues.
- Incident Response: Lead incident response efforts, including root cause analysis, mitigation, and post-mortem reporting. Develop and maintain incident response playbooks.
- Automation: Automate repetitive tasks to improve efficiency and reduce human error. Implement infrastructure as code (IaC) using Terraform, or similar.
- Performance Optimization: Analyze system performance and identify areas for improvement. Work with development teams to optimize application performance and reliability.
- Security: Ensure the protection of our infrastructure and applications by implementing best practices and conducting regular security audits.
- Collaboration: Collaborate with development, operations, and product teams to design and implement reliable and scalable systems. Participate in architecture reviews and provide input on system design.
- Documentation: Create and maintain documentation for systems, processes, and procedures to ensure knowledge sharing and operational continuity.
Qualifications
- Education: Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent experience.
- Experience: 3+ years of experience in a Site Reliability Engineer or similar role, preferably in a SaaS environment.
Technical Skills
- Strong knowledge of infrastructure as code (IaC) tools such as Terraform.
- Proficiency in Google Cloud Platform (GCP).
- Experience with containerization and orchestration tools like Docker and Kubernetes.
- Experience with monitoring and logging tools such as Datadog, or similar.
- Familiarity with CI/CD pipelines and tools like Google Cloud Build, and GitHub Actions.
- Proficiency in scripting languages such as Python, Bash, or similar.
- Knowledge of Redis and MongoDB.
- Understanding of networking concepts and security best practices.
Soft Skills
- Excellent problem-solving and troubleshooting skills.
- Strong communication and collaboration skills.
- Ability to work independently and in a team environment.
- Attention to detail and a commitment to reliability and quality.
What you will do
In one month
- You will have completed your onboarding.
- You will already know your team.
- You will have deployed several times to production.
- You will have joined the main architectural discussions that will be taking place and have actively participated in them.
- You will know the main metrics and service level indicators of the main product areas.
In three months
- You will know the architecture in detail, and you will be in the process of improving certain parts. By then, you will have clear areas you would like to improve and lead the adoption of those improvements.
- You will have led a successful project, be it an automation feature, a technical debt reduction, a DX improvement, etc... achieving the expected result and with total technical independence.
In six months
- You will already know all the processes and tools in depth.
- With you contributions, you will have improved some metrics or key indicators of the platform
What it's like to work with us
- Permanent contract
- Remote friendly
- Short work-day on Fridays
- 26 paid vacation days
- Free catered lunch at the office
- English/Spanish classes
- Referral program
- Continuous Training: annual budget for training for each employee
- Fully equipped kitchen with snacks, drinks, and fresh fruit
- Top-notch work equipment
- Office in front of the sea with ping pong, pool table, PlayStation…
- Interesting projects and a great work environment
- An excellent opportunity to grow with the company
- Discounts on a Gym membership
At Holded, we do things a bit differently. There’s no corporate nonsense and no old-fashioned hierarchy. Instead, we work in self-sufficient, autonomous teams. We are an equal opportunities employer and welcome applications from all suitably qualified persons regardless of their race, sex, disability, religion/belief, sexual orientation, or age.
We didn’t start Holded to be another software company. We started Holded to be epic, and you will be part of it.