Develop software and software fixes to integrate internal systems. Ensure code quality, test and distribute code updates, and monitor the health and stability of the servers.
What you'll do:
Engineer automation, monitoring, and reactive and proactive system mitigation
Design and build automated code deployment systems that simplify development work, and scale infrastructure via automation
Meet or exceed Key Performance Indicators and SLAs; maintain an error budget and adhere to it
Ensure the platform holds a high degree of reliability, at least three 9s (99.9%) of availability
Provide recommendations and feedback in design reviews and review sessions
Mentor junior developers in best practices on all aspects of reliability, scalability and high availability
Orchestrate deployment of CI/CD with containers
Own and resolve technically intricate issues that cross between DevOps, databases, networking, code, infrastructure and people; drive them to satisfactory completion
Prepare and present engineering-related documents to key stakeholders
Other duties as assigned
What we're looking for:
Bachelor’s degree in Computer Science or another STEM field
Minimum of 3 years’ direct experience in a similar role with a Bachelor’s degree in Computer Science, or a minimum of 7 years’ direct experience without one
Excellent written and verbal communication skills in English
Experience with the full lifecycle of SaaS implementations as well as Infrastructure as Code
Strong listening, teamwork, analysis, reasoning, problem-solving, and time-management skills, with attention to detail
Ability to write SQL and ELK queries; advanced Linux skills; diagnostic experience with application performance monitoring (APM) tools, including APM configuration; basic IAM management; and proficiency in TCP/IP, the OSI model, and AWS
Ability to merge and apply PRs (Ansible and Terraform)
Deep understanding of release cycles and SDLC
Ability to create and complete Jenkins builds
Deep understanding of infrastructure and architecture