Netdata is looking for Senior Site Reliability / DevOps Engineers proficient in CI/CD methodologies, coupled with strong experience in software written in Javascript, Go, C, Python, or other scripting languages, to join our distributed (remote) engineering team.
As a Senior SRE/DevOps engineer you will focus on supporting our netdata cloud offerings, augmenting our existing development infrastructure by implementing the automation necessary to catalyze further development of both our open-source project and our commercial offerings, and last, but certainly not least, participating in the development of Netdata by making sure it's a first-class citizen in various operating environments (e.g. orchestrated containers, IoT devices, etc.)
Your work will include building CI/CD pipelines, packaging, installation facilities, and operational processes as well as developing custom solutions for our various teams and systems. As a Netdata SRE/DevOps engineer, you will also be assisting engineers across our company, enabling them to provide world-class solutions for numerous platforms;as well as our community, open-source contributors, and team-members with your deep knowledge of systems and troubleshooting skills.
Responsibilities
- Develop our automated CI/CD, packaging, deployment, and execution environment infrastructure.
- Develop automation tools to catalyze existing development or operational processes.
- Evaluate, architect, and develop technology options for our infrastructure and systems.
- Troubleshoot, maintain, enhance and augment our platform.
- Automate tasks wherever possible.
- Stay up-to-date on emerging technologies.
Required experience
- A bachelor's degree in Computer Science or equivalent
- 3+ years of experience on CI/CD tools (Travis, Gitlab, AWS, Azure, etc) and methodologies
- Minimum 3 years of Linux systems development and/or administration.
- Minimum 2 years of experience with at least one scripting language, coupled with related automation projects
- Previous experience with cloud-based technologies and surrounding operational processes
- Self motivated, conscientious, with a problem-solving, hands-on mindset.
- Perfectionist where it matters, but also pragmatic, with effective time management skills.
- Team player, eager to help.
- Excellent analytical skills.
- Excellent command of spoken and written English.
Preferred experience
- Minimum 2 years of Go, Javascript and C development experience in demanding environments.
- Expert on Continuous Integration, with long experience in Test Automation
- 5+ years of shell scripting experience, on at least 2 languages (BASH, python, perl, ruby, etc.)
- Minimum 2 years of experience with Google Cloud app engine and surrounding operational processes
- Experience on configuration management and tools to support it (Ansible, puppet, etc.)
- Experience with monitoring solutions and service assurance in general.
- A linux, cross-distribution artisan. A good amount of knowledge on windows system administration
- Open source contributor
- Agile Development Methodology