Site Reliability Engineer - remote

bit.io
Posted 3 years ago
Critical toward achieving bit.io’s vision is a datastore that (1) scales to petabyte queries while still enabling fast query-iteration by users, and (2) applies data best practices while remaining flexible when opinionated defaults aren’t enough. You will run &improve bit.io’s production platform.

As an SRE at bit.io you will:

  • Work across the stack on all aspects of the core product
  • Collaborate directly with all teammates on a small, productive technology team
  • Solve petabyte-scale problems in the data space
  • Design, build, test and deploy a complex data management system
  • Make broad, impactful technology decisions with responsibility for their outcomes
  • Develop key SLOs for the production system and own delivering those SLOs

We’re looking for someone who has:

  • Ran production systems that dealt with large amounts of structured data
  • Experience and passion in data, data engineering, and data processing
  • A strong drive to make software easy-to-use

The ideal SRE candidate will have:

  • Expertise in Python
  • Expertise in Kubernetes and associated technologies
  • Ran and debugged microservices and container technologies
  • A strong desire to constantly improve the iteration speed of the company as a whole through automation &systems engineering