DevOps / SRE (Americas) - remote

Platform.sh
Posted 4 years ago

Job Summary: 

For its PaaS solution https://platform.sh is looking for an Operations and Service Reliability Engineer with a taste for Python and Go, great Linux system understanding, and a real hunger for the challenges of building robust, distributed systems.

Our external API is pure Hypermedia REST + oAuth on top of Pyramid. It mechanizes the Git layer and needs more features.

We can consistently generate from the same manifest a Docker container, an LXC one, or VM disk images (AWS, Azure, OpenStack), we want more targets.

We support any Python, Ruby, NodeJS or PHP, Java and .NET, time to roll-out Elixir, of course, Elixir (and Rust. We need Rust).

Directly reporting to one of our Directors for the Operations Infrastructure Department and in close interaction with our Engineering and Customer Success teams, you will be responsible for:

  • cloud operations: configure clusters, deploy stuff, follow-up on alerts, help customer support debug issues.
  • automating all of the above so they can instead drink margaritas (or non-alcoholic beverages, of course)
  • creating systems, tools &processes that will enhance our support and operations efficiency
  • improving service quality, discipline and reliability throughout lifecycle
  • monitoring operating objectives, streamline and automate intervention
  • continuous learning from Operations experience, modeled as software

This is a fully remote position for a candidate based in Americas.

The ideal candidate

  • Proficient in Python (Golang a plus)
  • has proven successful experience in an operations role,
  • has demonstrated the ability to successfully manage cloud-based infrastructure for a fast growing organization,
  • has experience with containerization technologies,
  • has had exposure to cloud services (AWS, Azure, GCP, ...),
  • understands how an OS works, knows networking, how git works, and the constraints of a distributed system,
  • Puppet experience

Note: we don't like stress, so we build everything to be robust and resilient, but stuff does break. This is a role with on-call duties and fire drills. If this fills you with dread... well, this might not be a fit for you.

About Platform.sh 

Platform.sh is an idea-to-cloud application platform that simplifies cloud infrastructures.

We give developers the tools they need to experiment, innovate, get rapid feedback and deliver better-quality features with speed and confidence thanks to our unique rapid cloning technology.

We want people who are passionate, open, multicultural, friendly, humble and smart to join us and help this fast-growing, award-winning company to revolutionize the tech industry.