Site Reliability Engineer - remote

Mentat
Posted 3 years ago
GitHub Jobs

Mentat is looking for seasoned Site Reliability Engineers to help our team and clients design and build enterprise automation frameworks spanning Azure, AWS, GCP, and VMware. As a Mentat SRE, you will build monitorable, performant, reliable, and highly scalable automation systems with a small, fast-paced, growing team of engineers.

What You Will Be Doing:

  • Evangelize best practices for building and operating highly reliable systems
  • Serve as a subject matter expert in observability and monitoring
  • Consult on system design to meet reliability and capacity requirements
  • Automate infrastructure and configuration management
  • Assist with all aspects of operational security and compliance
  • Design and build repeatable DevOps patterns for cloud deployments across Azure, AWS, GCP, and VMware
  • Contribute to our Alpaca product by automating deployments, configuration, and common administrative tasks

Required:

  • Working knowledge of a centralized configuration tool like Chef, Puppet, or Ansible
  • Experience developing and monitoring mission-critical systems
  • Passion for reliable, scalable, observable software with a strong sense of ownership
  • 2-4 years Azure or AWS experience
  • 2-4 years PowerShell/Bash or general Windows and Linux script automation
  • 2-4 years Ansible or Puppet
  • 2-4 years GitLab
  • 1-2 years Elasticsearch
  • 2-4 years Terraform

Would Be Great To Have:

  • 4-6 years VMware
  • 4-6 years Azure and AWS
  • 4-6 years automation
  • 2-4 years Jenkins
  • 2-4 years containerization
  • 2-4 years Elasticsearch