Site Reliability Engineer - remote

Mentat
Posted 4 years ago

Mentat is looking for seasoned Site Reliability Engineers to help our team and clients design and build enterprise automation frameworks spread across Azure, AWS, GCP, and VMware. As a Mentat SRE you will build monitorable, performant, reliable, and highly scalable automation systems with a small, fast-paced, growing team of engineers.

What You Will Be Doing:

  • Evangelize best practices for building and operating highly reliable systems
  • Serve as a subject matter expert in observability and monitoring
  • Consult on system design to meet reliability and capacity requirements
  • Automate infrastructure and configuration management
  • Assist with all aspects of operational security and compliance
  • Design and build repeatable DevOps patterns for cloud deployments across Azure, AWS, GCP, and VMware
  • Contribute to our Alpaca product by automating deployments, configuration, and common administrative tasks

Required:

  • Working knowledge of a centralized configuration tool like Chef, Puppet, or Ansible
  • Experience developing and monitoring mission-critical systems
  • Passion for reliable, scalable, observable software with a strong sense of ownership
  • 2-4 years Azure or AWS experience
  • 2-4 years PowerShell/Bash or general Windows and Linux script automation
  • 2-4 years Ansible or Puppet
  • 2-4 years GitLab
  • 1-2 years Elasticsearch
  • 2-4 years Terraform

Would Be Great To Have:

  • 4-6 years VMware
  • 4-6 years Azure and AWS
  • 4-6 years automation
  • 2-4 years Jenkins
  • 2-4 years containerization
  • 2-4 years Elasticsearch