Mentat is looking for seasoned Site Reliability Engineers to help our team and clients design and build enterprise automation frameworks spread across Azure, AWS, GCP, and VMware. As a Mentat SRE you will build monitorable, performant, reliable and highly-scalable automation systems with a small, fast-paced, growing team of engineers.
What You Will be Doing:
- Evangelize best practices for building and operating highly reliable systems
- Subject matter expert in observability and monitoring
- Consult on system design to meet reliability and capacity requirements
- Automate infrastructure and configuration management
- Assist with all aspects of operational security and compliance
- Design and build repeatable DevOps patterns for cloud deployments across Azure, AWS, Google, and VMware
- Contribute to our Alpaca product by automating deployments, configuration, and common administrative tasks
Required:
- Working knowledge of a centralized configuration tool like Chef, Puppet, or Ansible
- Experience developing and monitoring mission-critical systems
- Passion for reliable, scalable, observable software with strong sense of ownership
- 2-4 years Azure or AWS experience
- 2-4 years PowerShell/Bash or general Windows and Linux script automation
- 2-4 years Ansible or Puppet
- 2-4 years Gitlab
- 1-2 years experience Elasticsearch
- 2-4 years Terraform
Would Be Great To Have: