Experienced Site Reliability Engineer – London

Theorem

Theorem is a software consultancy that believes in simplicity in software design. We deliver solutions for startups and enterprises. You can see our portfolio to learn more about the results we've delivered for our clients.
We are a remote-first company with offices in Los Angeles and New York, and team members all around the world.
Job Duties:
  • Mentor and teach SRE best practices, internally and with our customers.
  • Build and maintain high-availability systems.
  • Identify improvement opportunities in existing systems, then plan and execute the improvements.
  • Ensure our clients and their users have the best and fastest experience possible.
  • Participate in code and design reviews, teaching and learning from other engineers.
  • Plan, estimate and prioritize work in a collaborative and distributed team.
  • Potentially travel to spend time with clients.
Job Requirements:
  • Familiarity with Python, C# or Ruby, and at least one other programming language.
  • Experience with Infrastructure as Code and Configuration Management tools.
  • Experience with alerting and monitoring tools.
  • Experience working in a highly distributed company.
  • Be open-minded and always learning.
  • Experience with the following tools is preferred, but not required:
      • Terraform
      • CloudFormation
      • Chef
      • Docker + Kubernetes
      • Prometheus + Grafana
      • Elasticsearch + Logstash + Kibana
      • Splunk
      • Jenkins