Head of Site and Reliability Engineering

Bitso

About Bitso:
Bitso is Latin America’s leading cryptocurrency platform. Our goal is to evolve how we think about and use money. We believe that we should all have the opportunity to use our money whenever we want it, and how we want it, without boundaries or schedules. To achieve this, we provide individuals with fast, cheap, seamless and user-friendly financial services powered by blockchain technology. Cryptocurrencies do not rely on intermediaries to give them legitimacy or value. Instead, they are valuable because of the peer-to-peer technology that powers them. We firmly believe in crypto and the use cases it has. It’s time for the world to upgrade to a fair, open, transparent, and global financial system for all. #makecryptouseful.
Visit us at https://bitso.com/
About the team:
  • The Head of Site Reliability Engineering must evaluate, promote, coach, and develop his or her people, but without traditional direct oversight. Heads of Engineering are not involved in the day-to-day work of squads; they don't check on or approve the work of their chapter members, and they certainly don't micromanage or provide daily oversight
You are:
  • Experience working in and/or building an SRE program at a technology-focused organization
  • Extensive software engineering background, with deep knowledge of automation (testing, deployment, etc.) and distributed systems architecture
  • Excellent programming skills, ideally in multiple languages, with knowledge of multiple programming paradigms (eg OO vs functional)
  • Deep knowledge of modern networks, protocols and diagnostic tools
  • Solid experience and understanding of the foundational services of at least one major cloud services providers (AWS preferred), including the following services: compute, storage, database and network (VPCs, routing, load balancers)
  • Experience with infrastructure-as-code automation tools such as CloudFormation, CDK, Terraform or Pulumi
  • Experience with Docker, Kubernetes and associated tooling (kubectl, helm, etc.)
  • Knowledge of the DataDog monitoring and observability platform (or similar)
  • Knowledge of PagerDuty incident response and escalation tool (or similar)
  • Knowledge of Splunk log indexing and analytics tool (or similar)
  • Excellent written and verbal communication skills.
  • English language proficiency.
You’ll do:
  • Lead the hiring of specialist reliability and automation engineers to work alongside product teams on operational stability
  • Work with architects and product teams to ensure clear ownership of services, with escalation paths for incidents
  • Assist product teams in defining realistic operational goals for their services, in areas such as availability, latency and performance, by agreeing on appropriate service level indicators, objectives and agreements
  • Assist product teams in improving the monitoring and observability of their services
  • Assist in the automation of routine and manual tasks to eliminate toil
  • Be responsible for growing the existing incident management function and improving the on-call experience
Compensation and Benefits:
  • Purpose: You’ll be part of something bigger, working towards financial disruption and inclusion across Latin America
  • Culture: You’ll work in a thriving, friendly, and fun environment that promotes open discussions, jokes, learning, video games, and lots of fun.
  • People: You’ll work with some of the most driven and intelligent people in the crypto space, engaging with a network of diverse talent from 25+ nationalities bound by our quest to #makecryptouseful
  • Salary: We pay very competitively in the countries where we operate, based on sophisticated high-tech markets
  • Venue: Work from wherever you want, work asynchronously; this role is fully remote to give you maximum freedom
  • Unlimited Paid Time-Off: You choose your number of days off. Recharge batteries and enjoy who you are outside the office
    • This role is expected to work remotely.
    • These are the applicable requisites, although equivalent competencies in any of the above will also be considered.
    Bitso promotes an environment where people are treated fairly and with respect, free of discrimination, bullying, harassment, violence or threats.
    Please visit: https://bitso.com/legal/GI/terms to see our privacy policy.
Subscribe Now