Site Reliability Engineer, Singapore

Ripple’s mission is to enable payments every way, everywhere for everyone. We believe connecting traditional financial entities like banks, payment providers and corporations with emerging blockchain technologies and users is the path to an open, decentralized, and more inclusive financial future. This Internet of Value gives any internet-enabled person, application or device access to financial services that are transparent, fast, reliable, and cheap. Delivering this vision is a challenge of massive scale spanning $155 trillion in annual cross border fiat payments and the $1.5 trillion market of digital assets that has grown 10X in the last year. 

As a Site Reliability Engineer, you will play a critical role helping to advance Ripple’s development and production infrastructure. You will help design and automate our infrastructure management processes, including deployment, instrumentation, monitoring, and overall lifecycle management. You will troubleshoot incident root causes, implement associated action items, and identify opportunities to reduce MTTR using automation.

WHAT YOU’LL DO:

  • Manage operations of Ripple’s customer-facing production services 
  • Design and develop tools for automation and observability 
  • Manage our deployment and continuous integration architecture
  • Collaborate with product engineering to ensure code is production-ready
  • Help facilitate the operation of multiple data centers and the rapid scaling of the Ripple network
  • Research promising new tools and technologies, push the team to experiment and evolve
  • Participate in a robust and fair on-call framework: SRE are Tier 2, and our worldwide team uses a follow the sun model so that on-call shifts are approximately 9am-5pm.

WHAT WE’RE LOOKING FOR:

  • Extensive Linux/*nix systems background
  • Advanced experience at troubleshooting complex systems
  • Proven development background with Go, Python, or Java
  • Experience with the Hashicorp tool ecosystem (Terraform, Vault)
  • Configuration management and orchestration experience (Helm, Argo)
  • Fluent with Docker and Kubernetes
  • Experience working with cloud infrastructures, particularly AWS and GCP
  • Unwilling to show fear when confronted with the JVM
  • Security awareness, with an emphasis on designing for security best practices
  • Recognition of the importance of automation on scalability and reliability
  • 5+ years experience working in SRE / DevOps / Software Engineering
  • A passion for building highly reliable infrastructure

WHAT WE OFFER:

  • The chance to work in a fast-paced start-up environment with experienced industry leaders
  • A learning environment where you can dive deep into the latest technologies and make an impact
  • Competitive salary and equity
  • Medical and vision with 100% employer contributions for employees and dependents
  • Industry-leading parental leave policies
  • Generous wellness reimbursement program
  • Weekly company-wide meeting – ask anything you may want to know