Site Reliability Engineer, Singapore

Posted 3 years ago
Applications may have closed

Ripple

Ripple’s mission is to enable payments every way, everywhere for everyone
We believe connecting traditional financial entities like banks, payment providers and corporations with emerging blockchain technologies and users is the path to an open, decentralized, and more inclusive financial future
This Internet of Value gives any internet-enabled person, application or device access to financial services that are transparent, fast, reliable, and cheap
Delivering this vision is a challenge of massive scale spanning $155 trillion in annual cross border fiat payments and the $1,5 trillion market of digital assets that has grown 10X in the last year

As a Site Reliability Engineer, you will play a critical role helping to advance Ripple’s development and production infrastructure
You will help design and automate our infrastructure management processes, including deployment, instrumentation, monitoring, and overall lifecycle management
You will troubleshoot incident root causes, implement associated action items, and identify opportunities to reduce MTTR using automation

WHAT YOU’LL DO:

Manage operations of Ripple’s customer-facing production services
Design and develop tools for automation and observability
Manage our deployment and continuous integration architecture
Collaborate with product engineering to ensure code is production-ready
Help facilitate the operation of multiple data centers and the rapid scaling of the Ripple network
Research promising new tools and technologies, push the team to experiment and evolve
Participate in a robust and fair on-call framework: SRE are Tier 2, and our worldwide team uses a follow the sun model so that on-call shifts are approximately 9am-5pm

WHAT WE’RE LOOKING FOR:

Extensive Linux/*nix systems background
Advanced experience at troubleshooting complex systems
Proven development background with Go, Python, or Java
Experience with the Hashicorp tool ecosystem (Terraform, Vault)
Configuration management and orchestration experience (Helm, Argo)
Fluent with Docker and Kubernetes
Experience working with cloud infrastructures, particularly AWS and GCP
Unwilling to show fear when confronted with the JVM
Security awareness, with an emphasis on designing for security best practices
Recognition of the importance of automation on scalability and reliability
5+ years experience working in SRE / DevOps / Software Engineering
A passion for building highly reliable infrastructure

WHAT WE OFFER:

The chance to work in a fast-paced start-up environment with experienced industry leaders
A learning environment where you can dive deep into the latest technologies and make an impact
Competitive salary and equity
Medical and vision with 100% employer contributions for employees and dependents
Industry-leading parental leave policies
Generous wellness reimbursement program
Weekly company-wide meeting – ask anything you may want to know