Sardine

Staff SRE

Posted 4 Days Ago
Remote
2 Locations
180K-220K Annually
Senior level
The Staff SRE is responsible for maintaining production services, improving security and operational processes, and collaborating with engineering teams to deliver scalable, reliable services. This role requires a proactive approach to monitoring and debugging systems.

Who we are:

We are a leader in fraud prevention and AML compliance. Our platform uses device intelligence, behavior biometrics, machine learning, and AI to stop fraud before it happens. Today, over 300 banks, retailers, and fintechs worldwide use Sardine to stop identity fraud, payment fraud, account takeovers, and social engineering scams. We have raised $145M from world-class investors, including Andreessen Horowitz, Activant, Visa, Experian, FIS, and Google Ventures.

Our culture:

  • We have hubs in the Bay Area, NYC, Austin, and Toronto. However, we maintain a remote-first work culture. #WorkFromAnywhere

  • We hire talented, self-motivated individuals with extreme ownership and high growth orientation.

  • We value performance, not hours worked. We believe you shouldn't have to miss a family dinner, your kid's school play, a get-together with friends, or a doctor's appointment for the sake of adhering to an arbitrary work schedule.

About the Role:

Site Reliability Engineers (SREs) are responsible for keeping all production services running smoothly. SREs are a blend of pragmatic operators and software craftspeople who apply sound engineering principles, operational discipline, and mature automation to our operating environments. As an SRE at Sardine, you will build and run the core components that process billions of events to protect financial institutions from fraud and compliance risks. You will also partner with our other engineering teams to help make their services more performant, scalable, observable, and reliable. We believe every engineering team at Sardine should be responsible for the software they build, and SREs play a critical part in providing the tools, practices, and expertise to make that happen.

You will:

  • Run the production environment by monitoring availability and taking a holistic view of system health

  • Take a proactive rather than reactive approach to monitoring: build monitoring that alerts on symptoms rather than on outages

  • Participate in on-call rotations, along with every member of the engineering team

  • Improve and automate operational processes

  • Constantly improve the security of the product and of security operations

  • Debug production issues across services and levels of the stack

  • Partner with engineering teams to ensure their products meet production standards

  • Be willing to step outside your comfort zone into unfamiliar territory to solve unique issues

  • Help shape our company's engineering culture and keep high engineering standards.

  • Run our GCP and AWS infrastructure with Terraform, Kubernetes, and Datadog, together with the DevOps team

An ideal candidate has:

  • 7+ years of experience designing, building, and operating large-scale production systems

  • Experience with Google Cloud Platform

  • Experience with monitoring tools like Datadog, and preferably open-source tools like Prometheus, Grafana, and Jaeger (tracing)

  • Elasticsearch experience is a plus

  • Experience with container orchestration tools like Kubernetes and tools that support Kubernetes deployments, like ArgoCD and Helm

  • Strong programming skills, primarily in Go and/or other languages

  • Strong knowledge of database optimization

  • Good knowledge of security best practices for cloud infrastructure

Compensation: Base pay range of $180,000 - $220,000 + equity with tremendous upside potential + attractive benefits

The compensation offered for this role will depend on various factors, including the candidate's location, qualifications, work history, and interview performance, and may differ from the stated range.

Benefits we offer:

  • Generous compensation in cash and equity

  • Early exercise for all options, including pre-vested

  • Work from anywhere: remote-first culture

  • Flexible paid time off, year-end break, self-care days off

  • Health insurance, dental, and vision coverage for employees and dependents - US and Canada specific

  • 4% 401(k) / RRSP matching - US and Canada specific

  • MacBook Pro delivered to your door

  • One-time stipend to set up a home office — desk, chair, screen, etc.

  • Monthly meal stipend

  • Monthly social meet-up stipend

  • Annual health and wellness stipend

  • Annual learning stipend

  • Unlimited access to expert financial advisory services

Join a fast-growing company with world-class professionals from around the world. If you are seeking a meaningful career, you have found the right place, and we would love to hear from you.

Top Skills

ArgoCD
CI/CD
Datadog
Elasticsearch
Git
Go
Google Cloud Platform
Grafana
Jaeger
Kubernetes
Prometheus
Terraform
