Sardine

Staff SRE

Posted 4 Days Ago

Remote

2 Locations

180K-220K Annually

Senior level

The Staff SRE is responsible for maintaining production services, improving security and operational processes, and collaborating across engineering teams for scalable and reliable services. This role requires a proactive approach to monitoring and debugging systems.

The summary above was generated by AI

Who we are:

We are a leader in fraud prevention and AML compliance. Our platform uses device intelligence, behavior biometrics, machine learning, and AI to stop fraud before it happens. Today, over 300 banks, retailers, and fintechs worldwide use Sardine to stop identity fraud, payment fraud, account takeovers, and social engineering scams. We have raised $145M from world-class investors, including Andreessen Horowitz, Activant, Visa, Experian, FIS, and Google Ventures.

Our culture:

We have hubs in the Bay Area, NYC, Austin, and Toronto. However, we maintain a remote-first work culture. #WorkFromAnywhere
We hire talented, self-motivated individuals with extreme ownership and high growth orientation.
We value performance, not hours worked. We believe you shouldn't have to miss a family dinner, your kid's school play, a get-together with friends, or a doctor's appointment for the sake of adhering to an arbitrary work schedule.

About the Role:

Site Reliability Engineers (SREs) are responsible for keeping all production services running smoothly. SREs are a blend of pragmatic operators and software craftspeople who apply sound engineering principles, operational discipline, and mature automation to our operating environments. As an SRE at Sardine, you will build and run the core components that process billions of events to protect financial institutions from fraud and compliance risks. You will also partner with our other engineering teams to help make their services more performant, scalable, observable, and reliable. We believe every engineering team at Sardine should be responsible for the software they build, and SREs play a critical part in providing the tools, practices, and expertise to make that happen.

You will:

Run the production environment by monitoring availability and taking a holistic view of system health
Take a proactive rather than reactive approach to monitoring: build monitoring that alerts on symptoms rather than on outages
Participate in on-call rotations, along with every member of the engineering team
Improve and automate operational processes
Constantly improve the security of the product and of security operations
Debug production issues across services and levels of the stack
Partner with engineering teams to ensure their products meet production standards
Be willing to step out of your comfort zone into unfamiliar territory to solve unique issues
Help shape our company's engineering culture and maintain high engineering standards
Run our GCP and AWS infrastructure with Terraform, Kubernetes, and Datadog together with the DevOps team

An ideal candidate has:

7+ years of experience designing, building, and operating large-scale production systems
Experience with Google Cloud Platform
Experience with monitoring tools like Datadog, and preferably open-source tools like Prometheus, Grafana, and Jaeger (tracing)
Elasticsearch experience is a plus
Experience with container orchestration tools like Kubernetes and tools that support Kubernetes deployment, like ArgoCD and Helm
Strong programming skills, primarily in Go, and/or in other languages
Strong knowledge of database optimization
Good knowledge of security best practices for cloud infrastructure

Compensation: Base pay range of $180,000 - $220,000 + equity with tremendous upside potential + Attractive benefits

The compensation offered for this role will depend on various factors, including the candidate's location, qualifications, work history, and interview performance, and may differ from the stated range.

Benefits we offer:

Generous compensation in cash and equity
Early exercise for all options, including pre-vested
Work from anywhere: Remote-first Culture
Flexible paid time off, year-end break, self-care days off
Health insurance, dental, and vision coverage for employees and dependents - US and Canada specific
4% matching in 401k / RRSP - US and Canada specific
MacBook Pro delivered to your door
One-time stipend to set up a home office — desk, chair, screen, etc.
Monthly meal stipend
Monthly social meet-up stipend
Annual health and wellness stipend
Annual learning stipend
Unlimited access to expert financial advisory services

Join a fast-growing company with world-class professionals from around the world. If you are seeking a meaningful career, you have found the right place, and we would love to hear from you.

Top Skills

ArgoCD

CI/CD

Datadog

Elasticsearch

Git

Google Cloud Platform

Grafana

Jaeger

Kubernetes

Prometheus

Terraform
