Qumulo Logo

Qumulo

Site Reliability Engineer

Job Posted 9 Days Ago Reposted 9 Days Ago
2 Locations
Mid level
2 Locations
Mid level
As a Site Reliability Engineer at Qumulo, you will collaborate across teams to develop and monitor applications, manage build/test infrastructure, automate processes, and troubleshoot issues while ensuring system availability.
The summary above was generated by AI

About the Company:

Qumulo is the unstructured data platform to store and manage exabyte-scale data anywhere – at the edge, in the core data center and in the cloud. With unstructured data growing in more locations faster than ever before, enterprises today need a way to store, manage, and curate data simply and efficiently in any location, on any platform. This is precisely what Qumulo was founded to accomplish.

At Qumulo, we are building an open and collaborative culture where people can do their best work with customers as our magnetic field. We act as owners, we share by default, we are data driven and experimental and as an inclusive workplace, we encourage and celebrate multiple points of view. As part of our culture we believe diversity drives innovation.

About the Position:

As an SRE at Qumulo, you will help to develop solutions that help to manage and monitor applications we use internally and to support our customers.  We manage our internal build and test infrastructure which includes running multiple builds and hundreds of thousands of tests continuously in both on-prem environments and on the cloud  (such as AWS and Azure Native Qumulo Scalable File Service [ANQ]). This build and test environment is a core part of our engineering processes, providing continuous feedback to our engineering teams and allowing us to deliver new product releases regularly throughout each year. We also build and operate managed components of ANQ, delivering a highly available service to customers and keeping the service up to date with our latest features.

We work across engineering, product and customer success teams to identify opportunities to improve our processes and ensure that our existing systems are available and working as expected.  We implement solutions that reduce work through automation, providing scalable solutions that span our on-prem and cloud environments. We help manage the operating expense of running systems across multiple clouds.  We help drive down failures by providing frequent feedback to engineers on their changes with high quality test analytics.

Responsibilities: 

You will collaborate with a team that identifies opportunities, plans new features, and implements solutions. You will work with team members to build a backlog and deliver solutions iteratively.You will troubleshoot build and test failures, diagnosing problems that vary from build time compilation failures to integration test failures involving both virtual machine instances and Qumulo qualified hardware. You will implement monitoring to ensure that systems are working as expected and can raise alerts when problems are detected.

This position does include an on-call rotation which requires availability to respond to critical incidents impairing our owned applications.  

Technologies:

  • Experience working in Linux (we use Ubuntu) 
  • Experience with Python or similar programming languages
  • Experience with system orchestration tools (such as Ansible, Terraform, and cloud specific implementations like AWS CloudFormation) is preferred.
  • Experience with one or more of the major cloud providers (AWS, GCP, Azure)
  • Functional working understanding of Kubernetes and working with containers to manage applications (we manage clusters in our on-prem locations as well as in the "cloud")
  • Experience with monitoring tools and technologies (we use a combination of home grown solutions that utilize OpenMetrics as well as tools like Grafana, InfluxDB, and Prometheus)
  • Experience troubleshooting systems issues
  • Knowledge of build automation and test frameworks

Key Benefits

The annual pay range for the role is USD $140,000.00 - $190,000.00.

Individual pay depends on various factors, such as role level, relevant experience and skills, and location. Pay ranges are reviewed and typically updated each year. Offers are made within the pay range applicable at the time. U.S. based employees have access to healthcare benefits, short-term and long-term disability coverage, basic life insurance, wellbeing benefits, flexible time off, and paid holidays, among others.

  • Excellent healthcare coverage
  • Parental leave
  • 401K investment plan
  • Unlimited paid time off, strongly encouraged to take at least 3 weeks per year

Qumulo is an Equal Opportunity Employer. Qualified applicants will receive consideration for employment without regard to race, color, gender, religion, sex, sexual orientation, age, disability, military status, or national origin or any other characteristic protected under federal, state, or applicable local law.

Please note that employment at Qumulo is contingent upon completion of a satisfactory background check.

For more information on our Applicant and Employee Privacy Notice please click on the link below:

https://qumulo.com/applicant-employee-privacy-notice

Top Skills

Ansible
AWS
Aws Cloudformation
Azure
GCP
Grafana
Influxdb
Kubernetes
Linux
Prometheus
Python
Terraform
Ubuntu

Qumulo Seattle, Washington, USA Office

1501 4th Avenue, Seattle, WA, United States, 98101

Similar Jobs

7 Days Ago
Seattle, WA, USA
124K-186K Annually
Senior level
124K-186K Annually
Senior level
Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
Site Reliability Engineer to enhance deployment capabilities for military systems, drive technical direction, and improve operational efficiency using modern technologies.
Top Skills: C++Cloud TechnologiesCybersecurityGoNetworkingPythonRust
9 Days Ago
Easy Apply
Hybrid
3 Locations
Easy Apply
Senior level
Senior level
Fintech • Mobile • Software • Financial Services
Responsible for delivering and maintaining systems for incident management, driving standardization and process improvements, and ensuring high system availability.
Top Skills: AnsibleAWSCfengineChefGoLinuxPuppetPythonRubyTerraformUnix
5 Days Ago
Seattle, WA, USA
117K-221K Annually
Senior level
117K-221K Annually
Senior level
Artificial Intelligence • Information Technology • Natural Language Processing • Software • Business Intelligence • Generative AI
The Senior Site Reliability Engineer will lead technical initiatives, improve infrastructure, mentor teams, and ensure system reliability using tools like Kubernetes and Terraform.
Top Skills: AnsibleBashDockerGoGrafanaKubernetesPrometheusPythonSplunkSQLTerraform

What you need to know about the Seattle Tech Scene

Home to tech titans like Microsoft and Amazon, Seattle punches far above its weight in innovation. But its surrounding mountains, sprinkled with world-famous hiking trails and climbing routes, make the city a destination for outdoorsy types as well. Established as a logging town before shifting to shipbuilding and logistics, the Emerald City is now known for its contributions to aerospace, software, biotech and cloud computing. And its status as a thriving tech ecosystem is attracting out-of-town companies looking to establish new tech and engineering hubs.

Key Facts About Seattle Tech

  • Number of Tech Workers: 287,000; 13% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Amazon, Microsoft, Meta, Google
  • Key Industries: Artificial intelligence, cloud computing, software, biotechnology, game development
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Madrona, Fuse, Tola, Maveron
  • Research Centers and Universities: University of Washington, Seattle University, Seattle Pacific University, Allen Institute for Brain Science, Bill & Melinda Gates Foundation, Seattle Children’s Research Institute
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account