
Software Engineer, Development Tools and Site Reliability

About the position:

As a Qumulo Member of Technical Staff on our Development Tools and Site Reliability Engineering team, you will architect, develop and maintain the continuous integration systems that give the engineering team quick feedback on the quality of their work. You will also own production services for monitoring Qumulo clusters deployed in our customers’ data centers.

We are looking for someone with strong analytical and troubleshooting skills, fluency in coding and systems design, solid communication skills and a desire to tackle the complex problems of scaling a young organization's infrastructure to maturity.

About the company:

Qumulo is a Seattle-based data storage startup. We are building solutions that will permanently change the enterprise storage marketplace, improving quality and increasing service standards. We are dedicated not only to building a fast, reliable product, but also to providing customers with a seamless user experience and unprecedented visibility into their data.

Founded in 2012 by the inventors of scale-out NAS, our vision has attracted a team of pioneers from Amazon Web Services, Google, and Microsoft. Our mission is simple – to be the company the world trusts to store, manage and curate its data.


Responsibilities:

  • Architect, implement and maintain build and continuous integration (CI) systems
  • Set standards for how to integrate with the build and CI systems
  • Educate and support Agile teams in integrating with CI systems
  • Own production operations, keeping our customer monitoring service operational and responsive as our customer base grows
  • Participate in capacity planning for our development and test infrastructure


Qualifications:

  • BS or MS degree in Computer Science or related technical field (or equivalent practical experience)
  • 4+ years of relevant work experience
  • Strong Python and/or Ruby development skills
  • Experience managing development and test environments
  • Excellent troubleshooting and debugging skills
  • Deep experience with virtualization platforms, monitoring tools, and networking
  • Ability to handle periodic on-call duty as well as out-of-band requests
  • Experience in C, C++ or Java
  • Experience with CI tools, such as Jenkins, TeamCity, Bamboo, or Travis CI
  • Experience with Puppet, Chef, Ansible, and/or Salt
  • Expertise in analyzing and troubleshooting large-scale distributed systems
  • Knowledge of IP networking, and of analyzing network, performance and application issues using standard tools such as tcpdump

