Credit Karma's mission is to make financial progress possible for everyone. We have over 70 million US members and are a true mission-oriented business, a rare case where our incentives are aligned with our users’ - we succeed by helping our members.
At Credit Karma, each engineering team is responsible for monitoring and managing the reliability and performance of its services.  The mission of the Observability Team is to provide tools to allow other teams to visualize, analyze, monitor, and alert on the performance, integrity, and security of our systems and applications - and moreover, to provide insight at the granularity of a single component as well as at a multi-system level.
We are looking for a Staff or Senior Software Engineer to help design and build an integrated, context-aware Observability platform the provides our Engineering, Site Operations, and Security teams with real-time monitoring, alerting, and anomaly detection.  Responsibilities include prototyping new Observability technologies; understanding our stakeholders’ needs; collaborating with other engineers and teams; leading software and system design discussions; delivering high quality, well-tested software; and supporting the 24/7/365 operation of our Observability system.  Our ideal candidate is self-aware; embraces learning, collaboration, and innovation; and is comfortable with (or interested in) about 30% devops time in setting up and maintaining new systems.

What you’ll do:

  • Design and build our Observability system - one system to rule them all!
  • Research and prototype new Observability technologies and platforms including devops work
  • Design novel ways to integrate metrics, tracing, and logging into a single unified Observability platform
  • Detect anomalies using machine learning
  • Apply best practices to build a highly available, high volume Observability system
  • Demonstrate the Credit Karma values of knowing, owning, collaborating, innovating, and growing
  • What we look for (you don't need all of these, but the more, the merrier):

  • Strong fundamentals in Computer Science
  • 4+ years of professional experience in platform or server-side engineering (Scala, TypeScript, Node.js, and Go are pluses)
  • 2+ years of devops experience at scale
  • Strong teamwork and interpersonal skills
  • Ability to mentor other software engineers
  • Curiosity and creativity
  • Experience using several of the following technologies:

  • Linux (CentOS, RHEL)
  • Proxies (e.g. Nginx)
  • Containers, Orchestration (e.g. Docker, Kubernetes)
  • Cloud (GCP, AWS)
  • Data streaming (e.g. Kafka, Akka Streams, Google Pub/Sub, Amazon Kinesis)
  • Time-series databases (e.g. InfluxDB, OpenTSDB)
  • Metrics collectors (e.g. Telegraf, Statsd, Collectd)
  • Log management (e.g. Splunk)
  • Distributed tracing (e.g. Zipkin)
  • Anomaly detection, machine learning
  • Credit Karma is seeking a diversity of ideas which we believe comes from a diversity of backgrounds. We are committed to being a diverse and inclusive workplace and welcome people of different backgrounds, experiences, abilities and perspectives. Join us!