Job Application for Senior Software Engineer, Observability at Redpanda Data

Redpanda is pioneering the Agentic Data Plane (ADP) - a new category in AI infrastructure that makes it simple and secure to connect AI agents with enterprise data and systems. Built on a multi-modal data streaming engine, Redpanda empowers agentic applications that reason and act in real-time with speed, autonomy, and precision.

Global leaders including Activision Blizzard, Cisco, Moody's, Texas Instruments, Vodafone and 2 of the top 5 banks in the U.S. rely on Redpanda to process hundreds of terabytes of data a day.

Backed by premier venture investors Lightspeed, GV and Haystack VC, Redpanda is a diverse, people-first organization with teams distributed around the globe.

About the Role:

We are looking for a Senior Software Engineer to join our Observability team and help build the platform that gives Redpanda’s engineering organization deep visibility into the health, performance, and behavior of our systems. You will own and evolve our Grafana-based observability stack—spanning metrics, logs, and traces—and ensure that every team at Redpanda has the tooling and insights they need to ship reliable, high-performance software.

This is a high-impact role at the intersection of infrastructure and developer experience. You will work closely with platform and product engineering teams to design scalable observability solutions, drive adoption of best practices, and reduce mean time to detection and resolution across our cloud and on-premise deployments.

You Will:

Design, build, and maintain Redpanda’s observability platform using the Grafana stack (Grafana, Mimir, Loki, Tempo, Alloy/Agent)
Develop and optimize dashboards, alerts, and SLO/SLI frameworks that give engineering teams actionable insights into system health
Build and operate scalable metrics, logging, and distributed tracing pipelines that handle high-cardinality data across cloud and on-premise environments
Instrument services and infrastructure with OpenTelemetry to ensure comprehensive, standards-based telemetry collection
Partner with platform teams to improve incident detection, root-cause analysis, and mean time to resolution (MTTR)
Evaluate and integrate new observability tools and techniques, driving continuous improvement of our monitoring capabilities
Contribute to internal tooling and automation that streamlines observability onboarding for engineering teams
Participate in on-call rotation to keep observability infrastructure running and incident free

You Have:

5+ years of experience in software engineering with a focus on observability, monitoring, or infrastructure
Deep hands-on experience with the Grafana stack (Grafana, Mimir/Prometheus, Loki, Tempo) in production environments
Strong understanding of metrics, logging, and distributed tracing paradigms and their trade-offs at scale
Experience with OpenTelemetry (OTel) for instrumentation and telemetry collection
Proficiency in Go and Python
Experience running and operating infrastructure on Kubernetes in public cloud environments (AWS, GCP, or Azure)
Comfortable working with a 100% distributed engineering team, collaborating on GitHub, etc.
Experience with AI coding tools (e.g., Claude Code) and able to independently validate, refine, and productionize generated outputs
Solid understanding of time-series databases, log aggregation systems, and query languages (PromQL, LogQL)

Nice to Have:

Strong understanding of Go
Experience operating a SaaS platform with production observability at scale
Familiarity with eBPF-based observability or continuous profiling tools (e.g., Pyroscope, Parca)
Experience with infrastructure-as-code (Terraform, Pulumi) and GitOps workflows
Operated and used streaming platforms (e.g., Kafka, Redpanda) either as a user or provider
Experience building or managing multi-tenant observability platforms
Contributions to open-source observability projects (Grafana, Prometheus, OpenTelemetry, etc.)

Join Redpanda if you’d enjoy being part of a fast-moving, diverse, people-first organization with team members around the globe and a culture based on trust, transparency, communication, and kindness. You'll dive into a nimble, high-impact team with the latest AI tools — and the budget to actually use them.

#LI-Remote

Create a Job Alert

Interested in building your career at Redpanda Data? Get future opportunities sent straight to your email.

First Name

Last Name

Preferred First Name

Country

Phone

Resume/CV*

Accepted file types: pdf, doc, docx, txt, rtf

Do you have experience working at a pre-IPO B2B SaaS company in the past 5 years?

Select...

Have you hands-on worked with the Grafana stack (Grafana, Prometheus or Mimir, Loki, Tempo) in production environments?

Select...

Do you have professional experience writing code in Go/Python?

Select...

Do you have hands-on experience running and operating applications on Kubernetes in production?

Select...

Have you used query languages such as PromQL and/or LogQL in a professional setting?

Select...

Please confirm that you have read and understood our GDPR Policy.

Select...

When you apply to a job on this site, the personal data contained in your application will be collected by Redpanda Data, Inc. (“Controller”), which is located at 5758 Geary Blvd, #153 San Francisco, CA 94121 and can be contacted by emailing legal@redpanda.com. Your personal data will be processed for the purposes of managing Controller’s recruitment related activities, which include setting up and conducting interviews and tests for applicants, evaluating and assessing the results thereto, and as is otherwise needed in the recruitment and hiring processes. Such processing is legally permissible under Art. 6(1)(f) of Regulation (EU) 2016/679 (General Data Protection Regulation) as necessary for the purposes of the legitimate interests pursued by the Controller, which are the solicitation, evaluation, and selection of applicants for employment.
Your personal data will be shared with Greenhouse Software, Inc., a cloud services provider located in the United States of America and engaged by Controller to help manage its recruitment and hiring process on Controller’s behalf. Accordingly, if you are located outside of the United States, your personal data will be transferred to the United States once you submit it through this site. Because the European Union Commission has determined that United States data privacy laws do not ensure an adequate level of protection for personal data collected from EU data subjects, the transfer will be subject to appropriate additional safeguards under the standard contractual clauses or another legally recognized manner. You can obtain a copy of the standard contractual clauses by contacting us at legal@redpanda.com.
Your personal data will be retained by Controller as long as Controller determines it is necessary to evaluate your application for employment. Under the GDPR, you have the right to request access to your personal data, to request that your personal data be rectified or erased, and to request that processing of your personal data be restricted. You also have the right to data portability. In addition, you may lodge a complaint with an EU supervisory authority.

What is your current location?

Select...

Who is your most recent/current employer?

LinkedIn Profile

Senior Software Engineer, Observability

#LI-Remote

Apply for this job