
Staff Software Engineer (DevOps / Platform Engineering)

San Francisco Bay Area / Boston

About The Role

We are looking for a seasoned Staff DevOps and Platform Engineer to own and evolve the infrastructure that powers Liberate’s real-time AI voice and workflow automation systems. This is a critical technical leadership role. You will inherit and advance a modern AWS-based platform that spans PBX telephony, canary routing, mTLS-based integrations with carriers, secure production environments, CI/CD, and compliance posture.

You will drive reliability, scalability, and operational rigor across our multi-agent runtime. You will also mentor engineers, design forward-looking system improvements, and create the platform foundations that enable Liberate’s rapid product expansion.

Key Responsibilities

Infrastructure Architecture and Ownership

  • Lead architecture and operation of core AWS infrastructure including PBX systems, EKS, networking, IAM, VPC design, and secure environment isolation.
  • Own and improve canary routing infrastructure for LRA (LLM REST API) via Traefik and GitOps patterns.
  • Maintain and optimize CI/CD flows including GitHub-based CodeBuild jobs, artifact pipelines, and environment promotion workflows.

Secure Integrations and Network Edge

  • Manage and evolve the mTLS proxy infrastructure used to integrate with carrier systems like Frontline.
  • Own the HAProxy-based proxy fleet, certificate lifecycle, root CA management, and IP-restricted ingress patterns.
  • Ensure secure, audited access to production systems, tokens, and root-level accounts.

Operational Excellence

  • Lead incident response, on-call rotations, and postmortems. Improve reliability metrics (SLA/SLO/SLI) for voice, agent runtime, and workflow systems.
  • Maintain and improve non-obvious production infra details including external service dependencies, version pinning, and update cadences.
  • Partner with AWS Support to optimize pricing, scaling configs, and resource utilization.

CI/CD, Tooling, and Developer Experience

  • Modernize developer workflows: streamlined builds, repeatable environments, safe deployment strategies (blue/green, canary, feature flags).
  • Build internal tools and abstractions to make engineers productive while enforcing safety, configuration hygiene, and compliance requirements.

Compliance and Security

  • Lead infrastructure-related components of SOC 2, penetration testing, and Vanta-driven controls.
  • Ensure auditability, traceability, secure storage of credentials, and alignment with enterprise customer expectations.

Cross-Functional Leadership

  • Work closely with AI Platform, Forward Deployed Engineering, and Product teams to translate business goals into scalable infrastructure decisions.
  • Mentor engineers across DevOps, platform, and backend areas. Help set engineering standards and raise operational maturity across the org.

Required Qualifications

  • 8+ years of DevOps, SRE, or platform engineering experience operating production systems at scale.
  • Deep hands-on AWS expertise (EKS, IAM, VPC, ALB/NLB, CloudWatch, KMS).
  • Strong experience with Kubernetes, container orchestration, and multi-environment management.
  • Proficiency with Terraform or other IaC tools and GitOps workflows.
  • High proficiency in Python, Go, or TypeScript for tooling, automation, and internal platform services.
  • Experience with Traefik, HAProxy, or similar load-balancing and routing systems.
  • Familiarity with secure network architectures, mTLS, certificate hierarchies, and service-to-service authentication.
  • Strong background managing CI/CD systems such as GitHub Actions and CodeBuild.
  • Ability to lead incidents, design SLOs, and drive reliability across mission-critical systems.
  • Excellent communication and leadership skills in distributed teams.

Preferred Qualifications

  • Experience with PBX or telephony systems in AWS, SIP routing, or real-time communication pipelines.
  • Experience with voice agents, WebRTC, or low-latency streaming services.
  • Prior work in regulated or enterprise environments where compliance is a first-class requirement.
  • Experience scaling infra in fast-growth startups.
  • Contributions to open source, infrastructure design talks, or technical publications.

Why This Role Matters

Our platform supports real-time, multi-agent reasoning and voice workflows that depend on low latency, reliability, and airtight security. This role is the backbone of that capability. If you’re excited to own mission-critical infrastructure in a company where infrastructure is the product, we’d love to talk.

We strongly prefer candidates based in Boston or San Francisco, but are open to remote candidates within the U.S.
