Sr. Manager, Site Reliability Engineering (SRE)
Xometry (NASDAQ: XMTR) powers the industries of today and tomorrow by connecting the people with big ideas to the manufacturers who can bring them to life. Xometry’s digital marketplace gives manufacturers the critical resources they need to grow their business while also making it easy for buyers at Fortune 1000 companies to tap into global manufacturing capacity.
North Bethesda, MD, Lexington, KY, Boston, MA
We are looking for a Sr. Manager of Site Reliability Engineering (SRE) to join our organization. You will be responsible for crafting the strategic direction for SRE teams and initiatives, helping Xometry build cost-effective, secure, fast, and reliable systems for our global manufacturing marketplace.
Responsibilities:
- Together with engineering, product, and program management leaders, define our standards, metrics, and practices to improve operational rigor, efficiency, and engineering velocity.
- Establish automated and self-service strategies to improve operational efficiency and development team self-sufficiency.
- Champion and measure observability, monitoring, and metrics practices.
- Supervise development, configuration, and maintenance of the underlying platforms for deployed software: AWS accounts and networking, Kubernetes clusters, and similar systems.
- Supervise development, configuration, and maintenance of observability and monitoring tools.
- Supervise development, configuration, and maintenance of software development (CI/CD) tools (GitHub Actions runners, ArgoCD, etc.).
What You Need to Bring:
- A degree or equivalent experience, plus 7+ years of experience in software development and site reliability in a fast-paced, product-driven environment.
- An opinionated and iterative approach to balance short-term priorities with a long-term target architecture for systems and processes.
- A proven track record of building and growing a high-performing SRE team.
- A strong understanding of infrastructure automation and observability within distributed systems.
- Experience defining and operationalizing SLOs, SLAs, and error budgets for platform and application systems.
- Demonstrated ability to communicate effectively with everyone from junior-level ICs to technology, product, and business executives.
- A US person (citizen or green card holder).
#LI-Hybrid
Xometry is an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran, or disability status.
For US-based roles: Xometry participates in E-Verify and, after a job offer is accepted, will provide the federal government with your Form I-9 information to confirm that you are authorized to work in the U.S.