Senior Operations Engineer
Our story:
The widespread adoption of intelligent technologies powered by automation, AI, ML, and knowledge graphs is accelerating. As these technologies become increasingly accessible, our aim is to make their capabilities empowering, trustworthy, and useful to real people in the real world.
Adapter was founded in 2022 by Adam Ghetti and Dr. David Bader, with the support of some of the most esteemed Tier 1 Silicon Valley firms and individual entrepreneurs. We are a small but dedicated team, currently working towards solving a significant problem. We recognize the importance of being early movers in this field, and have assembled a well-supported and passionate team to do so.
What we are looking for:
We are seeking a Senior Operations Engineer to play a key role in developing a product that, at its core, aims to empower individuals through thoughtful collaboration with intelligent technology.
You will partner with a brilliant team of engineers and innovators and will be at the cutting edge of some of the most interesting consumer use-cases for intelligent technologies.
We have established a culture that promotes both remote work and in-person collaboration. Team members are currently dispersed around the country, with hubs in Austin, NYC, and the Bay Area. We believe that combining these two elements allows for maximum productivity and creativity as we strive to achieve our goal.
Responsibilities:
- Consult with stakeholders to identify infrastructure needs and solutions that support research and product development.
- Spearhead technical planning and oversee the implementation of resilient, scalable systems.
- Design, implement, and maintain automated infrastructure provisioning and configuration management processes.
- Observability: Design and implement a robust observability system using structured logging, telemetry, and optimized tooling to minimize manual intervention. Foster operational excellence through centralized, proactive monitoring solutions that support continuous improvement.
- Monitoring and Incident Response: Implement monitoring solutions for proactive issue detection and lead incident response activities to ensure system reliability.
Qualifications:
- Experience deploying public-facing applications and distributed services at large scale (e.g., 100k+ users).
- Proficient in scripting languages such as Python and Bash.
- Experience with containerization technologies (e.g., Docker, Kubernetes).
- Familiarity with major cloud platforms (e.g., AWS, Azure, GCP) and infrastructure as code (e.g., CDK, Terraform).
- Experience with observability tools (e.g., Prometheus, Datadog, Honeycomb, ELK Stack).
- Experience with incident response, postmortems, and root cause analysis.
- Experience establishing and monitoring SLIs.
Desired:
- Experience owning the implementation of cybersecurity best practices in infrastructure design and deployment.
- Experience leading an operations team for a highly available service.
- Experience with cross-platform observability.
Work Experience:
- 5+ years of experience as an Operations Engineer; startup experience is a plus
- Bachelor's or Master's degree in Computer Science, Information Technology, or related field (or comparable work experience)
Benefits:
- Early-stage equity
- Comprehensive health insurance
- Generous PTO
- Remote and in-person cultures that promote collaboration
Full compensation packages are based on candidate experience and certifications.
United States - Remote Pay Range
$200,000 - $240,000 USD