
Streaming Infrastructure DevOps Engineer - Romania
Armis, the cyber exposure management & security company, protects the entire attack surface and manages an organization’s cyber risk exposure in real time. In a rapidly evolving, perimeter-less world, Armis ensures that organizations continuously see, protect and manage all critical assets - from the ground to the cloud. Armis secures Fortune 100, 200 and 500 companies as well as national governments, state and local entities to help keep critical infrastructure, economies and society stay safe and secure 24/7.
Armis is a privately held company headquartered in California.
This is a hybrid role based in our Bucharest office (2 days a week in the office)
The role:
An operations-first engineer with deep expertise in running, scaling, automating, and monitoring streaming infrastructure like Kafka and RabbitMQ.
At Armis, streaming is at the heart of our product. We operate hundreds of streaming applications that transform, aggregate, analyze, and enrich the most valuable data we collect from our clients. We process billions of events and petabytes of raw data daily — and we’re just getting started.
Our mission is to provide a rock-solid, scalable, and secure infrastructure foundation that empowers our engineers to build and operate streaming services with confidence. This includes managing the full lifecycle of Kafka and RabbitMQ clusters, automating deployments, securing system access, and building out observability and monitoring capabilities that scale with our growth.
We are seeking a skilled and motivated DevOps engineer with deep familiarity in the streaming ecosystem to join our elite infrastructure team. If you're excited by the challenge of operating mission-critical systems at scale and optimizing the developer experience through automation and tooling, we’d love to hear from you.
What you will do…
- Automate Deployment and Operation
Oversee deployment of Kafka and RabbitMQ clusters (including Confluent Cloud & CFK). Build automation pipelines to ensure repeatability and resiliency across environments. - Monitor and Support Production Systems
Own production stability of global Kafka clusters. Handle on-call rotations, incident management, troubleshooting, and scaling challenges. - Improve Infrastructure Observability
Build and maintain observability systems: dashboards, alerting pipelines, metrics collection (Prometheus, Grafana, etc.). - Optimize System Performance
Collaborate with peers on benchmarking and optimization initiatives. Work on tuning Kafka brokers, cluster configurations, and runtime parameters. - Provide Developer Support and Training (Infra-focused)
Help developers configure topics, quotas, and consumers appropriately. Train service owners to interpret monitoring data and avoid pitfalls. - Develop and Maintain Infrastructure
Contribute to building infrastructure tools and scripts (IaC, Helm charts, etc.) that make provisioning and managing clusters reliable and efficient. - Secure Infrastructure Access
Configure and maintain secure access patterns across streaming infrastructure, ensuring proper authentication and role-based access controls are enforced for both developers and services.
What we expect…
- 8+ years of experience in DevOps, SRE, or Infrastructure Engineering roles.
- Deep hands-on Kafka experience, including deploying, maintaining, scaling, and monitoring clusters.
- Experience with RabbitMQ.
- Extensive experience with Docker, Kubernetes, Helm, and GitOps-style deployments.
- Infrastructure as Code experience (Terraform, Pulumi, etc.).
- Strong skills in scripting and automation (Python, Bash, etc.).
- Familiarity with Confluent Cloud, Confluent for Kubernetes, and similar tools.
- Solid understanding of authentication and authorization mechanisms in distributed systems.
- Production support mindset – with proven troubleshooting and incident resolution history.
- Collaboration and communication skills – especially with dev teams depending on platform support.
- Experience with Istio Service Mesh (bonus).
- Experience with GovCloud (bonus).
Bonus Qualities:
- Mentorship and leadership experience in infrastructure or SRE teams.
- Contributions to automation or monitoring open-source tooling.
- Active participant in SRE or DevOps communities.
- Conference speaker or internal tech trainer.
- Technical writing about infrastructure automation or reliability.
The choices you make in your career journey matter. You want to do interesting work in an important field while also having time to live your life, which is why we place so much value in your life-work balance. Armis sets you up for success with comprehensive health benefits, discretionary time off, paid holidays including monthly me days, and a highly inclusive and diverse workplace. Put your unique experiences and perspective to work in an environment where they will enable you to thrive, grow, and live your life with integrity.
Armis is proud to be an equal opportunity employer. We never discriminate based on race, ethnicity, color, ancestry, national origin, religion, sex, sexual orientation, gender identity, age, disability, veteran status, genetic information, marital status or any other legally protected (or not) status. In compliance with federal law, all persons hired will be required to submit satisfactory proof of identity and legal authorization.
Create a Job Alert
Interested in building your career at Armis Security? Get future opportunities sent straight to your email.
Apply for this job
*
indicates a required field