
Infrastructure Engineer
Dataiku is The Universal AI Platform™, giving organizations control over their AI talent, processes, and technologies to unleash the creation of analytics, models, and agents. Providing no-, low-, and full-code capabilities, Dataiku meets teams where they are today, allowing them to begin building with AI using their existing skills and knowledge.
How you’ll make an impact
At Dataiku, our mission is to enable customers to bring large-scale data analytics and AI technologies into a centralized, easy-to-use platform. To support this mission, we are looking for an Infrastructure Engineer to help operate, maintain, and troubleshoot our internal and customer-facing infrastructure.
You will work closely with experienced infrastructure and platform engineers, contributing to the reliability and day-to-day operations of our systems. This role is hands-on and operationally focused, with a strong emphasis on UNIX/Linux systems and cloud infrastructure.
Our infrastructure primarily runs on AWS, with some components on Azure and GCP. The tooling environment includes Terraform, Ansible, Kubernetes, and Python, though you do not need deep expertise in all of these when you start.
What you’ll work on
- Operate, maintain, and troubleshoot UNIX/Linux systems running in cloud environments
- Support and maintain existing configuration management and Infrastructure as Code setups
- Assist with the operation of cloud-based infrastructure, including virtual machines, networking components, and managed services
- Help monitor system health and performance, investigate alerts, and participate in incident response and root cause analysis
- Perform routine infrastructure updates and maintenance to ensure systems remain secure, reliable, and up to date
- Support Kubernetes clusters and containerized workloads, primarily from an operational and troubleshooting perspective
- Collaborate with senior engineers to improve automation, monitoring, and operational practices
- Document procedures, operational runbooks, and troubleshooting steps to improve team efficiency
What you need to be successful
- Experience working with UNIX/Linux systems, including hands-on troubleshooting and shell scripting
- Understanding of networking fundamentals (TCP/IP, DNS, routing, firewalls, load balancing) in cloud or data-center environments
- Basic experience operating infrastructure in a cloud environment (preferably AWS), including compute, networking, and monitoring services
- Basic scripting or development experience (e.g., Python)
- Clear communication skills and a collaborative, respectful approach to working with teammates
- Willingness to learn, ask questions, and grow technical depth over time
Nice to have
- Exposure to Infrastructure as Code tools such as Terraform
- Familiarity with at least one configuration management or automation tool (e.g., Ansible, Chef, Puppet, SaltStack)
- Familiarity with Kubernetes or container-based environments
- Experience with monitoring tools such as Grafana or similar platforms
- Ability to investigate incidents, follow runbooks, and escalate appropriately when needed
- Interest in automation and reliability, even if you have not yet designed large-scale systems yourself
#LI-Hybrid #LI-FR1