
Back to jobs
Site Reliability Engineer
Singapore
Visier is the leader in people analytics and we believe in a 'people-first' approach to business strategy. Our innovative technology transforms the way that organisations make decisions, allowing them to elevate their employees and drive better business outcomes. Embarking on an exciting new chapter in our growth story, we are looking for talented individuals who can help both Visier and our customers grow, evolve and win!
Visier is seeking a skilled Site Reliability Engineer to join our dynamic team. As an SRE at Visier, you'll directly contribute to the reliability and scalability of our cloud-based analytics platform. You'll work alongside experienced engineers, tackling complex infrastructure challenges and mastering essential SRE practices. You'll gain hands-on experience with technologies like Kubernetes, Kafka, Cassandra, and a comprehensive suite of AWS services, building a strong foundation to develop your career with Visier in the future!
What you'll be doing...
- Architect and Automate: Design, develop, and deploy infrastructure as code using Terraform and Packer, ensuring high availability and scalability in our AWS environment.
- Pipeline Mastery: Build and optimize robust CI/CD pipelines with Jenkins and Groovy, streamlining deployment processes and enhancing release velocity.
- Security by Design: Implement and enhance security best practices within our infrastructure, safeguarding sensitive data and ensuring compliance.
- Infrastructure Optimization: Troubleshoot, monitor, and improve the performance and reliability of critical infrastructure services, including Kong API gateway, Cassandra, PostgreSQL, Consul, Vault, and Kafka.
- Develop SRE Tools: Write and maintain automation scripts and tools using Python to streamline operational tasks and improve efficiency.
- Contribute to System Design: Participate in system design discussions and contribute to the evolution of our infrastructure architecture.
- Incident Response: Participate in on-call rotations and contribute to incident response efforts, learning to diagnose and resolve production issues.
What you'll bring to the table...
- A strong foundation in software development principles and a passion for infrastructure as code.
- Proficiency in at least one programming language (Python, Java, Scala, Go, etc.).
- A solid understanding of Linux systems administration and networking fundamentals.
- A proactive approach to problem-solving, with a strong sense of ownership and accountability.
- A desire to learn and grow in a fast-paced, collaborative environment.
- A strong understanding of CI/CD concepts.
- A desire to learn and improve security practices.
Bonus Points:
- Experience with AWS services (EC2, S3, RDS, etc.) and related tools.
- Familiarity with containerization and orchestration technologies (Docker, Kubernetes).
- Knowledge of configuration management tools (Ansible, Chef, Puppet).
- Experience with monitoring and logging tools (Prometheus, Grafana, ELK stack).
- Experience with Infrastructure as code tools.
Most importantly, you share our values...
- You roll up your sleeves
- You make it easy
- You are proud
- You never stop learning
- You play to win
Apply for this job
*
indicates a required field