
Senior SRE
Welcome to 10Pearls!
We believe in harnessing the power of technology for social good through our core values: innovate, modernize, and accelerate.
About us
We are 10Pearls, an award-winning digital development company that helps businesses with product design, development, and technology acceleration. Our culture of innovation is designed to help companies transform, digitalize, and scale by leveraging digital technology. We offer a great work environment, challenging projects with US customers, strong benefits, and much more.
We are seeking a well-rounded SRE/DevOps Engineer with practical experience in infrastructure operations and DevOps practices. This role is ideal for an experienced generalist who enjoys solving complex technical challenges and implementing reliable, scalable solutions. You will work closely with the Head of SRE to enhance our infrastructure, automate processes, and improve observability and disaster recovery strategies.
Responsibilities:
- Enhance disaster recovery and multi-region capabilities to improve system resilience.
- Improve monitoring, alerting, and observability with tools such as Grafana, Prometheus, and Sentry.
- Support on-call processes by enhancing alerting strategies and automating responses where possible.
- Collaborate with development and operations teams to address reliability challenges and enhance performance.
- Automate routine tasks using scripting languages and Infrastructure as Code (IaC) tools.
- Contribute to postmortems and continuous improvement initiatives to enhance system stability.
- Maintain clear documentation of infrastructure, processes, and disaster recovery procedures.
Requirements:
- 3+ years of experience in Site Reliability Engineering, DevOps, or a related role.
- Experience with AWS (EC2, Lambda, S3, Route 53), Kubernetes, and Docker.
- Hands-on experience with Terraform or other Infrastructure as Code tools.
- Familiarity with monitoring and alerting systems (OpenSearch, Grafana, Prometheus, Sentry).
- Experience with CI/CD pipeline tools such as GitLab CI or similar.
- Proficiency in Python, Bash, or similar scripting languages.
- Basic knowledge of networking, DNS, and load balancing.
- Strong troubleshooting skills with the ability to respond to incidents effectively.
- Ability to work within a team and support cross-functional initiatives.
- Clear and concise technical writing skills.
Nice to have:
- Experience with multi-region AWS setups and disaster recovery planning.
- Knowledge of Grafana monitoring and dashboards.
- Understanding of compliance frameworks (HIPAA, HITRUST).
What we offer:
- Work from home
- Flexible schedules
- Paid time off (PTO)
- Amazing people-oriented organizational culture
- Challenging projects using the latest technologies with clients from the US and Canada
- Internal training in technology and soft skills
- Online Courses from AdworldPrime
Thank you for applying to this position. We're thrilled to start reviewing your profile and skills! This is the first step in our selection process, so you will hear back from our recruitment team regarding the next steps 😀
10Pearls Team