Job Application for Site Reliability Engineer

Who we are

We're a leading, global security authority that's disrupting our own category. Our encryption is trusted by the major ecommerce brands, the world's largest companies, the major cloud providers, entire country financial systems, entire internets of things and even down to the little things like surgically embedded pacemakers. We help companies put trust - an abstract idea - to work. That's digital trust for the real world.

Job Summary

The Site Reliability Engineer (SRE) collaborates with development teams to embed reliability, scalability, and performance best practices throughout the software development lifecycle. This role bridges software engineering and cloud operations, ensuring mission-critical systems remain highly available and resilient. By integrating reliability early, the SRE fosters a culture of shared responsibility while enabling rapid and safe feature delivery.

What you will do

Design and build fault-tolerant, high-performing systems that meet Service Level Objectives (SLOs) and Service Level Agreements (SLAs).
Implement monitoring, alerting, distributed tracing, and logging to ensure real-time system health visibility and proactive issue resolution.
Act as a first responder for production incidents, conduct blameless postmortems, and drive root cause analysis (RCA) and corrective actions.
Develop self-healing, automated deployments, and scaling solutions to minimize toil and improve system efficiency.
Improve continuous integration and deployment pipelines to enable safe, rapid, and reliable feature rollouts.
Review code, debug issues, and perform quality assurance (QA) on software components to enhance system reliability and performance.
Work closely with development teams to ensure best practices in software architecture, coding standards, and operational readiness.
Forecast scalability needs and optimize cloud infrastructure costs while balancing performance and efficiency.
Ensure production environments meet security and compliance requirements, collaborating with teams to mitigate vulnerabilities and enforce best practices.
Work closely with development teams to embed reliability at every stage rather than treating it as an afterthought.
Use error budgets to balance feature velocity with system stability.
Implement observability and automation-first principles to measure system health and drive continuous improvement.
Leverage game days, chaos engineering, and resilience testing to validate system robustness and refine operational processes.

What you will have

3-5 years of extensive experience in distributed systems, cloud-native architectures (AWS, GCP, Azure), and DevOps practices.
Proficiency in Kubernetes, Terraform, CI/CD pipelines, and Infrastructure as Code (IaC).
Strong scripting and automation skills in Python, Go, Bash, or similar languages.
Expertise in observability tools such as Prometheus, Grafana, Datadog, Splunk, New Relic, and Open Telemetry.
Ability to troubleshoot complex production issues and drive scalable, resilient solutions.
Experience reviewing code, debugging applications, and conducting software testing to ensure high reliability and quality.

Benefits

Generous time off policies
Top shelf benefits
Education, wellness and lifestyle support

#LI-SD1

First Name

Last Name

Phone

Location (City)

Resume/CV*

Accepted file types: pdf, doc, docx, txt, rtf

Cover Letter

Accepted file types: pdf, doc, docx, txt, rtf

Are you legally authorized to work in the country in which this role is located?

Select...

Do you now, or will you in the future, require work authorization sponsorship from your employer?

Select...

Please confirm the country in which you currently reside?

Select...

If the country you currently live in is NOT listed in the question above, please confirm in which country you live?

Were you encouraged to apply to this role by an DigiCert employee? If yes, please kindly state the full name

What are your salary expectations for this role?

APJ Self Identification of Demographic Information

Digicert invites you to self-identify your personal demographic information to help continue our mission to foster inclusivity and diversity in our workplace. In keeping with the DigiCert Care culture, our values set the foundation for how we act, how we make decisions and how we win. These values shape our work culture and demonstrate our dedication to ensuring everyone is welcomed and supported. We invite you to self-identify your gender. Completing this survey is voluntary and you may select “Decline to Disclose”, but we hope you choose to participate.

Your responses to this survey will also help DigiCert live up to our commitment to build inclusive teams that reflect the communities we serve. Responding is completely optional and voluntary and does not affect your standing as a candidate. Whatever your decision, it will not be used for the purposes of any employment decision. However, we do hope that you will participate because your responses help us measure the effectiveness of our outreach and recruitment. Any information you do provide is anonymized and stored separately from your application in a confidential file, and the information cannot be viewed by your interview team or hiring manager at any time.

We hope you will join us in our commitment and enthusiasm for making DigiCert a place where everyone belongs!

Voluntary Self-Identification of Gender *

Select...

Site Reliability Engineer - Embedded

Apply for this job

APJ Self Identification of Demographic Information