Site Reliability Engineer - CorpTech
Company Overview
Arcesium is a global financial technology firm that solves complex data-driven challenges faced by some of the world’s most sophisticated financial institutions. We constantly innovate our platform and capabilities to meet tomorrow’s challenges, anticipate the risks our clients encounter, and design advanced solutions to help our clients achieve transformational business outcomes.
Financial technology is a high-growth industry as change and innovation continue to disrupt the status-quo and prompt major transformation. Arcesium is at a particularly interesting time in our own growth as we look to leverage our successfully established market position and expand operations in pursuit of strategic new business opportunities. We value intellectual curiosity, proactive ownership, and collaboration with colleagues, and we empower you to meaningfully contribute from day one and accelerate your professional development.
Position Summary
We are looking for an SRE to join our Corporate Technology team. The ideal candidate will be involved in planning, designing, and implementing various applications and infrastructure used by our staff. Strong focus will be on developing and managing applications built with cloud native and serverless technologies leveraging Azure & AWS Services.
The ideal candidate is an excellent Site Reliability Engineer with experience in cloud-based tech and a firm understanding of how to solve business needs using emerging technologies with emphasis on building applications that are cost friendly and support zero-touch operations. You’ll also need to analyze various reports and statistical data to measure productivity levels and identify root causes for underperforming areas, develop customized reporting to measure and track operational statistics, data and results, oversee weekend activities across various office spaces such as user migrations to newer platforms, software & hardware upgrades and audits etc.s
Responsibilities
- Build integrations with third party SaaS applications that will include custom user provisioning, SSO, automation for migrating data and custom integrations with other applications
- Use MS Azure for managing operations in Windows Compute and Solutioning domain.
- Write good code, catch bugs, and style issues in code reviews, ship small features independently
- Participate in all aspects of the software development life cycle for AWS/Azure solutions, including planning, requirements, development, testing, and quality assurance
- Ensure the applications have optimal observability, monitoring and alerts that help identity the problems before they affect business productivity.
- You may also be involved in supporting our existing Corporate Tech applications and infrastructure like – Azure, AD, M365, Slack, Outlook/Exchange, AWS Workspaces/desktop infrastructure and other enterprise SaaS products.
- Handle operation issues for both Portugal and London office and act as Escalation Engineer for both the sites.
Qualifications & Must Haves
- 2+ years of solid Site Reliability Engineering skills, with a proven track record in developing quality software solutions and passion for technology.
- Hands on experience in diagnosing and troubleshooting operational issues, including root cause analysis (RCA) documentation.
- Strong programming skills, with proficiency in Python (preferred) or Java.
- Good understanding of the Linux operating system and TCP/IP suite of networking protocols, DNS, DHCP, VLANs, routing and switching.
- Experience managing and scaling distributed systems, including configuration management, in public, private, or hybrid cloud environments.
- Excellent verbal and written communication in English and Portuguese. Flexibility to collaborate across global time zones.
- Strong sense of ownership and integrity, demonstrated through clear communication and effective teamwork.
- Exceptional problem-solving abilities, adaptability, and a proactive approach to learning and development.
- Have a valid work permit to work in the country and travel across Europe.
- Willingness to travel as required to provide on-site support.
Good To Have
- Experience in any other object-oriented languages is a plus.
- Experience in cloud native and/or serverless architecture, Slack apps, and Azure/AWS certifications are a plus.
- Experience with CI/CD pipelines, container orchestration tools (e.g., Kubernetes), and monitoring and logging tools (e.g., Prometheus, Grafana) is highly desirable.
Arcesium and its affiliates do not discriminate in employment matters on the basis of race, color, religion, gender, gender identity, pregnancy, national origin, age, military service eligibility, veteran status, sexual orientation, marital status, disability, or any other category protected by law. Note that for us, this is more than just a legal boilerplate. We are genuinely committed to these principles, which form an important part of our corporate culture, and are eager to hear from extraordinarily well qualified individuals having a wide range of backgrounds and personal characteristics.
Apply for this job
*
indicates a required field