
Site Reliability Engineer (Crypto Trading)
Hong Kong
We are hiring for one of our ecosystem projects in the digital asset space, which is seeking a skilled Site Reliability Engineer (SRE) to join its Digital Asset Team. In this role, you will be responsible for ensuring the reliability, scalability, and security of the trading platform, which handles high-frequency, low-latency transactions in a 24/7 environment. You will collaborate with software engineers, traders, and security teams to maintain and enhance the infrastructure, optimize system performance, and implement robust incident response strategies.
Key Responsibilities
- Design, implement, and maintain infrastructure that delivers 99.99% uptime for our trading platform and associated applications while minimizing latency.
- Support and optimize DevOps processes specific to trading systems, including trade execution pipelines, market data feeds, and order management systems, ensuring low-latency and high-throughput performance.
- Manage and enhance DevOps workflows for trading-related applications, including user interfaces, APIs, and backend services, to ensure seamless integration and deployment.
- Develop and maintain monitoring, alerting, and logging systems for both trading and application infrastructure to proactively identify and resolve performance bottlenecks and system anomalies.
- Lead incident response, root cause analysis (RCA), and post-mortem processes for trading and application systems, driving improvements to prevent recurrence and enhance resilience.
- Build and maintain CI/CD pipelines, Infrastructure as Code (IaC), and automated workflows for both trading and application environments to streamline deployments and reduce manual intervention.
- Optimize infrastructure to handle high transaction volumes, market spikes, and application traffic, ensuring seamless performance under load.
- Collaborate with security teams to implement best practices for securing digital asset transactions and application services, including key management, encryption, and compliance with regulatory standards.
- Continuously optimize system performance, focusing on reducing latency in trade execution, data processing, and application response times.
- Work closely with development teams to ensure reliable code deployments for trading systems and applications, providing guidance on writing performant, production-ready code.
- Forecast infrastructure needs based on trading volume trends, application usage, and market demands, ensuring scalability and cost efficiency.
- Maintain detailed documentation of trading and application systems, processes, and incident reports to support team knowledge sharing and compliance.
Qualifications
- Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience).
- 5+ years of experience as an SRE or DevOps Engineer, or in a similar role, in a high-availability, low-latency environment.
- Hands-on experience with trading and application support, including supporting trading platforms, market data systems, or financial applications; managing CI/CD pipelines; and deploying scalable web or backend applications.
- Track record of managing production systems with 24/7 uptime requirements.
- Proficiency in cloud platforms (e.g., AWS, Azure, GCP) and containerization technologies (e.g., Docker, Kubernetes).
- Strong scripting skills in Python, Bash, or Go for automation and tooling in both trading and application contexts.
- Expertise in Infrastructure as Code tools (e.g., Terraform, Ansible, CloudFormation).
- Experience with monitoring tools (e.g., Prometheus, Grafana, Datadog) and logging frameworks (e.g., ELK Stack, Splunk) for trading and application systems.
- Familiarity with databases (e.g., PostgreSQL, Redis, MongoDB) and message queues (e.g., Kafka, RabbitMQ) used in trading and application environments.
- Knowledge of network protocols, security practices, and distributed systems.
Preferred Qualifications
- Experience with blockchain technology, cryptocurrency exchanges, or digital asset custody.
- Familiarity with regulatory frameworks (e.g., KYC/AML, GDPR, SOC 2).
- Certifications such as AWS Certified DevOps Engineer, Google Professional Cloud DevOps Engineer, or equivalent.