
Senior Staff Engineer, Microservice Governance
Singapore, Singapore
OKX will be prioritising applicants who have a current right to work in Singapore, and do not require OKX's sponsorship of a visa
Who We Are
At OKX, we believe that the future will be reshaped by crypto, ultimately contributing to every individual's freedom. OKX is a leading crypto exchange and the developer of OKX Wallet, giving millions access to crypto trading and decentralized crypto applications (dApps). OKX is also trusted by hundreds of large institutions seeking access to crypto markets. We are safe and reliable, backed by our Proof of Reserves. Across our multiple offices globally, we are united by our core principles: We Before Me, Do the Right Thing, and Get Things Done. These shared values drive our culture, shape our processes, and foster a friendly, rewarding, and diverse environment for every OK-er. OKX is part of OKG, a group that brings the value of blockchain to users around the world through our leading products, including OKX, OKX Wallet, and OKLink.
About the Team
The Middleware team covers multiple domains, including Microservice Governance, RPC frameworks, Message Queues, and Data Middleware. It provides robust infrastructure support for the entire company's business and product lines.
What You’ll Be Doing
- Lead the top-level design and long-term planning of the company's unified Microservice governance system, covering various communication methods like RPC, messaging, and gateways, to build a stable, efficient, and intelligent service governance platform.
- Spearhead the R&D and implementation of adaptive governance algorithms, including but not limited to adaptive rate limiting based on queuing theory or Little's Law, adaptive circuit breaking based on error rates and response times, and adaptive load balancing based on node load and health status.
- Conduct in-depth research into the core mechanisms of mainstream RPC frameworks (e.g., Dubbo3, gRPC), including service discovery, load balancing, serialization protocols, and threading models, and lead the deep customization and performance optimization of these frameworks.
- Take charge of the architectural evolution and high-availability construction of core Middleware such as configuration centers (Apollo), service registries (etcd, Nacos, ZK), and distributed task schedulers (XXL-JOB, Argo Workflow), ensuring the ultimate stability of foundational services.
- Design and promote the platform-level implementation of advanced release strategies like canary release, blue-green deployment, and traffic dyeing to improve R&D delivery efficiency and the stability of system changes.
- Build a resilience-oriented architecture with a "desired state" mindset, introducing chaos engineering principles and tools to proactively identify system vulnerabilities and continuously enhance the system's anti-fragility.
- As a Middleware architect, provide expert-level guidance to business units on Microservice decomposition, high-availability architecture design, and performance/capacity planning.
What We Look For In You
- Bachelor's degree or higher in Computer Science or a related field, with 8+ years of R&D experience in Middleware or distributed systems.
- Proficient in Java or Golang, with a deep understanding of JVM/GC tuning or the Go runtime, and extensive experience in online troubleshooting and performance optimization.
- Systematic knowledge and profound practical experience in Microservice governance areas such as rate limiting, circuit breaking, degradation, isolation, retries, and load balancing, with in-depth research into the underlying algorithmic principles.
- In-depth understanding of the source code and design philosophy of at least one mainstream RPC framework (Dubbo3, gRPC) or service governance framework (Spring Cloud, Sentinel).
- Familiarity with the implementation principles of service registries/configuration centers like Nacos, etcd, and Zookeeper, with a solid foundation in CAP/BASE theory and consensus algorithms like Raft/ZAB.
- Rich architectural design experience in the Middleware domain, capable of designing and implementing complex distributed systems from scratch (0 to 1).
- Proven experience and successful cases in areas like adaptive algorithms, full-link stress testing, or chaos engineering are highly preferred.
- Excellent abstraction skills and architectural thinking, adept at modeling complex problems and designing elegant, scalable systems.
Perks & Benefits
- Competitive total compensation package
- L&D programs and Education subsidy for employees' growth and development
- Various team building programs and company events
- Wellness and meal allowances
- Comprehensive healthcare schemes for employees and dependants
- More that we'd love to tell you about along the way!
Information collected and processed as part of the recruitment process of any job application you choose to submit is subject to OKX's Candidate Privacy Notice.