Back to jobs
Staff Cloud Infrastructure Engineer (Kubernetes)
San Jose, California, United States
Who We Are
At OKX, we believe that the future will be reshaped by crypto, and ultimately contribute to every individual's freedom.
OKX is a leading crypto exchange, and the developer of OKX Wallet, giving millions access to crypto trading and decentralized crypto applications (dApps). OKX is also a trusted brand by hundreds of large institutions seeking access to crypto markets. We are safe and reliable, backed by our Proof of Reserves.
Across our multiple offices globally, we are united by our core principles: We Before Me, Do the Right Thing, and Get Things Done. These shared values drive our culture, shape our processes, and foster a friendly, rewarding, and diverse environment for every OK-er.
OKX is a leading crypto exchange, and the developer of OKX Wallet, giving millions access to crypto trading and decentralized crypto applications (dApps). OKX is also a trusted brand by hundreds of large institutions seeking access to crypto markets. We are safe and reliable, backed by our Proof of Reserves.
Across our multiple offices globally, we are united by our core principles: We Before Me, Do the Right Thing, and Get Things Done. These shared values drive our culture, shape our processes, and foster a friendly, rewarding, and diverse environment for every OK-er.
About the Team
Cloud Infrastructure Engineering is a critical engineering discipline and a job function in the company. Its charter is to build tools and infrastructure that promote early detection of production failures, leading to a stellar customer experience. Our work is to drive safety, health, and uptime of our platform, and the ability to remedy unforeseen problems. By removing some of the complex burdens on how to scale and maintain uptime in distributed systems, Cloud Infrastructure Engineer allows development teams to focus on feature development instead of the nuances of achieving and maintaining service level commitments.
About the Opportunity
We’re looking for a creative and driven individual that can spearhead our effort to push “outside the box” infrastructure implementations, focusing on Kubernetes and container technologies, that will have a tremendous impact on our platform’s stability and scalability.
What You’ll Be Doing
-
Maintain and configure AWS and Alibaba Cloud products and services
-
Investigate new Kubernetes features and provide guidance and suggestions for current systems
-
Maintain service access, cost optimization, etc. for each Kubernetes environment
-
Prepare relevant documentation for Kubernetes operation, maintenance, and specifications
-
Architect, deploy, and manage Kubernetes environments to ensure high availability, scalability, and security
-
Monitor and optimize the performance of containerized applications and Kubernetes clusters
-
Develop and maintain infrastructure as code (IaC) using tools like Terraform or Helm
-
Collaborate with development teams to ensure seamless integration and deployment of new features
What We Look For In You
-
Bachelors degree or above, major in Computer Science or relevant domains, with over 6 years of experience in DevOps, SRE or related positions
-
Familiar with the Linux operating system, TCP/IP network protocol, and other basic computer knowledge
-
Relevant developer experience and familiarity with at least one scripting language (Shell/Python/Go)
-
Proficient in Kubernetes (k8s) administration, including deployment, scaling, and management of containerized applications
-
Familiar with the management, scheduling, operation, safety, and other features of Kubernetes products
-
Familiar with Kubernetes extensions such as Operator/CRD/CSI/CNI/CRI, and have relevant operation and maintenance or development experience
-
Strong engineering skills with proficiency in at least one operation and maintenance or infrastructure sub-area, public cloud networking, SRE, DevOps, or cloud-native application
-
Familiar with the content and processes of operation and maintenance work, drive processes, and grasp the overall concept of the system
-
Solid Linux platform operation and maintenance and debugging capabilities, proficient in troubleshooting, configuration tuning, and performance analysis
-
Experience with cloud platforms such as AWS, Google Cloud, or Azure, specifically with Kubernetes services like EKS, GKE, or AKS
-
Experience in monitoring, logging, and alerting solutions for Kubernetes environments
Nice to Have
-
Bilingual in English and Mandarin
-
Familiar with the operation and maintenance management of Alibaba Cloud, Google Cloud, Microsoft Cloud, and other cloud providers
Perks & Benefits
- Competitive total compensation package
- L&D programs and Education subsidy for employees' growth and development
- Various team building programs and company events
OKX Statement
The base salary range for this position is $240,000 to $280,000. The salary offered depends on a variety of factors, including job-related knowledge, skills, experience, and market location. In addition to the salary, a performance bonus and long-term incentives may be provided as part of the compensation package, as well as a full range of medical, financial, and/or other benefits, dependent on the position offered. Applicants should apply via OKX internal or external careers site.
OKX is committed to equal employment opportunities regardless of race, color, genetic information, creed, religion, sex, sexual orientation, gender identity, lawful alien status, national origin, age, marital status, and non-job related physical or mental disability, or protected veteran status. Pursuant to the San Francisco Fair Chance Ordinance, we will consider employment-qualified applicants with arrest and conviction records.
Apply for this job
*
indicates a required field