Intern - AI Cluster Engineering (Summer 2026)
San Jose, CA
Job Title: Intern - AI Cluster Engineering (Summer 2026)
Office Location: San Jose, CA
Work Model: Onsite
Duration: June 2026 - August 2026
At SK hynix America, we're at the forefront of semiconductor innovation, developing advanced memory solutions that power everything from smartphones to data centers. As a global leader in DRAM and NAND flash technologies, we advance mobile technology, empower cloud computing, and pioneer future technologies. Our cutting-edge memory technologies are essential in today's most advanced electronic devices and IT infrastructure, enabling enhanced performance and user experiences across the digital landscape.
We're looking for innovative minds to join our mission of shaping the future of technology. At SK hynix America, you'll be part of a team that's pioneering breakthrough memory solutions while maintaining a strong commitment to sustainability. We're not just adapting to technological change; we're driving it, with significant investments in artificial intelligence, machine learning, and eco-friendly solutions and operational practices. As we continue to expand our market presence and push the boundaries of what's possible in semiconductor technology, we invite you to be part of our journey to create the next generation of memory solutions that will define the future of computing.
About the Role:
- We are building an AI DC-level software framework, a cutting-edge platform for validating next-generation, full-stack AI infrastructure. We are looking for talented interns to develop the AI application workload framework.
- Beyond AI models, AI applications will be containerized, optimized, and stress-tested on versatile GPU clusters to find architectural bottlenecks.
- Interns will onboard diverse AI applications onto our AI cluster and optimize them, deep-diving into specific domains like Generative AI (LLM), Physical AI (Robotics), and AI for Science (Bio/Physics) to benchmark their performance on the latest GPUs, DPUs, and network fabrics.
Responsibilities:
- Onboard Diverse AI Applications:
  - Port and deploy state-of-the-art AI workloads, including LLM (vLLM, TGI), Physical AI (Isaac Sim, ROS 2), and Scientific AI (AlphaFold, GROMACS).
  - Resolve software dependency issues and build optimized Docker images for the A^3 Registry.
- Application Profiling & Analysis:
  - Analyze the unique resource patterns of each application (e.g., “Isaac Sim requires heavy ray-tracing capability” or “Megatron-LM is bottlenecked by RDMA”).
  - Identify performance bottlenecks using profiling tools (Nsight Systems, PyTorch Profiler).
- Develop "Test Recipes":
  - Define the standard configuration (Recipe) for each application to ensure reproducible testing.
- Collaboration: Work with infrastructure engineers to tune the system (OS, Network, Storage) to fit your application's needs.
Qualifications:
- Education: Currently pursuing an MS or PhD in Computer Science, Electrical Engineering, AI, or a related field.
- Programming: Strong proficiency in Python (Bash scripting is a plus).
- Containerization: Experience with Docker (building Dockerfiles, managing dependencies).
- AI Fundamentals: Basic understanding of Deep Learning workflows (Training vs. Inference) and frameworks (PyTorch, TensorFlow).
- OS: Comfort with the Linux command-line environment.
Preferred Qualifications:
- Orchestration: Experience with Kubernetes (K8s) or Slurm.
- Profiling: Experience with performance profiling tools (Nsight Systems, DCGM, PyTorch Profiler).
- Domain Expertise (in one of the following):
  - LLM: Experience with vLLM, TGI, or TensorRT-LLM.
  - Distributed Systems: Experience with Multi-node training (Megatron-LM, DeepSpeed).
  - Robotics: Experience with ROS 2, Isaac Sim, or Reinforcement Learning.
  - HPC/Science: Experience with MPI, OpenMP, or Bio-informatics tools.
- Hardware: Curiosity about computer architecture (GPU, DPU, Memory, Network Fabric).
Housing Allowance:
Eligible interns will receive a housing allowance during their internship.
Equal Employment Opportunity:
SKHYA is an Equal Employment Opportunity Employer. We provide equal employment opportunities to all qualified applicants and employees, and prohibit discrimination and harassment of any type without regard to race, sex, pregnancy, sexual orientation, religion, age, gender identity, national origin, color, protected veteran or disability status, genetic information or any other status protected under federal, state, or local applicable laws.
Compensation:
Our compensation reflects the cost of labor across several U.S. geographic markets, and we pay differently based on those defined markets. Pay within the provided range varies by work location and may also depend on job-related skills and experience. Your Recruiter can share more about the specific salary range for the job location during the hiring process.
Pay Range
$26 - $50 USD