Snowflake

AI System Research and Development Engineer - Optimization

Posted 4 Days Ago
2 Locations
Senior level
The role involves optimizing GPU kernel performance and enhancing deep learning system efficiency, while collaborating with a specialized team and staying updated on advancements.
The summary above was generated by AI

Build the future of the AI Data Cloud. Join the Snowflake team.

We are looking for talented System Developers and Researchers to join the Snowflake AI Research team and contribute to LLM inference and training system development, optimizations, and agentic systems. Our mission is to build the most efficient and scalable generative AI systems.

Recent releases from our team include SwiftKV, an advanced inference optimization, and Arctic LLM, one of the largest open-source MoE foundation models. This is an exciting opportunity to collaborate with a world-class team, including founding members of DeepSpeed, vLLM, and TensorFlow. Together, we will push the boundaries of deep learning systems and drive cutting-edge innovations in AI.

Responsibilities:

  • Analyze and optimize GPU kernel performance for training and inference of LLMs.

  • Develop and implement strategies to enhance the efficiency and scalability of deep learning systems.

  • Profile and benchmark deep learning systems using tools and techniques to identify bottlenecks.

  • Design and implement optimizations to reduce latency and improve resource utilization for training and inference.

  • Stay updated with the latest advancements in GPU kernel optimization, deep learning, and LLM system development.

  • Contribute to the development of agentic frameworks and applications for LLM-driven workflows, enhancing automation, reasoning, and decision-making capabilities.

  • Open-source and publish innovations, optimizations, and engineering practices in technical blogs and at top-tier conferences and journals.

Requirements:

  • Bachelor’s degree in Computer Science, Electrical Engineering, or a related field. A Master’s degree or PhD is preferred.

  • 5 years of experience in GPU kernel optimization, deep learning system optimization, or high-performance computing (HPC).

  • Proficiency in deep learning frameworks such as PyTorch, TensorFlow, or JAX.

  • Strong understanding of GPU architectures and experience with CUDA or similar frameworks.

  • Experience with GPU libraries and frameworks such as CUTLASS, Triton, or cuDNN.

  • Experience with profiling tools (e.g., nvprof, Nsight) and performance analysis methodologies.

  • Solid problem-solving skills and ability to debug complex performance issues.

  • Excellent communication skills and ability to work effectively in a cross-functional team environment.

Join us in optimizing deep learning systems and pushing the boundaries of AI efficiency. Apply now to be part of our dynamic and pioneering team!

Every Snowflake employee is expected to follow the company’s confidentiality and security standards for handling sensitive data. Snowflake employees must abide by the company’s data security plan as an essential part of their duties. It is every employee's duty to keep customer information secure and confidential.

Snowflake is growing fast, and we’re scaling our team to help enable and accelerate our growth. We are looking for people who share our values, challenge ordinary thinking, and push the pace of innovation while building a future for themselves and Snowflake.

How do you want to make your impact?

For jobs located in the United States, please visit the job posting on the Snowflake Careers Site for salary and benefits information: careers.snowflake.com

Top Skills

CUDA
cuDNN
CUTLASS
Deep Learning System Optimization
GPU Kernel Optimization
JAX
Nsight
nvprof
PyTorch
TensorFlow
Triton

Snowflake Bellevue, Washington, USA Office

In the heart of Silicon Valley, you'll find our 4-story, 2-tower San Mateo hub, which actually emerged from the very spot Snowflake started in 2012 – it all began in one of our founder's humble San Mateo apartments.

What you need to know about the Seattle Tech Scene

Home to tech titans like Microsoft and Amazon, Seattle punches far above its weight in innovation. But its surrounding mountains, sprinkled with world-famous hiking trails and climbing routes, make the city a destination for outdoorsy types as well. Established as a logging town before shifting to shipbuilding and logistics, the Emerald City is now known for its contributions to aerospace, software, biotech and cloud computing. And its status as a thriving tech ecosystem is attracting out-of-town companies looking to establish new tech and engineering hubs.

Key Facts About Seattle Tech

  • Number of Tech Workers: 287,000; 13% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Amazon, Microsoft, Meta, Google
  • Key Industries: Artificial intelligence, cloud computing, software, biotechnology, game development
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Madrona, Fuse, Tola, Maveron
  • Research Centers and Universities: University of Washington, Seattle University, Seattle Pacific University, Allen Institute for Brain Science, Bill & Melinda Gates Foundation, Seattle Children’s Research Institute