1 день назад
HPC Solutions Engineer (GPU)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
Текст:
TL;DR
HPC Solutions Engineer (GPU): Building and maintaining large-scale GPU clusters for premium clients with an accent on bespoke resource orchestration and high-performance infrastructure. Focus on automating deployment, optimizing distributed computing environments, and ensuring seamless integration of machine learning workloads.
Company
is a mission-driven startup building a marketplace for compute by directly matching independent data centers and hardware providers to end users.
What you will do
- Collaborate with customers to define technical requirements and optimize their use of distributed GPU resources.
- Manage and coordinate GPU cluster operations, including high-speed networking via InfiniBand.
- Oversee deployment and maintenance of machine learning environments using SLURM and virtual storage.
- Automate infrastructure provisioning and management using Ansible and Terraform.
- Conduct performance tuning to maximize throughput and reduce latency for complex workloads.
- Document operational procedures and maintain system configurations.
Requirements
- 7+ years of experience in high performance compute, distributed machine learning, or system architecture.
- Proficiency in managing NVIDIA GPU environments and associated frameworks.
- Strong experience with high-speed networking technologies, specifically InfiniBand.
- Expertise in HPC job schedulers, preferably SLURM.
- Strong automation skills using Ansible and Terraform.
- Strong coding skills in Python and familiarity with machine learning libraries.
Culture & Benefits
- Fully remote role with a high-accountability, high-agency culture.
- Opportunity to work with diverse hardware configurations and cutting-edge GPU use cases.
- Competitive salary, equity, and benefits package.
- Flexible PTO policy.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →