Job description
TL;DR
Software Engineer (AI): Build and optimize LLM serving infrastructure, with an emphasis on GPU performance and disaggregated serving architectures. The role centers on profiling, benchmarking, and improving the serving stack for efficient model deployment.
Location: Must be based in Sunnyvale, CA or Seattle, WA, USA
Salary: $207,000–$301,000
What you will do
- Design and implement disaggregated serving architectures for large language models.
- Enhance the LLM serving stack to improve throughput and latency.
- Profile and benchmark LLMs on GPU accelerators to identify performance bottlenecks.
- Build custom performance-analysis tooling to monitor system efficiency.
- Collaborate with research and SRE teams to deploy LLMs into production environments.
Requirements
- Must be based in Sunnyvale, CA or Seattle, WA, USA.
- Strong experience in distributed computing and performance profiling.
- Deep understanding of GPU acceleration and LLM serving stacks.
- Proficiency in benchmarking and debugging complex machine learning systems.
- Ability to work effectively with cross-functional research and engineering teams.