Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
Текст:
TL;DR
AI Inference Engineer (AI): Developing and optimizing large-scale machine learning model deployment for real-time inference with an accent on API development, performance benchmarking, and system reliability. Focus on exploring novel research, implementing LLM inference optimizations, and addressing system bottlenecks.
Location: San Francisco
Salary: $200K – $350K
Company
Perplexity is seeking an AI Inference engineer to join their growing team.
What you will do
- Develop APIs for AI inference for both internal and external customers.
- Benchmark and address bottlenecks throughout the inference stack.
- Improve the reliability and observability of systems and respond to system outages.
- Explore novel research and implement LLM inference optimizations.
Requirements
- Experience with ML systems and deep learning frameworks (e.g., PyTorch, TensorFlow, ONNX).
- Familiarity with common LLM architectures and inference optimization techniques (e.g., continuous batching, quantization).
- Understanding of GPU architectures or experience with GPU kernel programming using CUDA.
Culture & Benefits
- Full-time U.S. employees enjoy a comprehensive benefits program including equity, health, dental, vision, retirement, fitness, commuter, and dependent care accounts.
- Full-time employees outside the U.S. enjoy a comprehensive benefits program tailored to their region of residence.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →
Похожие вакансии
FAIR
3 часа назад
Research Engineer, SysML (AI)
141 000 - 208 000$
Thinking Machines Lab
3 дня назад
Software Engineer, Research Acceleration (AI)
350 000 - 475 000$
7 дней назад
Staff AI Inference and Acceleration Engineer (Robotics)
180 000 - 275 000$
3 дня назад
AI Engineer (Robotics)
200 000 - 290 000$
NDA
3 часа назад
AI Outcome Customer Engineer (AI)
183 000 - 266 000$
Hebbia
2 дня назад
Applied Research Engineer (AI)
160 000 - 300 000$