Назад
Company hidden
10 часов назад

Senior MLOps Engineer (AI)

Формат работы
hybrid
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
Germany
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Senior MLOps Engineer (AI): Building and maintaining scalable machine learning infrastructure and automated pipelines for model training, deployment, and monitoring with an accent on reliability, reproducibility, and performance. Focus on bridging the gap between data science and production engineering by developing self-service tools, optimizing Kubernetes-based workflows, and ensuring high availability for ML services.

Location: Must be based in Berlin or able to commute to the Berlin office 3 times a week.

Company

hirify.global is a leading global travel search engine and part of Booking Holdings, operating a portfolio of brands including momondo, Cheapflights, and HotelsCombined.

What you will do

  • Build and operate end-to-end ML infrastructure, including CI/CD pipelines and model orchestration.
  • Define standards for model serving to ensure low latency and high availability.
  • Develop core MLOps capabilities like feature stores, model registries, and automated performance monitoring.
  • Enable Kubernetes autoscaling and GPU provisioning as self-service tools for ML practitioners.
  • Design resilient monitoring and observability systems to reduce manual interventions.
  • Create standardized workflows to streamline the model development lifecycle for Data Scientists.

Requirements

  • Experience building and operating ML platforms in production environments.
  • Solid working knowledge of containerization (Docker), orchestration (Kubernetes), and Linux internals.
  • Familiarity with ML lifecycle tooling, including orchestration frameworks, feature stores, and model registries.
  • Experience owning production systems, defining SLOs, and building observability using tools like Prometheus, Grafana, or Datadog.
  • Proficiency in writing production-quality code in Python.
  • Must be able to commute to the Berlin office 3 times a week.

Nice to have

  • Experience with incident response and diagnosing large-scale system failures.
  • Background in modernizing production infrastructure with a focus on reliability and cost-efficiency.

Culture & Benefits

  • 6 weeks paid vacation plus a birthday day off.
  • Company-wide week off per year to fully recharge.
  • Mental health support including company-paid therapy and HeadSpace subscription.
  • Development Dollars, leadership training, and access to thousands of e-learnings.
  • Pension plan contributions, public transportation subsidies, and bike leasing.
  • Free lunch 2 days per week and regular social events.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →