Reliability Engineer IV
Job description
TL;DR
Reliability Engineer IV (Cloud): Ensure availability, performance, monitoring, and incident response for cloud platforms and services, with an emphasis on production readiness requirements, reliability modeling, and incident-driven improvements. The role focuses on solving complex reliability issues using MTTR/MTTF metrics, root cause analysis, and predictive maintenance to prevent downtime and meet contractual deliverables.
Company
The company is a defense technology platform delivering advanced manufacturing solutions and technology-enabled services for U.S. national security customers.
What you will do
- Ensure cloud platforms and services meet production requirements (diagrams, dependencies, monitoring/logging plans, backups, and high-availability setups).
- Manage reliability issues including uncaught exceptions, hardware degradation, networking problems, high resource usage, and slow responses.
- Evaluate and analyze products/components/materials/equipment to predict failures and improve reliability.
- Create prototypes, run product tests, interpret results using statistical distributions and reliability models, and recommend design/manufacturing changes.
- Monitor production equipment diagnostics and maintenance records to predict and prevent downtime; perform root cause analysis and corrective actions.
- Collaborate with engineering and development teams; support reliability program evaluations for subcontractors and provide technical support for operational strategies.
Requirements
- Clearance: Active Secret Clearance
- Bachelor’s degree with 8 years of experience, or Master’s degree with 6 years of experience.
- Proficiency in reliability modeling, failure mode analysis, and predictive maintenance methodologies.
- Ability to develop and implement reliability solutions with significant autonomy; strong analytical and problem-solving skills.
- DoD 8570 / 8140 IAT Level II certification.
- At least one cloud certification.
Culture & Benefits
- Full-time, permanent employee role; fully remote.
- Health and wellness programs, income protection, paid leave, and retirement/savings benefits.
- Competitive compensation with learning and development opportunities.
- Flexibility to balance quality work and personal life.
Hiring process
- Staff meeting and storytime participation (camera on) as part of ongoing team routines.