Resumen del rol

Architect- Platform Engineer

Requisitos y responsabilidades

Contenido del rol extraído en secciones para revisar más rápido.

About Quantiphi:

  • 3 NVIDIA Partner of the Year awards
  • 3 AWS AI/ML Partner of the Year awards
  • 21x Google Cloud Partner of the Year awards in the past 10 years
  • 3 Snowflake Partner of the Year awards
  • Rated Leaders by Gartner, Forrester, IDC, ISG, Everest Group and other leading analyst firms

Key Responsibilities:

  • Design and implement scalable infrastructure for LLM and GenAI workloads across multi-GPU environments
  • Perform GPU profiling, benchmarking, and performance optimization for distributed training workloads
  • Manage and schedule compute-intensive jobs using Slurm-based clusters and OpenShift/Kubernetes environments
  • Enable and optimize the NVIDIA GPU stack (CUDA, cuDNN, NCCL, Triton, RAPIDS, etc.)
  • Collaborate with cross-functional teams to deploy models in research and production environments
  • Build and support GenAI pipelines (fine-tuning, RAG, multi-modal inferencing, LLMOps)
  • Develop reusable infrastructure templates using tools like Terraform and Helm
  • Contribute to internal innovation (PoCs, workshops) and support client-facing delivery engagements
  • Develop and deliver automation software required for building & improving the functionality, reliability, availability, and manageability of applications and cloud platforms
  • Champion and drive the adoption of Infrastructure as Code (IaC) practices and mindset
  • Design, architect, and build self-service, self-healing, synthetic monitoring and alerting platform and tools
  • Automate the development and test automation processes through CI/CD pipeline (Git, Jenkins, SonarQube, Artifactory, Docker containers)
  • Build container hosting-platform using Kubernetes
  • Introduce new cloud technologies, tools; processes to keep innovating in the commerce area to drive greater business value.
  • Lead the technical discussion regarding architecture designing and troubleshooting with the clients and provide solutions proactively as required

Basic Qualifications:

  • Strong experience with Slurm and distributed training environments
  • Hands-on expertise with Red Hat OpenShift and/or Kubernetes
  • Deep knowledge of the NVIDIA GPU ecosystem (CUDA, cuDNN, NCCL, Nsight, Triton/TensorRT)
  • Strong foundation in Linux systems, performance tuning, and multi-GPU optimization
  • Experience deploying GenAI workloads (LLM fine-tuning, RAG pipelines, multi-modal systems)
  • Familiarity with Infrastructure-as-Code tools (Terraform, Ansible)
  • Experience with cloud GPU environments (GCP, Azure, AWS, OCI) and/or on-prem GPU clusters
  • Serve as a mentor or guide for senior resources / team leads.
  • Lead the technical discussion regarding architecture design

Other Qualifications (OQs):

  • Experience with NVIDIA NIMs, DGX systems, or GPU-accelerated containers
  • Knowledge of LLMOps frameworks and MLOps integration
  • Familiarity with vector databases and retrieval systems for RAG architectures
  • Comfortable working in client-facing environments and collaborating with AI solution teams

Other Qualifications (OQs):

  • Experience working with FHIR R4, HL7 v2, or SMART on FHIR
  • Integration with EHR systems (e.g., Epic)
  • Understanding of HIPAA compliance and healthcare data privacy
  • Exposure to clinical workflows, CDS Hooks, or patient-facing applications
  • Experience building clinical decision support systems or healthcare interoperability solutions

What’s in it for YOU at Quantiphi:

  • Make an impact at one of the world’s fastest-growing AI-first digital engineering companies.
  • Up-skill and discover your potential as you solve complex challenges in cutting-edge areas of technology alongside passionate, talented colleagues.
  • Work where innovation happens - work with disruptive innovators in a research-focused organization with 60+ patents filed across various disciplines.
  • Stay ahead of the curve, immerse yourself in breakthrough AI, ML, data, and cloud technologies and gain exposure working with Fortune 500 companies.
Roles similares

Mantén una lista de respaldo.

Ver stack
FocoPlatform EngineeringÁrea del rol
Señal de senioritySeniorNivel del candidato
StackAWS, Azure, CI/CDSkills principales
Ubicación1 país aceptadoElegibilidad

Stack

Usa estas tags para comparar roles remotos similares.

Elegibilidad de ubicación

Candidatos deberían aplicar solo cuando el país del perfil aparece aquí.

Tu perfilPaís no definidoInicia sesión para comparar tu país con este rol.

Flujo de contratación

WithMira muestra el rol y luego envía candidatos a la aplicación de la empresa.

1Revisa fit del rol, stack y elegibilidad de ubicación en WithMira.
2Abre la página de aplicación de la empresa desde el link rastreado.
3Guarda el rol o suscríbete a oportunidades similares antes de salir.
Aplicar en el sitio de la empresaSitio de la empresaAbrir link