Steer Health

Senior QA Engineer

Rol remoto de QA Engineering con fit claro de ubicación del candidato.

Publicado11 jun 2026

Países elegibles1 país aceptado

Señal de senioritySenior

Modelo de trabajoRemoto

Ubicaciones aceptadas para candidatos

India

CI/CD GCP JavaScript LLM Node.js REST TypeScript

Puedo aplicar realmente?Revisa la lista de países

Las ubicaciones aceptadas para candidatos están listadas (1).

Actualidad de la fuente11 jun 2026

Fit de ubicación1 país aceptado

Match de stackCI/CD, GCP

Camino de aplicaciónSitio de la empresa

Resumen de fit de MiraPor qué vale revisar este rol

Fit de ubicación1 país aceptadoAgrega tu país

Match de stackAgrega skills al perfil para compararCI/CD, GCP

Señal de senioritySeniorDefine tu nivel para una revisión más precisa.

Preparación para aplicarSitio de la empresaLa aplicación continúa en el sitio de la empresa.

Aplicación

Aplicar en el sitio de la empresa

Aplicación externa

Aplicando aSenior QA EngineerSteer Health

Fit de país1 país aceptado

Camino de aplicaciónSitio de la empresa

WithMiraGuarda o suscríbete antes de salir

Aplicación de la empresa

WithMira mantiene este rol para descubrimiento. La aplicación continúa en el sitio de la empresa.

Aplicar en el sitio de la empresa

Guardar rol

Resumen del rol

Senior QA Engineer

Requisitos y responsabilidades

Contenido del rol extraído en secciones para revisar más rápido.

Requirements

Design, build, and maintain automated test suites using Playwright for web and API surfaces, including AI-generated content flows.

Requirements

Lead QA strategy for voice automation pipelines built on ElevenLabs — developing test cases for synthesis quality, latency, and failure modes.

Requirements

Validate Claude (Anthropic) integrations: prompt-response accuracy, edge case handling, safety behaviors, and output consistency across builds.

Requirements

Build and maintain Node.js-based test tooling, harnesses, and custom reporters for CI/CD pipelines.

Requirements

Deploy, monitor, and triage test infrastructure on Google Cloud Platform — leveraging Cloud Run, GCS, and Pub/Sub for scalable test execution.

Requirements

Define and track quality metrics: test coverage, flakiness rates, mean-time-to-detect, and regression velocity.

Requirements

Collaborate with engineers during design reviews to surface testability gaps and advocate for observable, fault-tolerant system design.

Requirements

Mentor junior QA engineers and establish team-wide standards for test authoring, review, and maintenance. Required Qualifications
5+ years of QA engineering experience, with at least 2 years on systems that include LLMs, AI APIs, or speech/audio pipelines.
Expert-level Playwright skills — authoring resilient selectors, managing parallel workers, and debugging flaky tests at scale.
Proficient Node.js developer — comfortable writing custom test runners, CLI tooling, and service mocks in TypeScript/JavaScript.
Hands-on GCP experience: deploying workloads to Cloud Run or GKE, querying logs in Cloud Logging, configuring artifact storage in GCS.
Familiarity with ElevenLabs or comparable TTS/voice APIs — understanding synthesis parameters, webhook flows, and audio quality evaluation.
Practical experience testing Claude or other LLMs — designing determinism-aware test strategies, evaluating prompt regressions, and building evals.
Strong understanding of REST, WebSocket, and gRPC protocols for API-level testing.
Experience integrating test suites into CI/CD pipelines (GitHub Actions, Cloud Build, or similar).
Nice to Have
Experience writing custom LLM evals or using evaluation frameworks such as PromptFoo or Braintrust.
Background in audio signal quality assessment or speech intelligibility testing.
Familiarity with observability tooling: OpenTelemetry, Datadog, or GCP Cloud Monitoring.
Knowledge of accessibility testing standards (WCAG 2.1) and assistive technology compatibility. Core Technology Stack:Google Cloud Platform (GCP)
Cloud Run, GCS, Pub/Sub, Cloud Logging, GKE for scalable test infrastructure ElevenLabs — Voice Automation TTS pipeline testing, synthesis quality evaluation, webhook and latency validation
Node.js / TypeScript Custom test runners, service mocks, CLI tooling, and CI/CD integration
Playwright End-to-end and API-level browser automation with parallel execution
Claude (Anthropic) LLM integration QA, prompt regression testing, and output evaluation