Role summary

Senior QA Engineer

Requirements and responsibilities

Requirements

  • Design, build, and maintain automated test suites using Playwright for web and API surfaces, including AI-generated content flows.

  • Lead QA strategy for voice automation pipelines built on ElevenLabs — developing test cases for synthesis quality, latency, and failure modes.

  • Validate Claude (Anthropic) integrations: prompt-response accuracy, edge case handling, safety behaviors, and output consistency across builds.

  • Build and maintain Node.js-based test tooling, harnesses, and custom reporters for CI/CD pipelines.

  • Deploy, monitor, and triage test infrastructure on Google Cloud Platform — leveraging Cloud Run, GCS, and Pub/Sub for scalable test execution.

  • Define and track quality metrics: test coverage, flakiness rates, mean-time-to-detect, and regression velocity.

  • Collaborate with engineers during design reviews to surface testability gaps and advocate for observable, fault-tolerant system design.

  • Mentor junior QA engineers and establish team-wide standards for test authoring, review, and maintenance.

Required Qualifications

  • 5+ years of QA engineering experience, with at least 2 years on systems that include LLMs, AI APIs, or speech/audio pipelines.
  • Expert-level Playwright skills — authoring resilient selectors, managing parallel workers, and debugging flaky tests at scale.
  • Proficient Node.js developer — comfortable writing custom test runners, CLI tooling, and service mocks in TypeScript/JavaScript.
  • Hands-on GCP experience: deploying workloads to Cloud Run or GKE, querying logs in Cloud Logging, configuring artifact storage in GCS.
  • Familiarity with ElevenLabs or comparable TTS/voice APIs — understanding synthesis parameters, webhook flows, and audio quality evaluation.
  • Practical experience testing Claude or other LLMs — designing determinism-aware test strategies, evaluating prompt regressions, and building evals.
  • Strong understanding of REST, WebSocket, and gRPC protocols for API-level testing.
  • Experience integrating test suites into CI/CD pipelines (GitHub Actions, Cloud Build, or similar).

Nice to Have

  • Experience writing custom LLM evals or using evaluation frameworks such as PromptFoo or Braintrust.
  • Background in audio signal quality assessment or speech intelligibility testing.
  • Familiarity with observability tooling: OpenTelemetry, Datadog, or GCP Cloud Monitoring.
  • Knowledge of accessibility testing standards (WCAG 2.1) and assistive technology compatibility.

Core Technology Stack

  • Google Cloud Platform (GCP): Cloud Run, GCS, Pub/Sub, Cloud Logging, GKE for scalable test infrastructure
  • ElevenLabs (Voice Automation): TTS pipeline testing, synthesis quality evaluation, webhook and latency validation
  • Node.js / TypeScript: Custom test runners, service mocks, CLI tooling, and CI/CD integration
  • Playwright: End-to-end and API-level browser automation with parallel execution
  • Claude (Anthropic): LLM integration QA, prompt regression testing, and output evaluation

Focus: QA Engineering
Seniority: Senior
Stack: CI/CD, GCP, JavaScript
Location: 1 country accepted
