Block

Senior Machine Learning Engineer, Model Risk Management

Remote 11003 Risk - Prod Dev - Square role with clear candidate location fit.

PostedRecently added

Eligible countries27 accepted countries

Seniority signalSenior

Work settingRemote

Accepted candidate locations

ArgentinaBahamasBarbadosBelizeBermudaBolivia+21 more

Can I actually apply?Check country list

Accepted candidate countries are listed (27).

Source freshnessRecently added

Location fit27 accepted countries

Stack matchAWS, GCP

Apply pathCompany site

Mira fit summaryWhy this role is worth checking

Location fit27 accepted countriesAdd your country

Stack matchAdd profile skills to compareAWS, GCP

Seniority signalSeniorSet your level for a sharper check.

Apply readinessCompany siteApplication continues on the company site.

Application

Apply on company site

External apply

Applying forSenior Machine Learning Engineer, Model Risk ManagementBlock

Country fit27 accepted countries

Apply pathCompany site

WithMiraSave or subscribe before leaving

Company application

WithMira keeps this role for discovery. The application continues on the company site.

Role overview

Readable role content extracted into sections for faster review.

Independently challenge model owners across lending, fraud, and AML: reproduce their results, set and defend the acceptance thresholds, and own the call on whether a model is sound.
Hunt the silent errors that make metrics lie, and prove them out before they reach production.
Choose evaluation that holds up under real conditions: rare events, shifting populations, and drift that only shows up after launch.
Work hands-on in codebases you did not write, learning the data, configs, and conventions, and ship production code in the tooling you build to validate them.
Build the agentic validation tooling the team depends on, orchestrating agents that run in parallel.
Reason about ML systems end to end — how features, training, serving, monitoring, and scale fit together — to evaluate and challenge an owner's design.
Tie explainability and fair-lending findings on consumer credit models back to the model and product decisions that follow.
Help define how Block validates the systems at the frontier of production AI, setting standards where none exist yet.

A quantitative degree or equivalent experience, and senior-IC depth building or validating models in a high-stakes domain such as credit, fraud, or financial crime.
Command of effective-challenge methodology: reproduction, conceptual-soundness review, benchmarking, stress testing, and outcomes analysis, with an eye for how a model holds up after launch and where its assumptions break.
Deep applied ML and statistics across model families, from regression and tree ensembles to deep learning, with sound judgment about evaluation, calibration, and generalization.
Experimentation and statistical rigor: holdout and experiment design, reasoning about uncertainty, and evaluating a model beyond aggregate accuracy.
Solid software and data engineering: production-quality Python, SQL on large datasets, and reproducible, tested code.
Fluency with modern AI: building with LLMs and agentic tools, and the judgment to know when their output can be trusted.
Familiarity with model risk management frameworks and fair-lending standards, with the specifics learnable on the job.
The communication to explain and defend your conclusions to model owners and senior stakeholders, and the independence to operate under ambiguity.

Python (NumPy, Pandas, scikit-learn, LightGBM, XGBoost, PyTorch)
AI dev tools: Claude Code, Cursor, Copilot; agent skills and frameworks for building LLM-powered tooling
MLflow / Databricks; Prefect on GCP Vertex AI
Snowflake and cloud object storage
GitHub and CI (ruff, pytest)
Jira and Linear
GCP and AWS

Similar roles