Traditional hiring decisions rely on resume review, interview impressions, and reference checks - all useful but all subject to known cognitive biases and limited predictive power. Predictive hiring analytics adds a statistical layer by analyzing the patterns in historical hires: which application source produced the highest performers, which interviewer combinations correlate with retention, which assessment scores predict 12-month outcomes, which combinations of credentials and experience translate into actual on-the-job results.

Mature implementations integrate signals from the ATS (source, application data, time-in-stage), assessment platforms (cognitive, personality, work-sample), interview scorecards (calibrated structured ratings), and HRIS (post-hire performance, retention, promotion history). Models are validated on held-out historical data and monitored for degradation over time. The output is typically a probability score for the hiring decision - not a recommendation, but a data point alongside human judgment.
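The held-out validation and probability-score output described above can be sketched in a few lines. Everything here is synthetic and illustrative: the feature names, the data, and the model choice are assumptions, not Treegarden fields or a prescribed method.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(0)
n = 400
# hypothetical predictors: assessment score, structured interview rating, source quality index
X = rng.normal(size=(n, 3))
# synthetic 12-month outcome, driven mostly by the first two signals
logits = 1.2 * X[:, 0] + 0.8 * X[:, 1] + rng.normal(scale=0.5, size=n)
y = (logits > 0).astype(int)

# hold out 25% of "historical hires" for out-of-sample validation
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=0)
model = LogisticRegression().fit(X_train, y_train)

# the output is a probability per candidate, reported alongside human judgment
probs = model.predict_proba(X_test)[:, 1]
print(f"held-out AUC: {roc_auc_score(y_test, probs):.2f}")
```

The point is the shape of the workflow, not the model class: fit on one slice of history, measure discrimination on a slice the model never saw, and surface a probability rather than a yes/no verdict.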

Key Points: Predictive Hiring Analytics

  • Historical data is the input: Useful predictive models require 100+ historical hires per role family with consistent outcome data.
  • Multi-source signal integration: ATS data alone is insufficient; assessment, interview, and post-hire data combine to produce useful models.
  • Augments rather than replaces judgment: Best practice treats the model output as one input to a human decision, not the decision itself.
  • Bias-aware design is mandatory: Models trained on biased historical decisions will replicate and amplify those biases unless explicitly counteracted.
  • Validation is ongoing: Hiring patterns drift over time; models need quarterly recalibration to stay accurate.

How Predictive Hiring Analytics Works in Treegarden

Treegarden’s analytics module captures the structured data layer that makes predictive analytics possible: source attribution, time-in-stage, structured interview scores, offer-to-acceptance ratios, and integration hooks to HRIS for post-hire outcome data. Customers building predictive models typically extract the underlying data via the reporting API and combine it with their HRIS performance data in a separate analytics environment.
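That "extract and combine" step is usually a key-based join between the ATS export and HRIS outcomes. A minimal sketch with pandas, using hypothetical column names (`candidate_id`, `retained_12mo`, and so on) rather than Treegarden's actual API schema:

```python
import pandas as pd

# hypothetical ATS export (e.g. pulled via a reporting API) -- columns are illustrative
ats = pd.DataFrame({
    "candidate_id": [101, 102, 103, 104],
    "source": ["referral", "job_board", "agency", "referral"],
    "interview_score": [4.2, 3.1, 3.8, 4.6],
})

# hypothetical HRIS post-hire outcomes keyed on the same candidate id
hris = pd.DataFrame({
    "candidate_id": [101, 102, 104],
    "retained_12mo": [1, 0, 1],
    "performance_rating": [4, 2, 5],
})

# inner join keeps only hires with outcome data -- that subset is the modeling population
training = ats.merge(hris, on="candidate_id", how="inner")
print(training)
```

Candidate 103 drops out of the joined table because no outcome record exists, which is exactly the filtering a modeling dataset needs.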

Frequently Asked Questions About Predictive Hiring Analytics

How much historical data do you need?

Useful single-role-family models typically require 100-200 hires with consistent outcome data (performance ratings, retention beyond 12 months) for basic regression models. More sophisticated approaches (ensemble methods, neural networks) need 1,000+ examples to outperform simpler baselines. For low-volume specialized roles, predictive analytics rarely produces signal worth the implementation cost; high-volume operational hiring is where models pay back.

Is predictive hiring analytics legal?

Generally yes when designed thoughtfully, but increasingly scrutinised. The US EEOC has issued guidance on AI-based hiring decisions; New York City’s Local Law 144 requires bias audits of automated employment decision tools; and the EU AI Act categorises hiring AI as ‘high risk’ with documentation and audit requirements. Compliant deployment requires bias testing across protected demographics, transparent disclosure to candidates, and human override capability.
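Bias testing across demographics is often operationalised as an adverse-impact check on selection rates, comparing each group's rate to the highest-rate group (the "four-fifths rule" used in US enforcement practice flags ratios below 0.8). A minimal sketch with made-up counts:

```python
# hypothetical model-recommendation counts by demographic group (illustrative only)
recommended = {"group_a": 60, "group_b": 40}
applicants = {"group_a": 100, "group_b": 100}

rates = {g: recommended[g] / applicants[g] for g in applicants}
top = max(rates.values())

# impact ratio: each group's selection rate relative to the highest-rate group
for group, rate in sorted(rates.items()):
    ratio = rate / top
    flag = "FLAG" if ratio < 0.8 else "ok"
    print(f"{group}: selection rate {rate:.2f}, impact ratio {ratio:.2f} [{flag}]")
```

Here group_b's ratio is 0.40 / 0.60 ≈ 0.67, below the 0.8 threshold, so this model output would be flagged for investigation before deployment. Real bias audits (e.g. under Local Law 144) have specific methodology and reporting requirements beyond this arithmetic.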

What are the most common pitfalls?

  • Training on biased historical decisions: if past hires favored certain demographics, the model encodes those biases.
  • Outcome metric problems: using ‘retention’ alone as the outcome incentivises hiring conservative candidates likely to stay regardless of performance.
  • Model drift: hiring patterns shift over time; a 3-year-old model often performs worse than no model.
  • Treating the model as an oracle: good practice uses model output as one input, not the decision itself.
  • Not validating on held-out data: in-sample fit always looks good; only out-of-sample validation reveals real predictive power.
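The model-drift pitfall can be made concrete: fit a model on an older cohort, then score a recent cohort where the underlying relationship has weakened. All data below is synthetic; the drift is built in deliberately.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(1)

def cohort(n, weight):
    # hypothetical single predictor whose link to outcomes weakens over time
    x = rng.normal(size=(n, 1))
    y = (weight * x[:, 0] + rng.normal(scale=1.0, size=n) > 0).astype(int)
    return x, y

X_old, y_old = cohort(500, weight=2.0)   # training-era hires: strong signal
X_new, y_new = cohort(500, weight=0.3)   # recent hires: the pattern has drifted

model = LogisticRegression().fit(X_old, y_old)
auc_old = roc_auc_score(y_old, model.predict_proba(X_old)[:, 1])
auc_new = roc_auc_score(y_new, model.predict_proba(X_new)[:, 1])
print(f"training-era AUC {auc_old:.2f} vs recent-cohort AUC {auc_new:.2f}")
# a recalibration trigger might be: retrain when recent-cohort AUC drops by more than 0.05
```

This is the mechanism behind the quarterly-recalibration advice: monitoring only in-sample or training-era metrics hides exactly this kind of decay.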

How do you measure whether the model is working?

Track three signals over a 12-18 month measurement period: (1) quality-of-hire delta - average performance ratings of model-recommended hires vs control hires; (2) retention delta - 12-month retention rates of model-recommended vs control hires; (3) source-mix shift - changes in where successful hires come from after the model is deployed. Statistical significance requires reasonable sample sizes - typically 200+ hires per group over the measurement period.
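The retention-delta comparison is a textbook two-proportion z-test, which also shows why the 200-per-group sample size matters. The counts below are illustrative only, not benchmark figures:

```python
from math import sqrt, erf

def two_prop_z(success_a, n_a, success_b, n_b):
    """Two-proportion z-test: is the observed retention difference real or noise?"""
    p_a, p_b = success_a / n_a, success_b / n_b
    pooled = (success_a + success_b) / (n_a + n_b)
    se = sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_a - p_b) / se
    # two-sided p-value from the standard normal CDF
    p_value = 2 * (1 - 0.5 * (1 + erf(abs(z) / sqrt(2))))
    return p_a - p_b, z, p_value

# hypothetical 12-month retention: model-recommended vs control hires, 200 per group
delta, z, p = two_prop_z(success_a=170, n_a=200, success_b=150, n_b=200)
print(f"retention delta {delta:+.2%}, z = {z:.2f}, p = {p:.3f}")
```

With 200 hires per group, an 85% vs 75% retention split gives z = 2.5 and p ≈ 0.012 - detectable. Halve the group sizes and the same split would no longer clear conventional significance thresholds, which is the practical force of the sample-size requirement.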