Feature Guide Treegarden Team 22 April 2026 9 min read

How to Build a Candidate Database That Your Team Will Actually Use

Most candidate databases become graveyards of forgotten CVs. Here's how to build one that surfaces the right candidates at the right time.

How to Build a Candidate Database That Your Team Will Actually Use

Your best future hire might already be in your ATS—buried under three years of outdated data. In an era where 75% of hiring managers say time-to-hire has increased over the past five years (HR Dive, 2023), a disorganized candidate database becomes a bottleneck for growth. Candidate database software isn’t just about storage—it’s about strategic accessibility. Done right, it transforms passive talent into active opportunities. Let’s uncover how to build a candidate database that your HR team will use daily, not just during hiring crises.

Why Candidate Databases Fail

The average organization loses 30% of quality candidates from their talent pool within 12 months due to poor database management (LinkedIn Workforce Report, 2022). Why does this happen? Three key reasons emerge:

1. Data Decay: Unstructured entries, duplicate records, and outdated contact info render databases obsolete. If your team hasn’t reviewed profiles in six months, chances are 40% of data is inaccurate (Gartner, 2023).
2. Overloading with Irrelevant Data: Storing every CV without filtering creates noise. For example, keeping 1,000 candidates for a single role when only 15 are relevant wastes search time.
3. Legal Risks: Non-compliance with GDPR (UK) or FCRA (US) can lead to fines up to £17.5M (UK) or $43,782 per violation (US). Failing to anonymize sensitive data or retain records for required periods is a common pitfall.

The Hidden Cost of a Broken Database

For every hour spent rekeying duplicate candidate data, £26 is lost in UK SMEs (CIPD, 2023). Automated data hygiene tools reduce this by up to 70%.

5 Principles of a Useful Candidate Database

  1. Simplicity Over Complexity: Your database should allow searches by keyword, role type, and location in under 5 clicks. Treegarden’s Kanban pipeline lets users drag-and-drop candidates into hiring stages without IT support.
  2. Structured Data Capture: Define mandatory fields (e.g., skills, availability) while making optional fields (e.g., hobbies) accessible via expandable menus. This reduces clutter while preserving context.
  3. Compliance-First Design: Embed auto-expiration for candidate data after 24-36 months (UK GDPR Article 5(1)(d)) and automated opt-out workflows for FCRA (US).
  4. Active Engagement Triggers: Set up auto-nudges for candidates who haven’t been contacted in 6 months. Personalized emails increase re-engagement by 40% (Recruiter.com, 2023).
  5. Team-Wide Accessibility: Ensure hiring managers can view candidate profiles without requesting admin access. Treegarden’s role-based permissions allow this while maintaining data security.

What Data to Capture (And What Creates GDPR Risk)

Capture only what’s essential to your hiring process. The table below shows recommended data fields and regulatory considerations:

Data TypeRecommendedRegulatory Risks
Full Name & Contact InfoMust be anonymizable under GDPR Art. 17 (Right to Erasure)
Work History & SkillsNone if stored as general categories (e.g., "project management")
Ethnicity / Nationality✅ (optional)UK Equality Act 2010: Only collect if required for diversity reporting
Salary HistoryUK Equality Act 2010: May create indirect discrimination risks

GDPR & FCRA Alert

The UK Information Commissioner’s Office (ICO) and US FTC both penalize organizations storing excessive candidate data. Limit retention to 36 months unless the individual provides explicit consent for longer storage.

CV Parsing as the Entry Point to a Clean Database

Manual data entry takes 12 minutes per candidate, while AI parsing reduces this to 90 seconds (Treegarden performance benchmark). Bulk parsing of 50+ CVs at once is essential for volume hiring scenarios. Look for systems that:

  • Identify unstructured data patterns (e.g., "Sales Director, XYZ Co. (2018–2022)" becomes structured job history)
  • Auto-reject candidates who fail right-to-work checks in the UK or E-Verify in the US
  • Tag skills using industry taxonomies (e.g., mapping "digital marketing" to "SEO/SEM")

Treegarden’s Smart Parsing

Our parser extracts 92% of relevant data from PDF, DOCX, and scanned images—with 98% accuracy in UK postcode formatting to meet Right to Work check requirements.

Keyword and Boolean Search for Finding Past Candidates

Advanced search syntax can uncover hidden gems. For example:

"(software engineer OR developer) AND (Python OR Java) NOT (agency OR contractor)"

Treegarden’s Boolean search supports:

  • Nested queries: "(marketing AND (London OR Manchester)) AND NOT (recruiter OR agency)"
  • Skill mapping: Search for candidates with "AWS certification" while excluding those with "junior" in their title
  • Timeline filters: Find candidates active between 2019–2022 who may now be open to new roles

Organizations using Boolean search reduce time-to-hire by 22% compared to basic keyword searches (HR Tech Outlook, 2023).

GDPR and UK Data Retention Rules for Candidate Data

Under UK GDPR Article 5(1)(e), candidate data must be "kept in a form which permits identification of data subjects for no longer than is necessary." This means:

  • Default Retention: 24-36 months unless the candidate provides explicit consent for longer storage
  • Access Requests: 30 days to respond to Subject Access Requests under GDPR Article 12
  • Right to be Forgotten: Must delete data permanently upon request, including backups

In the US, FCRA requires notice before using background checks and opt-out rights for adverse actions. Treegarden automates these requirements with template notifications and audit trails.

How Treegarden’s Candidate DB Works

Unlike Greenhouse or iCIMS which require 6–8 weeks for setup, Treegarden’s candidate database is operational within 48 hours. Key differentiators include:

  • AI Screening: Scores candidates on 18+ factors including cultural fit and career progression
  • Compliance Automation: Auto-expires records after 36 months and generates EEO-1 report-ready data
  • Cost Transparency: No hidden fees for storage or API access—£100/month for unlimited candidates

No More Data Graveyards

Our system automatically archives inactive candidates after 18 months, with one-click reactivation for future roles.

Segmenting Your Candidate Database for Maximum Reuse

A candidate database without segmentation is a filing cabinet — you know things are in there, but finding the right one requires searching through everything. Effective segmentation transforms the database from storage into a strategic asset: when a role opens, the relevant talent pool segment can be identified and contacted within hours rather than days. The investment in segmentation pays back immediately on first use and compounds with every subsequent search.

The most useful segmentation dimensions for most organisations are: role function (engineering, sales, marketing, operations, HR), seniority level, location and work authorisation status, skills (both hard skills and critical soft skills), engagement status (active applicant, passive interested, passive cold, previously declined offer), and pipeline stage reached in previous applications (screened out at CV, interviewed, reached offer stage but declined). This last dimension is particularly valuable — candidates who reached offer stage in a previous search are pre-qualified, pre-warmed, and represent the highest probability re-engagement targets when a similar role opens.

Segmentation Priority Tags: "Silver medallist" — reached final round but not hired ✓ | "Strong keeper" — passed screening, not progressed due to role closure ✓ | "Future fit" — qualified for future role, no current opening ✓ | "Actively looking" — recently expressed interest in opportunities ✓ | These four tags alone will transform re-engagement targeting effectiveness.

Automated segmentation tools in modern ATS platforms can tag candidates based on application behaviour (how far they progressed, how quickly they responded, whether they withdrew) without manual effort. This creates segmentation data as a by-product of the hiring process rather than requiring additional HR data entry. For databases of 1,000+ candidates, automated tagging is the only scalable approach — manual tagging backlogs create incomplete segments that are worse than no segmentation at all.

Re-engaging and Reactivating Past Candidates

The most overlooked value in most candidate databases is the pool of previously assessed candidates who were not hired not because they were underqualified, but because of timing — the role was filled by someone slightly more experienced, the headcount was frozen, or the candidate accepted another offer before the process concluded. These individuals represent zero sourcing cost, reduced screening cost, and existing relationship warmth. Reactivating them when conditions change is the highest ROI activity in candidate database management.

A structured re-engagement cadence for high-value segments should run quarterly: a personalised outreach note from the recruiter or hiring manager acknowledging the previous interaction, referencing something specific about the candidate's background, and introducing a concrete next step (new role opening, updated job description, invitation to a company event or webinar). Generic blast emails to large database segments have low response rates and damage the relationship quality you invested in building. Personalised, contextual outreach to targeted segments consistently outperforms volume approaches by 3–5x on response rate.

GDPR in the UK and EU creates a specific compliance requirement for re-engagement: you must have a lawful basis for contacting a candidate who is not currently in an active hiring process. The most defensible lawful basis for talent pool outreach is legitimate interest, provided you conducted a legitimate interests assessment (LIA) documenting why the processing is necessary and proportionate. Alternatively, obtain explicit consent from candidates during their initial application for periodic contact about relevant opportunities, and honour opt-out requests promptly. Document your legal basis in your data processing register before running re-engagement campaigns.

Free Calculators for This Topic

Save time with these free HR calculators — no sign-up required:

Related Reading Helpful Calculators

Frequently Asked Questions

Why is my candidate database going unused?

If it takes more than 3 clicks to find a candidate, your team will avoid using it. Implement search filters and mobile access to boost adoption.

How often should I clean my candidate database?

Quarterly reviews are ideal. Delete duplicates, verify contacts, and remove records older than 36 months unless there’s a business reason to keep them.

Does Treegarden help with GDPR compliance?

Yes—we provide automated data anonymization, retention policies, and audit trails to meet both UK GDPR and US FCRA requirements.

How does Treegarden compare to Greenhouse?

We offer faster setup (48 hours vs. 6 weeks), lower pricing (50% less), and built-in HR modules (payroll, PTO) for SMBs.

Building a candidate database your team will use isn’t about collecting every scrap of information—it’s about creating a living ecosystem of quality talent. With Treegarden’s candidate database software, you gain compliance-ready tools, intelligent search capabilities, and a system designed for daily use. Start your free 14-day trial to transform your ATS from a data graveyard into a hiring engine.

This article was created with AI assistance. Content has been editorially reviewed by the Treegarden team.

See exactly what Treegarden costs

All features included. Unlimited jobs. Unlimited users. No demo required to see the price. Startup: $299/mo · Growth: $499/mo · Scale: $899/mo.

View transparent pricing →

Start Using Treegarden Today

Simplify hiring and manage your team with one ATS + HR platform powered by AI. No long contracts, no hidden fees.