The problem: CVs from all directions, endless manual input
Recruiters receive resumes through multiple channels: email attachments, contact forms, internal references, career fairs (often on USB stick or printed), recruiting agencies, and LinkedIn messages. Each CV must arrive in ATS as a candidate profile - with name, email, phone, skills and experience correctly extracted.
Traditionally, this process looks like this: you open the CV, read the contact data, enter them manually in the ATS form, attach the document, move on to the next one. An experienced recruiter can manually process approx8-10 CVs per hour. After a career fair where you collected 50 CVs, we are talking about5-6 hours of purely administrative work- an entire working day lost on data entry.
This repetitive work has other negative consequences:
- Input errors- a wrong email, an incomplete phone number, an incorrectly written name
- Delays in contact- candidates wait days or weeks until they are entered into the system
- Demotivarea echipei- recruiters should recruit, not do data entry
- Loss of candidates- a CV that sits on the desktop for 2 weeks represents a candidate who probably accepted another offer
Treegarden rezolva aceasta problema prin Bulk CV Upload- a functionality that allows the automatic loading and processing of up to 50 CVs simultaneously, with intelligent data extraction from each document.
Technical specifications: what Bulk CV Upload supports
Before going into details about the workflow, here are the technical specifications of the functionality:
Upload parameters
Maximum 50 files per upload. Maximum 20 MB per tab. Supported formats: PDF, DOC and DOCX. Files are processed sequentially, with real-time feedback for each document. ZIP files are not accepted for security reasons - each CV must be uploaded individually as a document.
Why these limits?The limit of 50 files ensures that processing remains fast and reliable. 20 MB per file covers any standard CV (a typical CV is 100-500 KB; even those with images and complex formatting rarely exceed 5 MB). The PDF format is the most common for CVs, and DOC/DOCX covers documents created in Microsoft Word.
Why are ZIP files not supported?This decision was taken for security reasons. ZIP files can contain executables, malicious scripts or files with manipulated paths that pose security risks. By only accepting individual documents (PDF, DOC, DOCX), we completely eliminate these attack vectors.
Data extraction: the 3-tier system
The heart of the Bulk CV Upload functionality isdata extraction engine. Treegarden uses a 3-tier extraction system to maximize the success rate in CV parsing:
Nivelul 1: Parser PHP nativ
The first level uses native PHP libraries to extract text from documents. This is the fastest and most reliable tier - works great for text-based PDFs and standard Word documents. Extract name, email, phone number, education and professional experience using intelligent pattern matching.
Nivelul 2: Instrumente command-line
If the native PHP parser fails to extract text (for example, for scanned PDFs or documents with unusual formatting), the system automatically switches to specialized command-line tools. These tools have extensive text extraction capabilities, including from PDFs that contain images with embedded text.
Level 3: Fallback with default values
Even if no level of parsing succeeds in extracting the data (corrupted document, pure image, unrecognized format),The CV is still loaded in the system. The candidate's profile is created with default values (the name derived from the file name, the other fields marked for manual filling), and the original document is attached. Thus, no CV is lost - the recruiter can open the document and manually fill in the information that was not automatically extracted.
Niciun CV pierdut
The design philosophy of Bulk CV Upload is "no CV lost". Even if the parsing fails completely, the document is loaded and the profile is created. It is always better to have a partial profile (with attached CV that you can read) than to lose a complete CV. This approach guarantees that, regardless of the quality of the documents received, they all end up in the database.
The complete workflow: from files to profiles
Let's go through step by step what the Bulk CV Upload process looks like in practice. Recruiter Ana participated in a career fair and collected 35 CVs in PDF format.
Pasul 1: Selectarea fisierelor
Ana opens the Candidates module in Treegarden and clicks the "Bulk CV Upload" button. A file selection window opens where you can select all 35 PDFs at once (or you can drag and drop them). The system instantly checks that each file respects the limits (under 20 MB, supported format) and displays a summary: "35 selected files, ready to process".
Pasul 2: Procesarea automata
Ana clicks "Load and process". Each CV is processed sequentially, with a visible progress bar. For each document, the system:
- Upload the file to the server
- Extract the text using the 3-level system
- Identify and parse: full name, email address, phone number, skills, professional experience and education
- Check if there is already a candidate with the same email or phone number (duplicate detection)
- Create the candidate's profile with the extracted data
Step 3: Review the extracted data
After processing, Ana sees a list with all 35 created profiles. Each profile shows the automatically extracted data, highlighting the fields that have been successfully completed and those that require verification. Ana can edit any field directly from this view, without opening each individual profile.
Step 4: Confirmation and integration
After the review, the profiles are confirmed and integrated into the candidate database. From here, they can be associated with open jobs, their AI Match Scores can be calculated, and they can be managed like any other candidate in the pipeline.
Sfat practic
After a career fair or recruitment event, the process resumes the same day. The more time passes, the more difficult the candidates become to contact (they have also applied in other places). With Bulk CV Upload, 50 CVs processed in 10 minutes means you can start contacting candidates on the evening of the event itself.
Duplicate detection: no double entries in the database
A common problem in managing candidate databases isduplicarea intrarilor. The same candidate applies on LinkedIn and on BestJobs, sends the CV and by reference - and ends up in the database 3 times with slightly different information.
Treegarden rezolva aceasta problema prin automatic detection of duplicatesduring Bulk CV Upload. The system checks each processed CV against the existing database, using several criteria:
- Email identic- the most reliable duplicate indicator
- Same phone number- the second matching criterion
- Combinatie nume + email partial- for cases where the email differs slightly ([email protected] vs [email protected])
When a duplicate is detected, the recruiter receives a notification with options: (1) ignore the new CV and keep the existing profile, (2) update the existing profile with the information from the new CV, or (3) create a new separate profile. This flexibility ensures that the recruiter has complete control, but does not miss any duplicates.
Validarea inteligenta a numelor
Extracting names from CVs is surprisingly complex. Names can contain special characters, diacritics, hyphens, apostrophes and various cultural formats. A simple parser could misinterpret "Maria-Elena O'Brien" or "Jose Luis Garcia Fernandez".
Treegarden's parsing engine supports:
- Caractere UTF-8 complete - diacritice romanesti (a, i, s, t), caractere internationale (e, u, o, n)
- Cratime in nume - Maria-Elena, Jean-Pierre, Ana-Maria
- Apostrofuri - O'Brien, D'Angelo, McDonald's
- Nume compuse - Jose Luis, Ana Maria, Ion Cristian
- Prefixes and suffixes - Dr., Ing., Jr., III
Cultural diversity in name parsing
In a globalized labor market, Romanian companies receive CVs from candidates from various cultures. Treegarden correctly recognizes the Romanian, Western European, Anglo-Saxon, Hispanic and Eastern European name formats. You will no longer see "O" and "Brien" as two separate entries, or "Jean" and "Pierre" as two different people.
Security: database protection
Uploading files to a system is always a potential attack vector. Treegarden implements multiple security measures to protect both the platform and candidate data:
File type validation
The system checks not only the file extension, but also the actual content (MIME type). An executable file renamed as "CV.pdf" is automatically detected and rejected. Only real documents (PDF, DOC, DOCX) are accepted.
Eliminarea fisierelor ZIP
Archive files (ZIP, RAR, 7Z) are completely prohibited. This decision eliminates the risks of: zip bombs (compressed files that expand to enormous sizes), path traversal attacks (files with manipulated paths that try to write in unauthorized locations), and executables hidden in archives.
Protectie path traversal
File names are automatically sanitized - special characters, "../" sequences and other attempts to manipulate file paths are removed. Each file is stored in a controlled location, without the possibility of accessing other directories of the server.
Processing timeout
Each file has a processing timeout of 30 seconds. If a document cannot be processed within this interval (possibly corrupted or malformed), processing is stopped and the file is marked for manual review. This measure prevents denial-of-service attacks through files that consume excessive resources.
GDPR compliance
CVs uploaded via Bulk Upload are treated as personal data according to GDPR. The recruiter who uploads the documents confirms that he has the legal basis for data processing (consent, legitimate interest). The data is stored encrypted, with controlled access based on roles, and can be completely deleted at the candidate's request (the right to deletion).
When to use Bulk CV Upload: practical scenarios
Bulk CV Upload isn't just for career fairs. Here are the most common scenarios in which this functionality makes a difference:
After a career fair or recruitment event
Collect dozens of printed or digital CVs. Instead of entering them manually in the following days, they upload them all in 10 minutes and start contacting candidates the same day.
When you receive CVs by email from an agency
Recruitment agencies often send batches of CVs as email attachments. Download them all and upload them in bulk in Treegarden - the profiles are automatically created and ready for evaluation.
Migration from another system
If you switch from another ATS (or from Excel/physical files) to Treegarden, Bulk CV Upload is the fastest way to import existing candidates. Export the CVs from the old system and upload them in batches of 50.
Referinte interne multiple
When you launch a referral program and employees send multiple resumes at once, Bulk Upload processes them all quickly, with duplicate detection included.
International or seasonal recruitment
Seasonal or international recruitment campaigns generate large volumes of CVs in a short time. Bulk Upload allows fast processing without creating an administrative bottleneck.
Bulk CV Upload is one of the functionalities that differentiates Treegarden from most competitors. While other ATS systems require the manual entry of each candidate or only offer CSV import (which requires prior data extraction), Treegarden processes the original documents directly, with automatic extraction on 3 levels.
The result: entire hours of administrative work eliminated, a database of candidates that grows quickly and correctly, and recruiters who can devote their time to the activities that really matter - evaluating and contacting top talent.