Building a Resume Screening Agent: Automated Candidate Evaluation and Shortlisting

The Resume Screening Bottleneck

A single job posting can attract hundreds of applications. Recruiters spend an average of 7 seconds per resume on initial screening — a pace that guarantees missed talent and inconsistent evaluation. An AI resume screening agent applies the same criteria to every candidate, evaluates skill matches systematically, and surfaces the strongest applicants while flagging potential bias in the process.

The critical responsibility here is fairness. An automated screening system that perpetuates bias causes more harm than a manual process because it does so at scale. This guide builds bias mitigation directly into the architecture.

Resume Parsing and Structured Extraction

The first step is converting unstructured resume text into a structured format the agent can reason about.

flowchart TD
    START["Building a Resume Screening Agent: Automated Cand…"] --> A
    A["The Resume Screening Bottleneck"]
    A --> B
    B["Resume Parsing and Structured Extraction"]
    B --> C
    C["Candidate Scoring Engine"]
    C --> D
    D["Bias Mitigation Tools"]
    D --> E
    E["FAQ"]
    E --> DONE["Key Takeaways"]
    style START fill:#4f46e5,stroke:#4338ca,color:#fff
    style DONE fill:#059669,stroke:#047857,color:#fff

from dataclasses import dataclass, field
from typing import Optional
from agents import Agent, Runner, function_tool
import json
import re

@dataclass
class ParsedResume:
    candidate_id: str
    name: str
    email: str
    skills: list[str]
    experience_entries: list[dict]  # role, company, duration_months, description
    education: list[dict]  # degree, institution, year
    certifications: list[str]
    total_experience_years: float

@dataclass
class JobCriteria:
    job_id: str
    required_skills: list[str]
    preferred_skills: list[str]
    min_experience_years: int
    required_education: str  # "bachelor", "master", "none"
    required_certifications: list[str]
    weight_skills: float = 0.4
    weight_experience: float = 0.3
    weight_education: float = 0.15
    weight_certifications: float = 0.15

PARSED_RESUMES: dict[str, ParsedResume] = {}
JOB_CRITERIA_DB: dict[str, JobCriteria] = {}

Candidate Scoring Engine

The scoring tool evaluates each candidate against explicit, weighted criteria. Each dimension produces a normalized score between 0 and 1.

See AI Voice Agents Handle Real Calls

Book a free demo or calculate how much you can save with AI voice automation.

Book a Demo ROI Calculator

def _calculate_skill_score(
    candidate_skills: list[str],
    required: list[str],
    preferred: list[str],
) -> tuple[float, list[str], list[str]]:
    """Score skill match and return matched/missing skills."""
    candidate_lower = {s.lower() for s in candidate_skills}
    required_lower = {s.lower() for s in required}
    preferred_lower = {s.lower() for s in preferred}

    required_matches = candidate_lower & required_lower
    preferred_matches = candidate_lower & preferred_lower
    missing_required = required_lower - candidate_lower

    if not required_lower:
        score = 1.0
    else:
        required_ratio = len(required_matches) / len(required_lower)
        preferred_bonus = (
            len(preferred_matches) / len(preferred_lower) * 0.2
            if preferred_lower else 0
        )
        score = min(required_ratio + preferred_bonus, 1.0)

    return score, list(required_matches | preferred_matches), list(missing_required)

@function_tool
def score_candidate(candidate_id: str, job_id: str) -> str:
    """Score a candidate against job criteria with detailed breakdown."""
    resume = PARSED_RESUMES.get(candidate_id)
    criteria = JOB_CRITERIA_DB.get(job_id)

    if not resume:
        return json.dumps({"error": "Candidate resume not found"})
    if not criteria:
        return json.dumps({"error": "Job criteria not found"})

    # Skill scoring
    skill_score, matched_skills, missing = _calculate_skill_score(
        resume.skills, criteria.required_skills, criteria.preferred_skills
    )

    # Experience scoring
    exp_ratio = resume.total_experience_years / max(criteria.min_experience_years, 1)
    experience_score = min(exp_ratio, 1.0)

    # Education scoring
    edu_levels = {"none": 0, "associate": 1, "bachelor": 2, "master": 3, "phd": 4}
    candidate_edu = max(
        (edu_levels.get(e.get("degree", "").lower(), 0) for e in resume.education),
        default=0,
    )
    required_edu = edu_levels.get(criteria.required_education.lower(), 0)
    education_score = 1.0 if candidate_edu >= required_edu else 0.5

    # Certification scoring
    if criteria.required_certifications:
        cert_lower = {c.lower() for c in resume.certifications}
        req_cert_lower = {c.lower() for c in criteria.required_certifications}
        cert_score = len(cert_lower & req_cert_lower) / len(req_cert_lower)
    else:
        cert_score = 1.0

    # Weighted total
    total = (
        skill_score * criteria.weight_skills
        + experience_score * criteria.weight_experience
        + education_score * criteria.weight_education
        + cert_score * criteria.weight_certifications
    )

    return json.dumps({
        "candidate_id": candidate_id,
        "overall_score": round(total * 100),
        "breakdown": {
            "skills": {"score": round(skill_score * 100), "matched": matched_skills, "missing": missing},
            "experience": {"score": round(experience_score * 100), "years": resume.total_experience_years},
            "education": {"score": round(education_score * 100)},
            "certifications": {"score": round(cert_score * 100)},
        },
        "recommendation": "advance" if total >= 0.7 else "review" if total >= 0.5 else "decline",
    })

Bias Mitigation Tools

Bias mitigation is not an afterthought — it is a core system requirement.

@function_tool
def run_bias_audit(job_id: str, scored_candidates: str) -> str:
    """Audit a batch of scored candidates for potential bias indicators."""
    candidates = json.loads(scored_candidates)

    audit_checks = {
        "criteria_objectivity": True,
        "name_blind_scoring": True,
        "education_prestige_excluded": True,
        "gap_penalty_removed": True,
    }

    criteria = JOB_CRITERIA_DB.get(job_id)
    if criteria:
        subjective_terms = {"culture fit", "communication style", "personality"}
        all_skills = set(s.lower() for s in criteria.required_skills + criteria.preferred_skills)
        if all_skills & subjective_terms:
            audit_checks["criteria_objectivity"] = False

    flagged = [c for c in audit_checks if not audit_checks[c]]
    return json.dumps({
        "audit_passed": len(flagged) == 0,
        "checks": audit_checks,
        "flagged_issues": flagged,
        "recommendation": "Review flagged criteria before finalizing shortlist"
                          if flagged else "No bias indicators detected",
    })

screening_agent = Agent(
    name="ScreenBot",
    instructions="""You are ScreenBot, a resume screening assistant.
Evaluate candidates strictly against stated job criteria.
Never factor in candidate names, personal demographics, or school prestige.
Always run a bias audit before finalizing any shortlist.
Present results as scored rankings with clear justification for each score.""",
    tools=[score_candidate, run_bias_audit],
)

FAQ

How do you handle candidates who have relevant experience but use different terminology?

Implement a skills synonym mapping that normalizes variations. For example, "React.js", "ReactJS", and "React" should all map to the same skill. The skill matching function should compare against normalized forms rather than raw strings.

What legal considerations apply to automated resume screening?

Several jurisdictions require disclosure when AI is used in hiring decisions. New York City's Local Law 144, for instance, mandates annual bias audits for automated employment decision tools. Always consult legal counsel, provide candidate opt-out options, and maintain human oversight for final hiring decisions.

Should the agent completely replace human recruiters?

No. The agent should shortlist and rank candidates, but a human recruiter should review the shortlist before candidates are advanced or rejected. The agent accelerates the process and improves consistency, but human judgment remains essential for nuanced evaluation of career narratives and potential.

#ResumeScreening #CandidateEvaluation #HiringAutomation #BiasMitigation #AgenticAI #LearnAI #AIEngineering

Building a Resume Screening Agent: Automated Candidate Evaluation and Shortlisting

The Resume Screening Bottleneck

Resume Parsing and Structured Extraction

Candidate Scoring Engine

Bias Mitigation Tools

FAQ

How do you handle candidates who have relevant experience but use different terminology?

What legal considerations apply to automated resume screening?

Should the agent completely replace human recruiters?

Try CallSphere AI Voice Agents

Related Articles

Building an AI Agent with Tool-Use Chains: Sequential Tool Orchestration for Complex Tasks

WebArena and Real-World Web Agent Benchmarks: How We Measure Browser Agent Performance

Building a Hypothesis-Testing Agent: Scientific Method Applied to Data Analysis