HR Knowledge Base Evaluation

Quality measurement of the HR policy RAG pipeline using RAGAS metrics. Evaluates semantic chunking, parent-document retrieval, and multi-country policy responses.

Last evaluated: April 14, 202615 test cases • Meridian Technologies HR Knowledge Base

1

Document RAG Evaluation

Product Demo

Structured Data RAG

  • Fixed-size product records (~100-200 tokens)
  • No chunking needed
  • Product comparison & cost calculation tools
  • Single-country catalog
HR Demo

Document RAG

  • Semantically chunked policy documents (variable size)
  • Parent-document retrieval (get_full_section)
  • Policy comparison & eligibility checking tools
  • Multi-country policies (US, UK, Germany)

Same evaluation framework: Both demos use the identical RAGAS pipeline — same 4 metrics, same Python scorer, same Claude Haiku evaluator. This enables apples-to-apples comparison across different RAG data models.

2

Overall Scores

100%

Routing

Query classification accuracy

79%

Faithfulness

Answers grounded in retrieved policies

58%

Relevancy

Answers address the actual question

56%

Ctx Precision

Retrieved chunks are relevant & well-ranked

75%

Ctx Recall

Retrieved chunks cover the ground truth

3

Scores by Query Type

Simple Lookup

4 queries
Faith
94%
Relev
65%
CPrec
75%
CRec
92%

🧠Advisory

3 queries
Faith
69%
Relev
51%
CPrec
90%
CRec
77%

⚖️Comparison

2 queries
Faith
82%
Relev
58%
CPrec
0%
CRec
100%

📋Procedural

2 queries
Faith
84%
Relev
79%
CPrec
50%
CRec
75%

🚫Off-Topic

2 queries
Faith
100%
Relev
100%
CPrec
100%
CRec
100%

🧪Edge Case

2 queries
Faith
56%
Relev
31%
CPrec
30%
CRec
17%
4

Individual Test Results

01How many PTO days do new employees get?Agentic84%
100%
Faithfulness
70%
Relevancy
100%
Ctx Precision
67%
Ctx Recall

Retrieved Policy Chunks (15)

pto-us-000pto-de-001pto-us-001pto-uk-000pto-uk-002pto-de-000pto-uk-001jury-duty-us-001family-medical-leave-us-001bereavement-global-001family-medical-leave-de-000sick-leave-us-002+3 more

Generated Answer

New full-time employees at Meridian Technologies receive 15 business days of PTO annually. Employees with 0-2 years of tenure accrue 15 business days per year, which equals 1.25 days per month. PTO eligibility begins on the first day of employment and accrues monthly in equal increments. Referenced Policies: Paid Time Off (PTO) (US): Contains the complete PTO accrual schedule for new employees I...

Latency: 10485ms

02What is the 401k match?Agentic85%
85%
Faithfulness
55%
Relevancy
100%
Ctx Precision
100%
Ctx Recall

Retrieved Policy Chunks (15)

retirement-us-000retirement-us-001retirement-uk-000retirement-us-002retirement-de-000retirement-de-002retirement-uk-001hsa-fsa-de-001retirement-de-001hsa-fsa-de-002life-insurance-global-001health-insurance-us-000+3 more

Generated Answer

Meridian Technologies provides a 401(k) match equal to 6% of your eligible compensation when you contribute to your 401(k) account. To receive the full company match, you must contribute at least 6% of your gross salary to the plan. If you contribute less than 6%, you'll receive a proportional match. The company match is calculated on a pay period basis and deposited quarterly. Referenced Policie...

Latency: 11257ms

03What's the password policy?Agentic91%
100%
Faithfulness
63%
Relevancy
100%
Ctx Precision
100%
Ctx Recall

Retrieved Policy Chunks (15)

password-policy-global-000password-policy-global-001password-policy-global-002byod-global-001data-classification-us-001data-classification-de-001acceptable-use-global-001remote-access-global-000data-classification-uk-000data-classification-de-000remote-access-global-001data-classification-us-000+3 more

Generated Answer

Meridian Technologies requires all employees to use passwords with a minimum of 14 characters that include at least one uppercase letter, lowercase letter, number, and special character. Passwords must be changed every 90 days, cannot contain your username or dictionary words, and cannot reuse any of your previous 12 passwords. Additionally, multi-factor authentication (MFA) must be enabled within...

Latency: 45220ms

04What is the probationary period for new hires?Agentic66%
90%
Faithfulness
73%
Relevancy
0%
Ctx Precision
100%
Ctx Recall

Retrieved Policy Chunks (15)

probation-us-000probation-de-000probation-uk-000probation-de-001probation-us-001probation-us-002probation-uk-001at-will-termination-de-000at-will-termination-uk-000grievance-uk-000first-day-global-001at-will-termination-us-002+3 more

Generated Answer

All new employees at Meridian Technologies are subject to a probationary period of 90 calendar days from their start date, applying to all full-time and part-time employees regardless of position level or department. During this period, HR facilitates check-in meetings at the 30-day and 60-day marks, and at the conclusion, the manager conducts a formal performance review to determine transition to...

Latency: 44430ms

05What benefits am I eligible for as a part-time employee?Agentic63%
57%
Faithfulness
41%
Relevancy
100%
Ctx Precision
56%
Ctx Recall

Retrieved Policy Chunks (25)

eap-global-000eap-global-001sick-leave-us-000jury-duty-de-000training-global-000sick-leave-de-000overtime-uk-000grievance-us-000sick-leave-uk-000work-hours-uk-000pay-schedule-us-000goal-setting-global-000+13 more

Generated Answer

I found 25 relevant policy sections. Here are the most relevant results for your query. Referenced Policies: Employee Assistance Program (Global): Summary: All Meridian Technologies employees globally, including full-time, part-time, and contracto; Employee Assistance Program (Global): Summary: All global Meridian employees can access eight confidential EAP counseling sessions annuall; Sick Leave...

Latency: 47896ms

06I need to take extended leave for a family medical issue. What are my options?Agentic76%
78%
Faithfulness
58%
Relevancy
93%
Ctx Precision
75%
Ctx Recall

Retrieved Policy Chunks (27)

family-medical-leave-us-000family-medical-leave-us-001sick-leave-us-002family-medical-leave-us-002family-medical-leave-de-001parental-leave-us-000pto-us-000family-medical-leave-uk-002family-medical-leave-de-000family-medical-leave-uk-000sick-leave-us-000family-medical-leave-de-002+15 more

Generated Answer

I found 27 relevant policy sections. Here are the most relevant results for your query. Referenced Policies: Family & Medical Leave (US): Summary: US employees working at locations with 50+ employees within 75 miles are eligible for up to; Family & Medical Leave (US): Summary: US employees must provide 30 days' advance notice for foreseeable FMLA leave and maintain h; Sick Leave (US): Summary: US...

Latency: 47788ms

07What should I know as a new remote employee?Agentic76%
71%
Faithfulness
53%
Relevancy
78%
Ctx Precision
100%
Ctx Recall

Retrieved Policy Chunks (29)

eligibility-global-000eligibility-global-001eligibility-global-002equipment-global-001equipment-global-000home-office-global-002work-hours-us-000first-day-global-001work-hours-uk-002final-pay-uk-000first-day-global-000probation-de-000+17 more

Generated Answer

I found 29 relevant policy sections. Here are the most relevant results for your query. Referenced Policies: Remote Work Eligibility (Global): Summary: Global full-time employees at Meridian Technologies must complete their six-month probation; Remote Work Eligibility (Global): Summary: Global full-time employees must submit remote work requests to their manager at least four ; Remote Work Eligib...

Latency: 67090ms

08What's the difference between FMLA and parental leave?Agentic55%
75%
Faithfulness
46%
Relevancy
0%
Ctx Precision
100%
Ctx Recall

Retrieved Policy Chunks (19)

family-medical-leave-us-000family-medical-leave-us-002parental-leave-us-000family-medical-leave-us-001probation-us-001health-insurance-us-002sick-leave-us-002at-will-termination-us-001grievance-us-000pto-us-000parental-leave-uk-001family-medical-leave-uk-001+7 more

Generated Answer

I found 19 relevant policy sections. Here are the most relevant results for your query. Referenced Policies: Family & Medical Leave (US): Summary: US employees working at locations with 50+ employees within 75 miles are eligible for up to; Family & Medical Leave (US): Summary: US employees may take Family and Medical Leave under FMLA protections with zero retaliation; Parental Leave (US): Summary...

Latency: 54691ms

09Compare PTO policies across US, UK, and GermanyAgentic65%
89%
Faithfulness
70%
Relevancy
0%
Ctx Precision
100%
Ctx Recall

Retrieved Policy Chunks (7)

pto-uk-000pto-uk-001pto-uk-002pto-de-000pto-de-001pto-us-000pto-us-001

Generated Answer

Meridian Technologies' PTO policies vary significantly by country: US employees receive 15-25 days annually based on tenure (0-2 years: 15 days, 3-5 years: 18 days, 6-10 years: 20 days, 11+ years: 25 days), UK employees receive 33 days annually (28 statutory + 5 company days), and German employees receive 35 days annually (30 statutory + 5 company days). All countries allow maximum 5-day carryover...

Latency: 33549ms

10How do I report a workplace safety concern?Agentic87%
100%
Faithfulness
74%
Relevancy
100%
Ctx Precision
75%
Ctx Recall

Retrieved Policy Chunks (12)

general-safety-global-000general-safety-global-002reporting-injuries-global-001reporting-injuries-global-000general-safety-global-001reporting-injuries-global-002ergonomics-global-002emergency-procedures-global-001emergency-procedures-global-002emergency-procedures-global-000ergonomics-global-000ergonomics-global-001

Generated Answer

You can report workplace safety concerns in two ways: (1) Submit hazard reports to your direct manager, or (2) Email the Health and Safety Team at safety@meridiantech.com. Reports will be investigated within five business days, and you'll be notified of findings and corrective actions. Meridian prohibits retaliation against employees who report safety concerns in good faith. Referenced Policies: ...

Latency: 30913ms

11What's the process for submitting an expense report?Agentic57%
69%
Faithfulness
84%
Relevancy
0%
Ctx Precision
75%
Ctx Recall

Retrieved Policy Chunks (15)

expense-reimbursement-global-002expense-reimbursement-global-000expense-reimbursement-global-001tuition-reimbursement-global-001pay-schedule-uk-001pay-schedule-de-001pay-schedule-us-001pay-schedule-uk-002pay-schedule-de-000tuition-reimbursement-global-002pay-schedule-uk-000pay-schedule-us-000+3 more

Generated Answer

To submit an expense report, use Meridian's expense management system (preferred) or submit to your manager with documentation. You must obtain manager approval before incurring expenses (except meal per diem), submit within 30 days of the expense date, and provide itemized receipts for expenses over $75. Finance will process approved requests within 15 business days via direct deposit. Reference...

Latency: 41917ms

12What's the best pizza near the office?Off-Topic100%
100%
Faithfulness
100%
Relevancy
100%
Ctx Precision
100%
Ctx Recall

Generated Answer

This question is not related to HR policies.

Latency: 0ms

13How do I change my car's oil?Off-Topic100%
100%
Faithfulness
100%
Relevancy
100%
Ctx Precision
100%
Ctx Recall

Generated Answer

This question is not related to HR policies.

Latency: 0ms

14Can I bring my dog to work?Agentic25%
56%
Faithfulness
0%
Relevancy
9%
Ctx Precision
33%
Ctx Recall

Retrieved Policy Chunks (40)

work-hours-us-000eligibility-global-002general-safety-global-002work-hours-uk-001general-safety-global-001work-hours-uk-000work-hours-de-001work-hours-uk-002work-hours-us-001hsa-fsa-uk-001work-hours-de-002home-office-global-002+28 more

Generated Answer

I was unable to find any specific policy about bringing pets or dogs to work in Meridian Technologies' HR knowledge base. The company's policies do not appear to explicitly address pets in the workplace. Important Note: Since there is no specific pet policy documented, you should contact your local HR department to inquire about this topic directly. They can provide guidance on whether pets are p...

Latency: 73963ms

15I'm a contractor in Germany, what leave do I get?Agentic42%
57%
Faithfulness
63%
Relevancy
50%
Ctx Precision
0%
Ctx Recall

Retrieved Policy Chunks (14)

sick-leave-de-000sick-leave-de-002sick-leave-de-001family-medical-leave-de-001jury-duty-de-000jury-duty-de-002family-medical-leave-de-002bereavement-global-001jury-duty-de-001bereavement-global-002parental-leave-de-000parental-leave-de-001+2 more

Generated Answer

I found 14 relevant policy sections. Here are the most relevant results for your query. Referenced Policies: Sick Leave (DE): Summary: German employees at Meridian's Munich office are entitled to paid sick leave under the Wage; Sick Leave (DE): Summary: Germany requires all employees to maintain confidentiality obligations during sick leave, a; Sick Leave (DE): Summary: Germany provides all emplo...

Latency: 48350ms

5

Methodology

01 Faithfulness

Extracts atomic factual claims from the generated answer, then verifies each claim against the retrieved policy chunks. A claim must be directly supported — not merely plausible. Score = supported claims / total claims.

02 Answer Relevancy

Generates hypothetical questions the answer would address, embeds them alongside the original query, and computes average cosine similarity. High similarity = answer stays on-topic.

03 Context Precision

For each retrieved chunk (in search rank order), judges relevance to the query. Computes precision at each rank position, rewarding relevant chunks appearing at the top. Measures semantic chunking + retrieval ranking quality.

04 Context Recall

Extracts factual claims from the ground truth answer, then checks whether the retrieved chunks contain supporting information. Measures how completely the retrieval covers the ideal answer.

Evaluation Pipeline

15 test casesSonnet agentSearch + Retrieve chunksParent-doc retrievalRAGAS scores × 4 metricsScores
6

Retrieval Ablation Study

Head-to-head comparison of 5 retrieval strategies on 13 labeled HR queries using standard IR metrics. Unlike RAGAS (end-to-end), these isolate pure retrieval quality from generation. The production mode (Hybrid + Semantic Reranking) is highlighted — this ablation shows how each retrieval component contributes to finding the right policy chunks.

Two granularities: Each HR policy is split into 2-4 chunks of ~1,500 characters. Chunk-level metrics check exact ID matches; the subsection-level variant credits retrieval for finding any chunk in a ground-truth policy, which matches how the agent actually consumes results.

Chunk-level (exact chunk_id match)

Retrieval ModeR@5R@10P@5P@10MRRNDCG@10Latency
BM25 Only

Pure keyword search using BM25 scoring. No vectors, no reranking.

62%72%37%23%73%62%98ms
Vector Only

Pure vector search using cosine similarity. No keyword matching.

61%67%39%22%65%60%132ms
Hybrid (no rerank)

BM25 + vector fusion. No semantic reranking layer.

57%67%35%22%66%61%141ms
Hybrid + Semantic RerankingProduction

Production: BM25 + vector + Microsoft semantic reranker.

58%76%35%25%65%63%132ms
Hybrid + Semantic + Field BoostingEnhanced

Production + scoring profile: subsection_title×4, topic×2, section_title×1.5 weights.

55%74%34%24%65%61%134ms
Hybrid + Semantic + Query Rewriter

Production + Claude Haiku extracts country filters (fan-out for comparison queries).

62%76%37%25%71%68%153ms
Hybrid + Semantic + Parent-Doc (MMR)

Production + over-fetch 40, dedupe by subsection, keep top 10 distinct policies.

25%34%17%10%57%33%146ms

Why these look low: the ground truth uses specific chunk IDs (e.g., pto-us-000), but retrieval often surfaces neighboring chunks from the same policy (e.g., pto-us-002). Those are still functionally correct — the subsection-level table below credits them.

Subsection-level (chunk counts if its policy/subsection is in ground truth)

Retrieval ModeR@5R@10P@5P@10MRRNDCG@10
BM25 Only
67%73%52%35%81%43%
Vector Only
72%74%52%34%89%45%
Hybrid (no rerank)
64%74%48%34%80%43%
Hybrid + Semantic RerankingProduction
74%89%55%40%79%47%
Hybrid + Semantic + Field BoostingEnhanced
72%86%54%39%79%47%
Hybrid + Semantic + Query Rewriter
80%89%62%40%83%50%
Hybrid + Semantic + Parent-Doc (MMR)
80%95%32%19%80%28%

Interpretation: at the subsection level, the production pipeline lands the correct policy in the top-5 for most queries. The Query Rewriter (country filters + fan-out) raises precision and NDCG, and Parent-Doc (over-fetch then dedupe by subsection) pushes recall to its highest values by diversifying top-K across distinct policies — which especially helps multi-topic queries like “what should I know as a new remote employee?”. Parent-doc trades chunk-level P@K and NDCG for that diversity, so the rewriter wins on position-sensitive metrics while parent-doc wins on coverage.

Per-Query Results

01How many PTO days do new employees get?simple lookup3 GT
Ground Truth: pto-us-000, pto-us-001, pto-us-002
BM25 Only
chunk:R@5=67%MRR=1.00NDCG=70%|sub:R@5=100%MRR=1.00NDCG=33%794ms
1. pto-us-0002. pto-uk-0023. pto-us-0014. pto-de-0005. first-day-global-0016. pto-de-0017. first-day-global-0008. pto-uk-0009. pto-uk-00110. probation-us-000
Vector Only
chunk:R@5=67%MRR=1.00NDCG=77%|sub:R@5=100%MRR=1.00NDCG=36%1153ms
1. pto-us-0002. pto-us-0013. pto-de-0014. pto-de-0005. first-day-global-0016. pto-uk-0027. pto-uk-0008. pto-uk-0019. probation-us-00010. final-pay-us-000
Hybrid (no rerank)
chunk:R@5=67%MRR=1.00NDCG=77%|sub:R@5=100%MRR=1.00NDCG=36%1111ms
1. pto-us-0002. pto-us-0013. pto-uk-0024. pto-de-0005. pto-de-0016. first-day-global-0017. pto-uk-0008. pto-uk-0019. probation-us-00010. final-pay-us-000
Hybrid + Semantic Reranking
chunk:R@5=33%MRR=1.00NDCG=64%|sub:R@5=100%MRR=1.00NDCG=30%141ms
1. pto-us-0002. pto-de-0003. pto-de-0014. pto-uk-0005. pto-uk-0026. pto-us-0017. pto-uk-0018. final-pay-uk-0009. probation-us-00010. hsa-fsa-uk-001
Hybrid + Semantic + Field Boosting
chunk:R@5=33%MRR=1.00NDCG=64%|sub:R@5=100%MRR=1.00NDCG=30%129ms
1. pto-us-0002. pto-de-0003. pto-de-0014. pto-uk-0005. pto-uk-0026. pto-us-0017. pto-uk-0018. final-pay-uk-0009. probation-us-00010. hsa-fsa-uk-001
Hybrid + Semantic + Query Rewriter
chunk:R@5=67%MRR=1.00NDCG=77%|sub:R@5=100%MRR=1.00NDCG=36%131ms
1. pto-us-0002. pto-us-0013. probation-us-0004. first-day-global-0015. final-pay-us-0006. at-will-termination-us-0017. separation-us-0018. eligibility-global-0019. jury-duty-us-00110. probation-us-001
Hybrid + Semantic + Parent-Doc (MMR)
chunk:R@5=33%MRR=1.00NDCG=47%|sub:R@5=100%MRR=1.00NDCG=22%139ms
1. pto-us-0002. pto-de-0003. pto-uk-0004. final-pay-uk-0005. probation-us-0006. hsa-fsa-uk-0017. at-will-termination-de-0028. first-day-global-0019. final-pay-us-00010. probation-uk-001
02What is the 401k match?simple lookup3 GT
Ground Truth: retirement-us-000, retirement-us-001, retirement-us-002
BM25 Only
chunk:R@5=100%MRR=1.00NDCG=100%|sub:R@5=100%MRR=1.00NDCG=47%38ms
1. retirement-us-0002. retirement-us-0023. retirement-us-0014. retirement-uk-0005. retirement-uk-0016. retirement-de-0007. retirement-de-0018. retirement-de-0029. incident-reporting-de-00110. hsa-fsa-uk-000
Vector Only
chunk:R@5=100%MRR=1.00NDCG=100%|sub:R@5=100%MRR=1.00NDCG=47%67ms
1. retirement-us-0002. retirement-us-0013. retirement-us-0024. retirement-uk-0005. retirement-de-0016. retirement-uk-0017. hsa-fsa-de-0018. health-insurance-us-0009. retirement-de-00010. hsa-fsa-us-000
Hybrid (no rerank)
chunk:R@5=100%MRR=1.00NDCG=100%|sub:R@5=100%MRR=1.00NDCG=47%47ms
1. retirement-us-0002. retirement-us-0013. retirement-us-0024. retirement-uk-0005. retirement-uk-0016. retirement-de-0017. retirement-de-0008. hsa-fsa-de-0019. health-insurance-us-00010. retirement-de-002
Hybrid + Semantic Reranking
chunk:R@5=100%MRR=1.00NDCG=100%|sub:R@5=100%MRR=1.00NDCG=47%89ms
1. retirement-us-0002. retirement-us-0023. retirement-us-0014. retirement-uk-0005. retirement-de-0016. retirement-de-0007. retirement-de-0028. retirement-uk-0019. hsa-fsa-de-00110. hsa-fsa-uk-000
Hybrid + Semantic + Field Boosting
chunk:R@5=100%MRR=1.00NDCG=100%|sub:R@5=100%MRR=1.00NDCG=47%94ms
1. retirement-us-0002. retirement-us-0023. retirement-us-0014. retirement-uk-0005. retirement-de-0016. retirement-de-0007. retirement-de-0028. retirement-uk-0019. hsa-fsa-de-00110. hsa-fsa-uk-000
Hybrid + Semantic + Query Rewriter
chunk:R@5=100%MRR=1.00NDCG=100%|sub:R@5=100%MRR=1.00NDCG=47%90ms
1. retirement-us-0002. retirement-us-0023. retirement-us-0014. hsa-fsa-us-0015. health-insurance-us-0016. hsa-fsa-us-0007. bonuses-global-0008. dental-vision-us-0029. goal-setting-global-00010. health-insurance-us-000
Hybrid + Semantic + Parent-Doc (MMR)
chunk:R@5=33%MRR=1.00NDCG=47%|sub:R@5=100%MRR=1.00NDCG=22%165ms
1. retirement-us-0002. retirement-uk-0003. retirement-de-0014. hsa-fsa-de-0025. final-pay-uk-0016. training-global-0007. hsa-fsa-uk-0018. life-insurance-global-0029. hsa-fsa-us-00110. tuition-reimbursement-global-002
03What's the password policy?simple lookup3 GT
Ground Truth: password-policy-global-000, password-policy-global-001, password-policy-global-002
BM25 Only
chunk:R@5=100%MRR=1.00NDCG=100%|sub:R@5=100%MRR=1.00NDCG=47%37ms
1. password-policy-global-0002. password-policy-global-0013. password-policy-global-0024. byod-global-0015. data-classification-us-0016. data-classification-de-0017. overtime-de-0028. overtime-uk-0029. overtime-de-00010. overtime-us-000
Vector Only
chunk:R@5=100%MRR=1.00NDCG=100%|sub:R@5=100%MRR=1.00NDCG=47%43ms
1. password-policy-global-0022. password-policy-global-0003. password-policy-global-0014. remote-access-global-0015. byod-global-0016. acceptable-use-global-0017. data-classification-us-0018. remote-access-global-0009. social-media-global-00110. social-media-global-002
Hybrid (no rerank)
chunk:R@5=100%MRR=1.00NDCG=100%|sub:R@5=100%MRR=1.00NDCG=47%64ms
1. password-policy-global-0002. password-policy-global-0023. password-policy-global-0014. byod-global-0015. data-classification-us-0016. acceptable-use-global-0017. social-media-global-0028. remote-access-global-0009. social-media-global-00110. remote-access-global-001
Hybrid + Semantic Reranking
chunk:R@5=100%MRR=1.00NDCG=100%|sub:R@5=100%MRR=1.00NDCG=47%133ms
1. password-policy-global-0002. password-policy-global-0023. password-policy-global-0014. confidentiality-global-0015. data-classification-de-0016. data-classification-us-0017. byod-global-0018. acceptable-use-global-0019. equipment-global-00210. remote-access-global-000
Hybrid + Semantic + Field Boosting
chunk:R@5=100%MRR=1.00NDCG=100%|sub:R@5=100%MRR=1.00NDCG=47%127ms
1. password-policy-global-0002. password-policy-global-0023. password-policy-global-0014. confidentiality-global-0015. data-classification-de-0016. data-classification-us-0017. byod-global-0018. acceptable-use-global-0019. equipment-global-00210. remote-access-global-000
Hybrid + Semantic + Query Rewriter
chunk:R@5=100%MRR=1.00NDCG=100%|sub:R@5=100%MRR=1.00NDCG=47%140ms
1. password-policy-global-0002. password-policy-global-0023. password-policy-global-0014. confidentiality-global-0015. data-classification-de-0016. data-classification-us-0017. byod-global-0018. acceptable-use-global-0019. equipment-global-00210. remote-access-global-000
Hybrid + Semantic + Parent-Doc (MMR)
chunk:R@5=33%MRR=1.00NDCG=47%|sub:R@5=100%MRR=1.00NDCG=22%127ms
1. password-policy-global-0002. confidentiality-global-0013. data-classification-de-0014. data-classification-us-0015. byod-global-0016. acceptable-use-global-0017. equipment-global-0028. remote-access-global-0009. social-media-global-00210. data-classification-uk-000
04What is the probationary period for new hires?simple lookup2 GT
Ground Truth: probation-us-000, probation-us-001
BM25 Only
chunk:R@5=100%MRR=0.50NDCG=62%|sub:R@5=100%MRR=0.50NDCG=30%39ms
1. probation-de-0002. probation-us-0003. probation-uk-0004. probation-de-0015. probation-us-0016. probation-uk-0017. probation-us-0028. first-day-global-0019. first-day-global-00010. reviews-global-000
Vector Only
chunk:R@5=50%MRR=0.50NDCG=61%|sub:R@5=100%MRR=1.00NDCG=44%41ms
1. probation-us-0022. probation-us-0003. probation-de-0014. probation-de-0005. probation-uk-0016. probation-us-0017. probation-uk-0008. first-day-global-0019. separation-us-00210. at-will-termination-de-000
Hybrid (no rerank)
chunk:R@5=50%MRR=1.00NDCG=82%|sub:R@5=100%MRR=1.00NDCG=39%55ms
1. probation-us-0002. probation-de-0003. probation-de-0014. probation-us-0025. probation-uk-0006. probation-uk-0017. probation-us-0018. first-day-global-0019. at-will-termination-de-00010. separation-us-002
Hybrid + Semantic Reranking
chunk:R@5=50%MRR=0.50NDCG=59%|sub:R@5=100%MRR=0.50NDCG=31%140ms
1. probation-uk-0002. probation-us-0003. probation-de-0004. probation-us-0025. probation-de-0016. probation-uk-0017. probation-us-0018. eligibility-global-0009. reviews-global-00010. hsa-fsa-de-001
Hybrid + Semantic + Field Boosting
chunk:R@5=50%MRR=0.50NDCG=59%|sub:R@5=100%MRR=0.50NDCG=31%141ms
1. probation-uk-0002. probation-us-0003. probation-de-0004. probation-us-0025. probation-de-0016. probation-uk-0017. probation-us-0018. eligibility-global-0009. reviews-global-00010. hsa-fsa-de-001
Hybrid + Semantic + Query Rewriter
chunk:R@5=100%MRR=1.00NDCG=92%|sub:R@5=100%MRR=1.00NDCG=47%117ms
1. probation-us-0002. probation-us-0023. probation-us-0014. eligibility-global-0005. reviews-global-0006. byod-global-0007. health-insurance-us-0008. dental-vision-us-0009. life-insurance-global-00010. bonuses-global-001
Hybrid + Semantic + Parent-Doc (MMR)
chunk:R@5=50%MRR=0.50NDCG=39%|sub:R@5=100%MRR=0.50NDCG=14%122ms
1. probation-uk-0002. probation-us-0003. probation-de-0004. eligibility-global-0005. reviews-global-0006. hsa-fsa-de-0017. at-will-termination-uk-0008. at-will-termination-de-0009. dental-vision-de-00010. sick-leave-de-000
05What benefits am I eligible for as a part-time employee?advisory3 GT
Ground Truth: eap-global-000, sick-leave-us-000, health-insurance-us-000
BM25 Only
chunk:R@5=33%MRR=1.00NDCG=47%|sub:R@5=33%MRR=1.00NDCG=22%38ms
1. eap-global-0002. life-insurance-global-0013. retirement-us-0004. retirement-uk-0005. dental-vision-de-0006. health-insurance-uk-0027. hsa-fsa-uk-0008. hsa-fsa-uk-0019. pto-us-00010. dental-vision-us-000
Vector Only
chunk:R@5=33%MRR=0.50NDCG=30%|sub:R@5=33%MRR=0.50NDCG=21%46ms
1. hsa-fsa-uk-0012. health-insurance-us-0003. health-insurance-uk-0024. probation-us-0015. life-insurance-global-0016. life-insurance-global-0027. work-hours-us-0008. retirement-us-0009. health-insurance-us-00110. retirement-uk-001
Hybrid (no rerank)
chunk:R@5=33%MRR=0.20NDCG=18%|sub:R@5=33%MRR=0.20NDCG=16%65ms
1. life-insurance-global-0012. hsa-fsa-uk-0013. health-insurance-uk-0024. retirement-us-0005. health-insurance-us-0006. health-insurance-us-0017. life-insurance-global-0028. retirement-uk-0019. probation-us-00110. work-hours-us-000
Hybrid + Semantic Reranking
chunk:R@5=67%MRR=1.00NDCG=70%|sub:R@5=67%MRR=1.00NDCG=40%151ms
1. eap-global-0002. health-insurance-uk-0023. sick-leave-us-0004. hsa-fsa-uk-0015. life-insurance-global-0016. tuition-reimbursement-global-0007. final-pay-uk-0018. equipment-global-0009. health-insurance-us-00210. final-pay-de-001
Hybrid + Semantic + Field Boosting
chunk:R@5=33%MRR=1.00NDCG=47%|sub:R@5=33%MRR=1.00NDCG=30%129ms
1. eap-global-0002. health-insurance-uk-0023. hsa-fsa-uk-0014. life-insurance-global-0015. final-pay-uk-0016. health-insurance-us-0027. final-pay-de-0018. retirement-us-0009. pto-us-00010. final-pay-us-001
Hybrid + Semantic + Query Rewriter
chunk:R@5=67%MRR=1.00NDCG=77%|sub:R@5=67%MRR=1.00NDCG=43%130ms
1. eap-global-0002. sick-leave-us-0003. training-global-0004. pay-schedule-us-0005. life-insurance-global-0016. tuition-reimbursement-global-0007. equipment-global-0008. goal-setting-global-0009. health-insurance-us-00210. grievance-us-000
Hybrid + Semantic + Parent-Doc (MMR)
chunk:R@5=33%MRR=1.00NDCG=47%|sub:R@5=33%MRR=1.00NDCG=29%151ms
1. eap-global-0002. health-insurance-uk-0023. hsa-fsa-uk-0014. life-insurance-global-0015. final-pay-uk-0016. equipment-global-0007. health-insurance-us-0028. retirement-us-0009. pto-us-00010. final-pay-us-001
06I need to take extended leave for a family medical issue. What are my options?advisory4 GT
Ground Truth: family-medical-leave-us-000, family-medical-leave-us-001, sick-leave-us-000, eap-global-000
BM25 Only
chunk:R@5=50%MRR=1.00NDCG=54%|sub:R@5=33%MRR=1.00NDCG=40%39ms
1. family-medical-leave-us-0012. family-medical-leave-de-0013. family-medical-leave-de-0004. family-medical-leave-us-0025. family-medical-leave-us-0006. family-medical-leave-de-0027. family-medical-leave-uk-0008. family-medical-leave-uk-0029. family-medical-leave-uk-00110. bereavement-global-000
Vector Only
chunk:R@5=75%MRR=1.00NDCG=71%|sub:R@5=67%MRR=1.00NDCG=54%43ms
1. family-medical-leave-us-0012. family-medical-leave-de-0013. family-medical-leave-uk-0024. family-medical-leave-us-0005. sick-leave-us-0006. parental-leave-us-0017. sick-leave-us-0028. family-medical-leave-de-0009. family-medical-leave-us-00210. bereavement-global-000
Hybrid (no rerank)
chunk:R@5=50%MRR=1.00NDCG=72%|sub:R@5=33%MRR=1.00NDCG=55%62ms
1. family-medical-leave-us-0012. family-medical-leave-de-0013. family-medical-leave-us-0004. family-medical-leave-de-0005. family-medical-leave-uk-0026. family-medical-leave-us-0027. sick-leave-us-0008. sick-leave-us-0029. parental-leave-us-00110. bereavement-global-000
Hybrid + Semantic Reranking
chunk:R@5=50%MRR=0.50NDCG=56%|sub:R@5=33%MRR=0.50NDCG=48%130ms
1. family-medical-leave-de-0012. family-medical-leave-us-0013. family-medical-leave-us-0004. family-medical-leave-de-0005. family-medical-leave-us-0026. sick-leave-us-0027. family-medical-leave-uk-0028. health-insurance-us-0029. sick-leave-us-00010. health-insurance-de-002
Hybrid + Semantic + Field Boosting
chunk:R@5=50%MRR=0.50NDCG=56%|sub:R@5=33%MRR=0.50NDCG=48%139ms
1. family-medical-leave-de-0012. family-medical-leave-us-0013. family-medical-leave-us-0004. family-medical-leave-de-0005. family-medical-leave-us-0026. sick-leave-us-0027. family-medical-leave-uk-0028. health-insurance-us-0029. sick-leave-us-00010. health-insurance-de-002
Hybrid + Semantic + Query Rewriter
chunk:R@5=50%MRR=1.00NDCG=78%|sub:R@5=67%MRR=1.00NDCG=64%127ms
1. family-medical-leave-us-0012. family-medical-leave-us-0003. family-medical-leave-us-0024. sick-leave-us-0025. health-insurance-us-0026. sick-leave-us-0007. final-pay-us-0018. probation-us-0019. parental-leave-us-00010. pto-us-000
Hybrid + Semantic + Parent-Doc (MMR)
chunk:R@5=25%MRR=0.50NDCG=25%|sub:R@5=67%MRR=0.50NDCG=25%134ms
1. family-medical-leave-de-0012. family-medical-leave-us-0013. sick-leave-us-0024. family-medical-leave-uk-0025. health-insurance-us-0026. health-insurance-de-0027. final-pay-us-0018. parental-leave-us-0009. pto-us-00010. pto-uk-002
07What should I know as a new remote employee?advisory6 GT
Ground Truth: eligibility-global-000, equipment-global-000, work-hours-us-000, communication-global-000, home-office-global-000, remote-access-global-000
BM25 Only
chunk:R@5=33%MRR=0.50NDCG=61%|sub:R@5=50%MRR=1.00NDCG=93%38ms
1. eligibility-global-0012. eligibility-global-0003. eligibility-global-0024. remote-access-global-0005. equipment-global-0016. work-hours-us-0007. work-hours-de-0008. home-office-global-0009. remote-access-global-00110. equipment-global-000
Vector Only
chunk:R@5=33%MRR=0.50NDCG=50%|sub:R@5=50%MRR=1.00NDCG=93%68ms
1. eligibility-global-0012. equipment-global-0003. eligibility-global-0024. communication-global-0015. eligibility-global-0006. equipment-global-0017. communication-global-0008. work-hours-us-0009. work-hours-uk-00110. home-office-global-002
Hybrid (no rerank)
chunk:R@5=33%MRR=0.33NDCG=47%|sub:R@5=33%MRR=1.00NDCG=94%66ms
1. eligibility-global-0012. eligibility-global-0023. eligibility-global-0004. equipment-global-0015. equipment-global-0006. work-hours-us-0007. communication-global-0018. communication-global-0009. home-office-global-00210. work-hours-uk-001
Hybrid + Semantic Reranking
chunk:R@5=17%MRR=0.50NDCG=58%|sub:R@5=50%MRR=1.00NDCG=100%164ms
1. eligibility-global-0012. eligibility-global-0003. eligibility-global-0024. remote-access-global-0015. communication-global-0016. remote-access-global-0007. home-office-global-0008. home-office-global-0029. work-hours-us-00010. communication-global-000
Hybrid + Semantic + Field Boosting
chunk:R@5=17%MRR=0.50NDCG=58%|sub:R@5=50%MRR=1.00NDCG=100%130ms
1. eligibility-global-0012. eligibility-global-0003. eligibility-global-0024. remote-access-global-0015. communication-global-0016. remote-access-global-0007. home-office-global-0008. home-office-global-0029. work-hours-us-00010. communication-global-000
Hybrid + Semantic + Query Rewriter
chunk:R@5=17%MRR=0.50NDCG=58%|sub:R@5=50%MRR=1.00NDCG=100%150ms
1. eligibility-global-0012. eligibility-global-0003. eligibility-global-0024. remote-access-global-0015. communication-global-0016. remote-access-global-0007. home-office-global-0008. home-office-global-0029. work-hours-us-00010. communication-global-000
Hybrid + Semantic + Parent-Doc (MMR)
chunk:R@5=33%MRR=0.25NDCG=35%|sub:R@5=83%MRR=1.00NDCG=72%151ms
1. eligibility-global-0012. remote-access-global-0013. communication-global-0014. home-office-global-0005. work-hours-us-0006. work-hours-uk-0017. equipment-global-0008. work-hours-de-0009. byod-global-00110. at-will-termination-uk-000
08What's the difference between FMLA and parental leave?comparison4 GT
Ground Truth: family-medical-leave-us-000, family-medical-leave-us-001, parental-leave-us-000, parental-leave-us-001
BM25 Only
chunk:R@5=50%MRR=1.00NDCG=78%|sub:R@5=50%MRR=1.00NDCG=51%37ms
1. parental-leave-us-0002. parental-leave-de-0003. parental-leave-uk-0014. parental-leave-de-0015. parental-leave-us-0016. parental-leave-uk-0007. parental-leave-uk-0028. family-medical-leave-us-0019. family-medical-leave-us-00010. family-medical-leave-us-002
Vector Only
chunk:R@5=100%MRR=1.00NDCG=98%|sub:R@5=100%MRR=1.00NDCG=63%42ms
1. family-medical-leave-us-0012. parental-leave-us-0013. parental-leave-us-0004. parental-leave-uk-0025. family-medical-leave-us-0006. family-medical-leave-us-0027. family-medical-leave-de-0018. parental-leave-uk-0019. parental-leave-de-00010. parental-leave-de-001
Hybrid (no rerank)
chunk:R@5=75%MRR=1.00NDCG=96%|sub:R@5=100%MRR=1.00NDCG=61%53ms
1. parental-leave-us-0002. parental-leave-us-0013. family-medical-leave-us-0014. parental-leave-de-0005. parental-leave-uk-0016. parental-leave-uk-0027. parental-leave-de-0018. family-medical-leave-us-0009. family-medical-leave-us-00210. family-medical-leave-de-001
Hybrid + Semantic Reranking
chunk:R@5=100%MRR=1.00NDCG=100%|sub:R@5=100%MRR=1.00NDCG=64%117ms
1. parental-leave-us-0002. family-medical-leave-us-0003. parental-leave-us-0014. family-medical-leave-us-0015. parental-leave-de-0016. family-medical-leave-us-0027. parental-leave-uk-0008. parental-leave-de-0009. health-insurance-us-00210. pto-us-000
Hybrid + Semantic + Field Boosting
chunk:R@5=100%MRR=1.00NDCG=100%|sub:R@5=100%MRR=1.00NDCG=64%116ms
1. parental-leave-us-0002. family-medical-leave-us-0003. parental-leave-us-0014. family-medical-leave-us-0015. parental-leave-de-0016. family-medical-leave-us-0027. parental-leave-uk-0008. parental-leave-de-0009. pto-us-00010. parental-leave-uk-002
Hybrid + Semantic + Query Rewriter
chunk:R@5=100%MRR=1.00NDCG=100%|sub:R@5=100%MRR=1.00NDCG=65%118ms
1. parental-leave-us-0002. family-medical-leave-us-0003. parental-leave-us-0014. family-medical-leave-us-0015. family-medical-leave-us-0026. health-insurance-us-0027. pto-us-0008. retirement-us-0019. probation-us-00110. grievance-us-000
Hybrid + Semantic + Parent-Doc (MMR)
chunk:R@5=50%MRR=1.00NDCG=64%|sub:R@5=100%MRR=1.00NDCG=36%130ms
1. parental-leave-us-0002. family-medical-leave-us-0003. parental-leave-de-0014. parental-leave-uk-0005. health-insurance-us-0026. pto-us-0007. probation-us-0018. at-will-termination-us-0019. sick-leave-us-00210. pto-uk-002
09Compare PTO policies across US, UK, and Germanycomparison3 GT
Ground Truth: pto-us-000, pto-uk-000, pto-de-000
BM25 Only
chunk:R@5=100%MRR=1.00NDCG=85%|sub:R@5=100%MRR=1.00NDCG=80%38ms
1. pto-us-0002. pto-us-0013. pto-uk-0024. pto-uk-0005. pto-de-0006. pto-de-0017. pto-uk-0018. final-pay-us-0009. family-medical-leave-uk-00010. overtime-uk-002
Vector Only
chunk:R@5=33%MRR=0.50NDCG=30%|sub:R@5=33%MRR=1.00NDCG=36%43ms
1. pto-de-0012. pto-de-0003. at-will-termination-de-0024. goal-setting-global-0025. overtime-de-0006. retirement-de-0027. retirement-de-0018. overtime-de-0019. hsa-fsa-uk-00110. eligibility-global-002
Hybrid (no rerank)
chunk:R@5=33%MRR=0.50NDCG=30%|sub:R@5=33%MRR=1.00NDCG=36%67ms
1. pto-de-0012. pto-de-0003. at-will-termination-de-0024. overtime-de-0005. hsa-fsa-uk-0016. goal-setting-global-0027. overtime-de-0018. retirement-de-0029. retirement-de-00110. eligibility-global-002
Hybrid + Semantic Reranking
chunk:R@5=67%MRR=0.50NDCG=62%|sub:R@5=67%MRR=1.00NDCG=75%127ms
1. pto-de-0012. pto-de-0003. at-will-termination-de-0024. pto-uk-0025. pto-uk-0006. pto-uk-0017. eligibility-global-0028. pto-us-0019. pto-us-00010. final-pay-us-000
Hybrid + Semantic + Field Boosting
chunk:R@5=67%MRR=0.50NDCG=62%|sub:R@5=67%MRR=1.00NDCG=75%170ms
1. pto-de-0012. pto-de-0003. at-will-termination-de-0024. pto-uk-0025. pto-uk-0006. pto-uk-0017. eligibility-global-0028. pto-us-0019. pto-us-00010. final-pay-us-000
Hybrid + Semantic + Query Rewriter
chunk:R@5=33%MRR=0.20NDCG=51%|sub:R@5=100%MRR=0.50NDCG=65%385ms
1. eligibility-global-0022. pto-uk-0023. pto-de-0014. pto-us-0015. pto-uk-0006. pto-de-0007. pto-us-0008. pto-uk-0019. at-will-termination-de-00210. final-pay-us-000
Hybrid + Semantic + Parent-Doc (MMR)
chunk:R@5=0%MRR=0.00NDCG=0%|sub:R@5=100%MRR=1.00NDCG=42%139ms
1. pto-de-0012. at-will-termination-de-0023. pto-uk-0024. eligibility-global-0025. pto-us-0016. final-pay-us-0007. hsa-fsa-uk-0018. goal-setting-global-0029. life-insurance-global-00110. bereavement-global-002
10How do I report a workplace safety concern?procedural3 GT
Ground Truth: reporting-injuries-global-000, reporting-injuries-global-001, general-safety-global-000
BM25 Only
chunk:R@5=67%MRR=1.00NDCG=82%|sub:R@5=100%MRR=1.00NDCG=73%38ms
1. general-safety-global-0002. general-safety-global-0023. general-safety-global-0014. reporting-injuries-global-0025. reporting-injuries-global-0006. reporting-injuries-global-0017. ergonomics-global-0028. emergency-procedures-global-0029. emergency-procedures-global-00110. emergency-procedures-global-000
Vector Only
chunk:R@5=100%MRR=1.00NDCG=100%|sub:R@5=100%MRR=1.00NDCG=73%44ms
1. reporting-injuries-global-0002. reporting-injuries-global-0013. general-safety-global-0004. general-safety-global-0025. reporting-injuries-global-0026. general-safety-global-0017. home-office-global-0028. incident-reporting-uk-0009. incident-reporting-us-00010. ergonomics-global-002
Hybrid (no rerank)
chunk:R@5=100%MRR=1.00NDCG=97%|sub:R@5=100%MRR=1.00NDCG=73%60ms
1. general-safety-global-0002. reporting-injuries-global-0003. general-safety-global-0024. reporting-injuries-global-0015. general-safety-global-0016. reporting-injuries-global-0027. ergonomics-global-0028. home-office-global-0029. incident-reporting-uk-00010. incident-reporting-us-000
Hybrid + Semantic Reranking
chunk:R@5=67%MRR=1.00NDCG=82%|sub:R@5=100%MRR=1.00NDCG=73%125ms
1. general-safety-global-0002. general-safety-global-0023. reporting-injuries-global-0024. general-safety-global-0015. reporting-injuries-global-0016. reporting-injuries-global-0007. ergonomics-global-0028. incident-reporting-us-0009. incident-reporting-uk-00010. incident-reporting-us-002
Hybrid + Semantic + Field Boosting
chunk:R@5=67%MRR=1.00NDCG=82%|sub:R@5=100%MRR=1.00NDCG=73%153ms
1. general-safety-global-0002. general-safety-global-0023. reporting-injuries-global-0024. general-safety-global-0015. reporting-injuries-global-0016. reporting-injuries-global-0007. ergonomics-global-0028. incident-reporting-us-0009. incident-reporting-uk-00010. incident-reporting-us-002
Hybrid + Semantic + Query Rewriter
chunk:R@5=67%MRR=1.00NDCG=82%|sub:R@5=100%MRR=1.00NDCG=73%124ms
1. general-safety-global-0002. general-safety-global-0023. reporting-injuries-global-0024. general-safety-global-0015. reporting-injuries-global-0016. reporting-injuries-global-0007. ergonomics-global-0028. incident-reporting-us-0009. incident-reporting-uk-00010. incident-reporting-us-002
Hybrid + Semantic + Parent-Doc (MMR)
chunk:R@5=33%MRR=1.00NDCG=47%|sub:R@5=100%MRR=1.00NDCG=36%203ms
1. general-safety-global-0002. reporting-injuries-global-0023. ergonomics-global-0024. incident-reporting-us-0005. incident-reporting-uk-0006. grievance-de-0007. emergency-procedures-global-0018. grievance-uk-0009. home-office-global-00210. incident-reporting-de-003
11What's the process for submitting an expense report?procedural2 GT
Ground Truth: expense-reimbursement-global-000, expense-reimbursement-global-001
BM25 Only
chunk:R@5=100%MRR=0.50NDCG=69%|sub:R@5=100%MRR=1.00NDCG=47%37ms
1. expense-reimbursement-global-0022. expense-reimbursement-global-0013. expense-reimbursement-global-0004. grievance-de-0025. grievance-uk-0016. grievance-uk-0027. reporting-injuries-global-0018. incident-reporting-us-0029. grievance-us-00210. incident-reporting-de-001
Vector Only
chunk:R@5=100%MRR=0.50NDCG=69%|sub:R@5=100%MRR=1.00NDCG=47%41ms
1. expense-reimbursement-global-0022. expense-reimbursement-global-0003. expense-reimbursement-global-0014. tuition-reimbursement-global-0015. reporting-injuries-global-0016. gifts-entertainment-global-0027. grievance-us-0018. equipment-global-0029. incident-reporting-uk-00010. training-global-001
Hybrid (no rerank)
chunk:R@5=100%MRR=0.50NDCG=69%|sub:R@5=100%MRR=1.00NDCG=47%62ms
1. expense-reimbursement-global-0022. expense-reimbursement-global-0003. expense-reimbursement-global-0014. reporting-injuries-global-0015. grievance-us-0016. incident-reporting-uk-0007. tuition-reimbursement-global-0018. gifts-entertainment-global-0029. equipment-global-00210. training-global-001
Hybrid + Semantic Reranking
chunk:R@5=100%MRR=0.50NDCG=65%|sub:R@5=100%MRR=1.00NDCG=45%123ms
1. expense-reimbursement-global-0022. expense-reimbursement-global-0003. hsa-fsa-us-0014. expense-reimbursement-global-0015. tuition-reimbursement-global-0016. gifts-entertainment-global-0027. equipment-global-0028. equipment-global-0019. dental-vision-uk-00110. training-global-000
Hybrid + Semantic + Field Boosting
chunk:R@5=100%MRR=0.50NDCG=65%|sub:R@5=100%MRR=1.00NDCG=45%162ms
1. expense-reimbursement-global-0022. expense-reimbursement-global-0003. hsa-fsa-us-0014. expense-reimbursement-global-0015. tuition-reimbursement-global-0016. gifts-entertainment-global-0027. equipment-global-0028. equipment-global-0019. dental-vision-uk-00110. training-global-000
Hybrid + Semantic + Query Rewriter
chunk:R@5=100%MRR=0.50NDCG=65%|sub:R@5=100%MRR=1.00NDCG=45%220ms
1. expense-reimbursement-global-0022. expense-reimbursement-global-0003. hsa-fsa-us-0014. expense-reimbursement-global-0015. tuition-reimbursement-global-0016. gifts-entertainment-global-0027. equipment-global-0028. equipment-global-0019. dental-vision-uk-00110. training-global-000
Hybrid + Semantic + Parent-Doc (MMR)
chunk:R@5=0%MRR=0.00NDCG=0%|sub:R@5=100%MRR=1.00NDCG=22%125ms
1. expense-reimbursement-global-0022. hsa-fsa-us-0013. tuition-reimbursement-global-0014. pay-schedule-uk-0015. pay-schedule-us-0016. gifts-entertainment-global-0027. equipment-global-0028. training-global-0009. reporting-injuries-global-00110. sick-leave-de-001
12Can I bring my dog to work?edge case1 GT
Ground Truth: ada-disability-us-000
BM25 Only
chunk:R@5=0%MRR=0.00NDCG=0%|sub:R@5=0%MRR=0.00NDCG=0%62ms
1. byod-global-0002. work-hours-uk-0023. eligibility-global-0014. eligibility-global-0005. work-hours-uk-0016. byod-global-0017. eligibility-global-0028. byod-global-0029. work-hours-de-00110. work-hours-us-000
Vector Only
chunk:R@5=0%MRR=0.00NDCG=0%|sub:R@5=0%MRR=0.00NDCG=0%43ms
1. work-hours-uk-0012. eligibility-global-0023. work-hours-uk-0024. work-hours-us-0005. byod-global-0006. work-hours-de-0017. work-hours-de-0028. eligibility-global-0019. work-hours-us-00110. pto-us-001
Hybrid (no rerank)
chunk:R@5=0%MRR=0.00NDCG=0%|sub:R@5=0%MRR=0.00NDCG=0%63ms
1. work-hours-uk-0022. byod-global-0003. work-hours-uk-0014. eligibility-global-0025. eligibility-global-0016. work-hours-us-0007. work-hours-de-0018. work-hours-de-0029. work-hours-us-00110. pto-us-001
Hybrid + Semantic Reranking
chunk:R@5=0%MRR=0.00NDCG=0%|sub:R@5=0%MRR=0.00NDCG=0%148ms
1. byod-global-0002. byod-global-0023. family-medical-leave-us-0004. byod-global-0015. work-hours-us-0016. equipment-global-0017. pto-us-0018. hsa-fsa-uk-0019. work-hours-uk-00210. eligibility-global-002
Hybrid + Semantic + Field Boosting
chunk:R@5=0%MRR=0.00NDCG=0%|sub:R@5=0%MRR=0.00NDCG=0%123ms
1. byod-global-0002. byod-global-0023. family-medical-leave-us-0004. byod-global-0015. work-hours-us-0016. equipment-global-0017. pto-us-0018. hsa-fsa-uk-0019. work-hours-uk-00210. eligibility-global-002
Hybrid + Semantic + Query Rewriter
chunk:R@5=0%MRR=0.00NDCG=0%|sub:R@5=0%MRR=0.00NDCG=0%123ms
1. byod-global-0002. byod-global-0023. family-medical-leave-us-0004. byod-global-0015. work-hours-us-0016. equipment-global-0017. pto-us-0018. hsa-fsa-uk-0019. work-hours-uk-00210. eligibility-global-002
Hybrid + Semantic + Parent-Doc (MMR)
chunk:R@5=0%MRR=0.14NDCG=33%|sub:R@5=0%MRR=0.14NDCG=7%133ms
1. byod-global-0002. work-hours-us-0013. acceptable-use-global-0004. equipment-global-0015. pto-us-0016. hsa-fsa-uk-0017. ada-disability-us-0008. work-hours-uk-0029. eligibility-global-00210. sick-leave-uk-001
13I'm a contractor in Germany, what leave do I get?edge case2 GT
Ground Truth: pto-de-000, at-will-termination-de-000
BM25 Only
chunk:R@5=0%MRR=0.00NDCG=0%|sub:R@5=0%MRR=0.00NDCG=0%39ms
1. parental-leave-de-0012. jury-duty-de-0003. parental-leave-de-0004. family-medical-leave-de-0005. sick-leave-de-0026. family-medical-leave-de-0027. sick-leave-de-0008. family-medical-leave-de-0019. jury-duty-de-00210. jury-duty-de-001
Vector Only
chunk:R@5=0%MRR=0.00NDCG=0%|sub:R@5=50%MRR=1.00NDCG=22%42ms
1. at-will-termination-de-0022. parental-leave-de-0013. health-insurance-de-0024. separation-de-0005. sick-leave-de-0016. separation-de-0027. sick-leave-de-0028. sick-leave-de-0009. parental-leave-de-00010. final-pay-de-000
Hybrid (no rerank)
chunk:R@5=0%MRR=0.00NDCG=0%|sub:R@5=0%MRR=0.17NDCG=8%61ms
1. parental-leave-de-0012. parental-leave-de-0003. sick-leave-de-0024. sick-leave-de-0005. sick-leave-de-0016. at-will-termination-de-0027. separation-de-0008. health-insurance-de-0029. separation-de-00210. final-pay-de-000
Hybrid + Semantic Reranking
chunk:R@5=0%MRR=0.00NDCG=0%|sub:R@5=50%MRR=0.25NDCG=16%134ms
1. separation-de-0002. sick-leave-de-0013. jury-duty-de-0004. at-will-termination-de-0025. health-insurance-de-0026. family-medical-leave-de-0017. sick-leave-de-0028. pto-de-0019. sick-leave-de-00010. family-medical-leave-de-002
Hybrid + Semantic + Field Boosting
chunk:R@5=0%MRR=0.00NDCG=0%|sub:R@5=50%MRR=0.25NDCG=16%134ms
1. separation-de-0002. sick-leave-de-0013. jury-duty-de-0004. at-will-termination-de-0025. health-insurance-de-0026. family-medical-leave-de-0017. sick-leave-de-0028. pto-de-0019. sick-leave-de-00010. family-medical-leave-de-002
Hybrid + Semantic + Query Rewriter
chunk:R@5=0%MRR=0.00NDCG=0%|sub:R@5=50%MRR=0.25NDCG=16%135ms
1. separation-de-0002. sick-leave-de-0013. jury-duty-de-0004. at-will-termination-de-0025. health-insurance-de-0026. family-medical-leave-de-0017. sick-leave-de-0028. pto-de-0019. sick-leave-de-00010. family-medical-leave-de-002
Hybrid + Semantic + Parent-Doc (MMR)
chunk:R@5=0%MRR=0.00NDCG=0%|sub:R@5=50%MRR=0.25NDCG=16%179ms
1. separation-de-0002. sick-leave-de-0013. jury-duty-de-0004. at-will-termination-de-0025. health-insurance-de-0026. family-medical-leave-de-0017. overtime-de-0018. pto-de-0019. parental-leave-de-00010. bereavement-global-002

IR Metrics

Recall@K — fraction of ground truth in the top K results
Precision@K — fraction of top K that are relevant
MRR — Mean Reciprocal Rank (1/rank of first relevant)
NDCG@10 — rank-weighted relevance over the top 10
Chunk-level — a hit requires the exact chunk_id
Subsection-level — a hit requires the chunk's policy/subsection to be in GT