Русский

#1 in AI People Search

Proven across real-world use cases — customer discovery, recruiting, and partnerships.

4 ScenariosWeb-VerifiedOpen Source
Read the Paper on arXiv →

Platform Comparison

119 real-world queries, scored independently through web verification. Scale: 0–100.

Lessie
Exa
Claude Code
Juicebox
020406080Overall65.2554645.8Relevance70.253.854.344.7Coverage69.158.141.141.8Utility56.453.142.750.9
Overall = Relevance + Coverage + Utility3All scores on a 0–100 scale

Performance by Scenario

How each platform performs across four real-world use cases.

Lessie
Exa
Claude Code
Juicebox
Influencer / KOL
Lessie
62.3
Claude Code
43.2
Exa
41.6
Juicebox
31.1
Expert / Deterministic
Lessie
70.4
Exa
61.2
Claude Code
57
Juicebox
44.2
B2B Prospecting
Lessie
60.6
Exa
55.2
Juicebox
51.4
Claude Code
43
Recruiting
Lessie
68.2
Juicebox
65.7
Exa
64.7
Claude Code
50.5
0255075100

Scenario Deep Dive

Breaking down Relevance, Coverage, and Utility across each scenario.

Influencer / KOL

Finding content creators across social platforms
RelevanceCoverageUtility65.262.858.9
Lessie
Exa
Claude Code
Juicebox

Expert / Deterministic

Queries with verifiable correct answers or that seek specific domain experts
RelevanceCoverageUtility7975.257.1
Lessie
Exa
Claude Code
Juicebox

B2B Prospecting

Finding decision-makers at target companies
RelevanceCoverageUtility62.863.555.5
Lessie
Exa
Juicebox
Claude Code

Recruiting

Finding candidates with specific skills, experience, and location
RelevanceCoverageUtility74.875.654.3
Lessie
Exa
Juicebox
Claude Code

Evaluation Dataset

Curated from real practitioner workflows in recruiting, sales, and research.

119Total Queries
Multi-languagePractitioner-driven

English, Portuguese, Spanish, Dutch.
Recruiting (30), B2B Prospecting (32), Expert (28), Influencer (29).

4Scenario Categories
RecruitingB2B ProspectingExpert / DeterministicInfluencer

The core use cases where AI people search creates business value.

3Evaluation Dimensions
RelevanceCoverageUtility

Independent axes measuring ranking quality, result volume, and data completeness.

Methodology

Fully automated, reproducible pipeline. Every result verified against live web sources.

1

Decompose the Query

“Senior ML engineer at a Series B startup in Berlin” becomes a checklist: role, seniority, domain, company stage, location.

2

Verify Against the Web

Every person returned by every platform is checked against LinkedIn, company sites, and social profiles. No self-reported data - only what can be independently confirmed.

3

Score on Three Axes

Relevance (did you find the right people?), Coverage (how many?), and Utility (is the profile data actually useful?). Combined into one Overall score.

What We Measure

Relevance

Padded nDCG@10

Measures whether returned people match the query and are correctly ranked. Each person is web-verified and graded against explicit criteria. Padded to 10 slots - returning fewer results is penalized.

Coverage

TCR × Yield

Measures how many qualified people are found per query. Combines task completion rate with average qualified result yield (capped at K=10). Rewards both reliability and volume.

Utility

(C + E + A) / 3

Measures whether returned data is complete and actionable. Averages three sub-dimensions: structural completeness (C), query-specific evidence (E), and actionability (A).

Key Findings

Highlights from 476 platform runs across 119 queries.

#1in All Four Scenarios

Lessie is the only platform to lead every category - Recruiting, B2B, Expert / Deterministic, and Influencer / KOL.

100%Completion Rate

Every query returned results. No other platform achieved this - especially on niche and abstract searches where others returned nothing.

Largest Relevance Gap

70.2vs54.3
+29% over next-best

The ranking quality difference is most pronounced on multi-criteria queries.

Influencer is the Widest Gap

Lessie
62.3
Claude
43.2

Lessie scored 62.3 overall; the runner-up scored 43.2. Single-source platforms struggle most here.

Utility is the Closest Race

Profile data completeness is the most competitive dimension - all platforms scored between 42.7 and 56.4.

Recruiting is the Most Competitive

Three platforms scored above 64 overall. This is the scenario where existing tools perform best - and where margins are thinnest.

Try Lessie

One search across professional networks, social platforms, and academic databases.