Question 1

How does Humaniti verify labels?

Accepted Answer

Every label routes to N independent Humans (default 3, configurable up to 7). Their answers are weighted by per-skill Elo and aggregated via Dawid-Skene, MACE, and CAZ peer prediction. The aggregator returns a consensus label and a confidence score. Disagreement triggers Steward adjudication.

Question 2

What is the Tier system?

Accepted Answer

Four tiers: T0 (signed up, no tasks), T1 (phone verified, low-stakes work), T2 (ID verified via Sumsub, standard work), T3 (reputation verified, sensitive work + Steward eligibility). Each Human also carries a per-skill Elo that gates which work they can pick up within their tier.

Question 3

What does a Builder configure?

Accepted Answer

N-Human consensus (1, 3, 5, or 7), gold-question density, target inter-Human agreement threshold, language and region constraints on the qualified pool, and Steward escalation thresholds. All defaults are tuned per task type.

Question 4

What happens when Humans disagree?

Accepted Answer

Mandatory rejection reason from any Human who disagrees with a peer. 72-hour appeal window for the original Human. If escalated, the item routes to a fresh pool of Humans (blind re-annotation), then a T3 Steward adjudicates. The outcome is final and posts to the append-only audit log.

Question 5

How does Humaniti detect cheating?

Accepted Answer

Behavioral telemetry (keystroke timing, pointer paths, dwell), text-detection ensemble (Binoculars + Fast-DetectGPT + Ghostbuster + SynthID-Text) for free-response fields, gold-question injection, wallet and device clustering, periodic Steward audits. Phase 3 adds slashing of staked credits.

How a label becomes verified.

From request to receipt.

Quality is a ladder.

Signed up

Phone verified

ID verified

Reputation verified

What a Builder controls.

Disagreement is data.

Why this can't be cheated.

What happens when Humans disagree.

Read deeper.