Steward
Purpose
For AUTO_DUPLICATES entities, the steward automatically processes MATCH buckets.
Steward Configuration
JSON
{
  "type": "entity",
  "id": "customers",
  "type": "AUTO_DUPLICATES",
  "steward": "customer_steward",
  "stewardCron": "0 2 * * *",
  "automatic": true
}
Steward Process
1. Find buckets classified as MATCH
2. Validate classification scores
3. Apply merger to create golden records
4. Move original records to history
5. Update entity statistics
6. Log results
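The steps above can be sketched in Python as follows. This is an illustrative outline only, not the product's implementation: the Bucket and StewardResult classes, the merge_records helper, the min_score threshold of 0.9, and the "last non-empty value wins" merge rule are all assumptions made for the example.

```python
import logging
from dataclasses import dataclass, field

log = logging.getLogger("steward")


@dataclass
class Bucket:
    """Hypothetical bucket shape: a classification, its score, and member records."""
    classification: str
    score: float
    records: list


@dataclass
class StewardResult:
    golden: list = field(default_factory=list)   # merged golden records
    history: list = field(default_factory=list)  # originals moved to history
    stats: dict = field(default_factory=dict)    # per-run statistics


def merge_records(records):
    """Assumed merge rule: later non-empty values overwrite earlier ones."""
    golden = {}
    for record in records:
        for key, value in record.items():
            if value not in (None, ""):
                golden[key] = value
    return golden


def run_steward(buckets, min_score=0.9):
    result = StewardResult()
    skipped = 0
    for bucket in buckets:
        # Step 1: only MATCH buckets are processed.
        if bucket.classification != "MATCH":
            continue
        # Step 2: validate the classification score (threshold is an assumption).
        if bucket.score < min_score:
            skipped += 1
            continue
        # Step 3: apply the merger to create a golden record.
        result.golden.append(merge_records(bucket.records))
        # Step 4: move the original records to history.
        result.history.extend(bucket.records)
    # Step 5: update statistics for this run.
    result.stats = {
        "goldenRecords": len(result.golden),
        "historyRecords": len(result.history),
        "skippedBuckets": skipped,
    }
    # Step 6: log results.
    log.info("steward run finished: %s", result.stats)
    return result
```

In this sketch, a MATCH bucket whose score falls below the threshold is counted as skipped rather than merged, so it remains visible in the run statistics.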
Monitoring Steward
Check task executions:
BASH
GET /tasks/instances?status=COMPLETED
Review entity statistics:
BASH
GET /entities/customers
Track the duplicateBuckets count over time: if the steward is merging MATCH buckets, the count should generally drop after each run.
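One way to follow the duplicateBuckets count is to sample it periodically and compare consecutive readings. The sketch below is a minimal example under stated assumptions: BASE_URL is a placeholder, the API is assumed to return JSON without requiring authentication, and duplicateBuckets is assumed to be a top-level field of the entity response.

```python
import json
from urllib.request import urlopen

BASE_URL = "http://localhost:8080"  # placeholder; replace with your server address


def fetch_duplicate_buckets(entity_id, base_url=BASE_URL):
    """Read duplicateBuckets from GET /entities/<id> (field location is assumed)."""
    with urlopen(f"{base_url}/entities/{entity_id}") as response:
        entity = json.load(response)
    return entity["duplicateBuckets"]


def bucket_deltas(counts):
    """Return the change between each pair of consecutive samples."""
    return [later - earlier for earlier, later in zip(counts, counts[1:])]


def is_backlog_growing(counts):
    """True if every consecutive sample increased, i.e. the backlog never shrank."""
    deltas = bucket_deltas(counts)
    return bool(deltas) and all(delta > 0 for delta in deltas)
```

For example, collecting one sample per day with fetch_duplicate_buckets("customers") and passing the list to is_backlog_growing flags a backlog that has grown on every reading.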