Deduplication policies

Duplicate types

Kernel categorizes all accounts into one of five types:

Type
Definition

Primary

Primary record

Note that duplicates of this account may exist, but this is the record that is recommended to survive the merge.

Exact

Exact match found after ‘cleaning’ and standardizing the URLs

Subdomain

Similar to exact match, e.g. shop.ccs.com vs. ccs.com

Regional

fr.amazon.com, amazon.fr, amazon.com are all regional duplicates, but apollo.de and apollo.com are not.

Potential

A catch-all category for all the hard cases that require extensive work, e.g. corporate or careers sites, investor relationship, or product/marketing sites.

Regional account policy

You can treat regional duplicates in one of two ways:

  1. Treat regional sites as subsidiaries (keep separate, e.g., amazon.fr is a child of amazon.com)

  2. Treat regional sites as duplicates (collapse into the global parent)

This setting is relevant for determining the Cleaning action

Last updated