We’ve raised $14M from top VCs and operators at Plaid, OpenAI, Slack and many more.
September 8, 2025

Kernel Deduplication

Kernel identifies groups of duplicate accounts and identifies the master record to preserve.

Kernel uses its proprietary, AI-driven algorithm to scan all accounts in the CRM.

The following data points are provided:

Duplicate types

Each record is associated with one of the following duplicate types

How Kernel identifies duplicates in your CRM

Kernel's deduplication works in two steps:

1. Candidate generation

For each account in your CRM, Kernel will scan the full CRM to create a long-list of potential duplicate candidates.

2. Candidate selection

Kernel will crawl the websites of all candidates and use data from the Website analysis to determine if the pair is a true duplicate pair. The duplicate type and group will also be calculated.

Kernel uses a contextual, AI-based approach to determine duplicate pairs, e.g. to decide that amazon.fr is a regional duplicate of amazon.com, but apollo.de is not a regional duplicate of apollo.com

Master record selection

Kernel uses a variety of factors to determine the master record (Primary), including

  • The top-level domain (TLD)
  • Redirecting domains

Duplicate groups

All associated duplicates are assigned a Duplicate group ID. Each duplicate group can only have 1 Primary account.

About the Author