Merge
When one or more pieces of data are determined to represent the same real-world object, the data is merged into a single i2 Analyze record. For the Information Store to merge data, the correlation identifiers must match, the implicit discriminators must be compatible, and the origin identifiers must be different.
- New data in the staging table contains the same correlation identifier as an existing record in the Information Store. The new data has an origin identifier that is not associated with the existing record.
- An update to an existing record with a single piece of provenance in the Information Store causes the correlation identifier of that record to change. The new correlation identifier matches with another record in the Information Store.
- Multiple rows of data in the staging table contain the same correlation identifier. The Information Store ingests the data as a new i2 Analyze record, or the record merges with an existing record in the Information Store.
- The record has a piece of provenance for all of the source information that contributed to the merged record.
- By default, the property values for the merged
i2 Analyze record are taken from the source
information associated with the provenance that
has the most recent value for the
source_last_updated
column.If only one piece of provenance for a record has a value for the
source_last_updated
column, the property values from the source information that is associated with that provenance are used. Otherwise, the property values to use are determined by the ascending order of the origin identifier keys that are associated with the record. The piece of provenance that is last in the order is chosen. To ensure data consistency, update your existing records with a value for thesource_last_updated
column before you start to use correlation, and continue to update the value.If the default behavior does not match the requirements of your deployment, you can change the method for defining property values for merged records. For more information, see Define how property values of merged records are calculated.
- If an existing record to be merged contained any notes, the notes are moved to the merged record.
- If an existing record to be merged was an
entity record at the end of any links, the links
are updated to reference the merged
record.Note: Any links that were created through Analyst's Notebook Premium are also updated to reference the merged record.
During ingestion, the number of merge
operations that occur is reported in the
merge_count
column of the
ingestion report.
The following diagrams demonstrate the merge operation.
OI.12
and one for the new data,
OI.22
.OI.32
and one for the new data,
OI.12
.