1.3.3.10.3 Output Selector: Earliest Value
The Earliest Value output selector evaluates a date stamp on each record to select a value to use for another attribute.
Use the Earliest Value output selector where records have date stamps, and the best value for an attribute is likely to come from the record with the earliest stamp.
For example, data is sometimes carefully checked and cleaned on migration to a new system, but duplicate entries containing errors are created afterwards. In this case, the earlier records are likely to hold the better values.
The date stamp column must be selected as an input to the output selector, in addition to the attribute for which you want to determine a merged value.
The following table describes the configuration options:
Configuration | Description |
---|---|
Inputs | For the actual output value, any input attributes of any type from any input data sets may be configured. A Date attribute must be configured in order to establish which record should be considered the 'Earliest'. |
Options | Specify the following options: |
Example
In this example, the Earliest Value output selector is used to select the earliest value for a Company Name field, based on the date stamps of the records.
Example configuration
Use first non-empty value if tied? = No
Example output
The following table shows example output using the Earliest Value selector:
Table 1-102 Example Output Using Earliest Value Selector
Input A (CompanyName, Date) | Input B (CompanyName, Date) | Output value (Earliest Value) |
---|---|---|
Barclays Bank plc, 10/01/1998 | Barclays Bank (Bristol) PLC, 14/05/2002 | Barclays Bank plc |
PriceWaterhouse Coopers, 10/01/1998 | PWC, 24/03/2000 | PriceWaterhouse Coopers |
Oracle Limited, 24/03/2003 | Oracle, 24/03/2003 | Selection error (needs manual resolution) |
Oracle, null | Oracle, 24/01/1997 | Oracle |
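
The selection logic illustrated in the example output can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the product's implementation: the function `earliest_value`, the exception `SelectionError`, and the parameter `use_first_non_empty_if_tied` are hypothetical names chosen for illustration. It also assumes that records with a null date stamp are skipped, and that tied records carrying identical values do not count as a conflict; the source does not confirm either behavior.

```python
from datetime import date
from typing import List, Optional, Sequence, Tuple

# A record is a (value, date stamp) pair; either part may be missing.
Record = Tuple[Optional[str], Optional[date]]


class SelectionError(Exception):
    """Hypothetical error: tied earliest records disagree and need manual resolution."""


def earliest_value(
    records: Sequence[Record],
    use_first_non_empty_if_tied: bool = False,
) -> Optional[str]:
    """Return the value from the record with the earliest date stamp."""
    # Assumption: a record without a date stamp cannot be the earliest.
    dated = [(value, stamp) for value, stamp in records if stamp is not None]
    if not dated:
        return None  # Assumption: nothing to select when no record is dated.

    earliest = min(stamp for _, stamp in dated)

    # Collect the distinct values found on the earliest records, in input order.
    candidates: List[Optional[str]] = []
    for value, stamp in dated:
        if stamp == earliest and value not in candidates:
            candidates.append(value)

    if len(candidates) == 1:
        return candidates[0]

    if use_first_non_empty_if_tied:
        # Tie-break: take the first non-empty value among the tied records.
        for value in candidates:
            if value:
                return value
        return None

    raise SelectionError("Selection error (needs manual resolution)")


# Rows from the example output table (dates are DD/MM/YYYY in the table).
row1 = [("Barclays Bank plc", date(1998, 1, 10)),
        ("Barclays Bank (Bristol) PLC", date(2002, 5, 14))]
row3 = [("Oracle Limited", date(2003, 3, 24)), ("Oracle", date(2003, 3, 24))]
row4 = [("Oracle", None), ("Oracle", date(1997, 1, 24))]

print(earliest_value(row1))  # Barclays Bank plc
print(earliest_value(row4))  # Oracle
try:
    earliest_value(row3)     # tie with differing values, tie-break set to No
except SelectionError as err:
    print(err)               # Selection error (needs manual resolution)
```

With the tie-break enabled, `earliest_value(row3, use_first_non_empty_if_tied=True)` returns `Oracle Limited` in this sketch, because it is the first non-empty value among the tied records.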