1.3.10.16 Extract Values
The Extract Values processor extracts values, or parts of values, to a new attribute, where those values match a reference list.
The matching against the list may be done in one of five ways:
-
Whole Value
-
Starts With
-
Ends With
-
Contains
-
Delimiter Match
This affects the way that values are extracted. For example, if you want to extract Business Suffixes from a Company Name attribute, you may want to extract them only if the value ends with the value in the list.
Use Extract Values to create a new attribute containing a distinct part of an input attribute that you want to treat separately.
For example, if you have a Product_Description
attribute containing values that represent the units of a product (for example, PINTS, PNTS, PTS etc.) you may want to extract these values to a separate attribute.
The following table describes the configuration options:
Configuration | Description |
---|---|
Inputs |
Specify one or more String or String Array attributes from which you want to extract values that match a list. |
Options |
Specify the following value map options:
Specify the following match options:
|
Outputs |
Describes any data attribute or flag attribute outputs. |
Data Attributes |
The following data attributes are output:
|
Flags |
The following flags are output:
|
The following table describes the statistics produced by the profiler:
Statistic | Description |
---|---|
Extracted |
The number of records which matched the list, and so where an extraction was performed. |
Unextracted |
The number of records which did not match the list and so no extraction was performed. |
Output Filters
The following output filters are available:
-
Records that matched the list
-
Records that did not match the list
Example
In this example, Extract Values is used to extract the County value from an ADDRESS3
attribute which normally just contains the County, but in some cases contains both the County and other trailing information, such as a Postcode. In this case, the list is matched using a Starts With option, and the matching values extracted to an output attribute named County:
ADDRESS3.trimmed | County |
---|---|
Cheshire |
Cheshire |
Kent |
Kent |
Surrey, CB0 8YN |
Surrey |
Herts, AL1 3HL |
Herts |
Cambridgeshire |
Cambridgeshire |
Essex, SS2 5QN |
Essex |
London, WC2E 8JG |
London |