1.3.6.8 Min/Max Profiler
The Max/Min Profiler examines the extremes of the data in each attribute, and returns:
-
The shortest value
-
the longest value
-
the 'lowest' value
-
the 'highest' value
Use the Max/Min Profiler to gain an initial understanding of your data. The Max/Min Profiler gives you a quick overview of whether or not your data conforms to its length and valid value restrictions, and allows you to find 'outliers'; that is, values that are clearly out of range, such as number amounts that are larger or smaller than expected, date values that are earlier or later than expected, or text values that consist only of invalid characters such as '#', or of data cheats such as 'aaa' or 'zzz'.
The following table describes the configuration options:
Configuration | Description |
---|---|
Inputs |
Specify any attributes in which you want to find Data Maxima and Minima. |
Options |
None. |
Outputs |
Describes any data attribute or flag attribute outputs. |
Data Attributes |
None. |
Flags |
The following flag is output:
|
The Max/Min Profiler requires a batch of records to produce useful statistics. It must therefore run to completion before its results are available, and is not suitable for a process that requires a real time response.
When executed against a batch of transactions from a real time data source, it will finish its processing when the commit point (transaction or time limit) configured on the Read Processor is reached.
The following table describes the statistics produced by the profiler for each attribute.
Statistic | Description |
---|---|
Minimum Length |
The number of characters of the shortest value in the attribute, in number of characters. |
Maximum Length |
The number of characters of the longest value in the attribute, in number of characters. |
Minimum Value |
The 'lowest' value in the attribute. For Number attributes, this is the lowest numeric value. For Date attributes, this is the earliest date. For Text attributes, this is the first value alphabetically. Note that Null values are ignored in this analysis, but other types of No Data (for example, values consisting only of spaces) are not. |
Maximum Value |
The 'highest' value in the attribute. For Number attributes, this is the highest numeric value. For Date attributes, this is the latest date. For Text attributes, this is the last value alphabetically. Note that Null values are ignored in this analysis, but other types of No Data (for example, values consisting only of spaces) are not. |
Clicking on the Additional Information button shows the number and percentage of records with the minimum length, maximum length, minimum value and maximum value, alongside the above statistics.
Example
In this example, the Max/Min Profiler examines all attributes in a table of Customer records:
Table 1-122 Max/Min Profiler
Input Field | Total number | Minimum Length | Maximum Length | Minimum Value | Maximum Value |
---|---|---|---|---|---|
CU_NO |
2010 |
2 |
6 |
10 |
875825 |
CU_ACCOUNT |
2010 |
7 |
12 |
00-0-XX |
OO-24282-LR |
TITLE |
2010 |
1 |
12 |
1 |
The Reverend |
NAME |
2010 |
4 |
29 |
# ADAMS |
aaaaaaaaa |
GENDER |
2010 |
1 |
1 |
1 |
M |
BUSINESS |
2010 |
2 |
41 |
Stoke Newington Town Hall |
e-sites.co.uk |
ADDRESS1 |
2010 |
1 |
50 |
(Brassfounders) LD, Coursington Road |
kjhkg |
ADDRESS2 |
2010 |
1 |
31 |
WARRINGTON |
jhgfhj |
ADDRESS3 |
2010 |
1 |
22 |
Aberdeen |
jhvgj |
POSTCODE |
2010 |
1 |
8 |
1P1 3HS |
gjhgj |
AREA_CODE |
2010 |
1 |
4 |
0 |
2920 |
TEL_NO |
2010 |
1 |
7 |
1 |
4227051 |
|
2010 |
1 |
50 |
5 |
zoe.peckham@btopenworld.com |
ACC_MGR |
2010 |
2 |
3 |
22 |
WH |
DT_PURCHASED |
2010 |
5 |
10 |
01/01/1995 |
Brian |
DT_ACC_OPEN |
2010 |
5 |
10 |
01/01/1995 |
Brian |
DT_LAST_PAYMENT |
2010 |
19 |
19 |
01-Jan-1970 00:00:00 |
21-Mar-2004 00:00:00 |
DT_LAST_PO_RAISED |
2010 |
19 |
19 |
01-Jan-1970 00:00:00 |
14-Feb-2004 00:00:00 |
BALANCE |
2010 |
1 |
10 |
-999999 |
410.5 |