Product Description
-
Matching between both the standardized and abbreviated input.
-
Matching between descriptions relating to the same product where the words are out of order.
-
Matching between descriptions where one description is a shorter description of the same product (higher priority given to those short descriptions where the “short” description contains more words, to avoid matches based on very sparse descriptions).
-
Matching between descriptions where there are character differences, but possible ID values or quantities all match are given a higher weighting (to avoid matches between similar product descriptions that have different sizes).
-
Looser matches that allow for typo matches on any token, including those that may be IDs or quantities.
Result | Score | Category | Comparisons |
---|---|---|---|
P001 Product description exact |
100 |
Exact |
Product desc exact = true |
P002 Product description stand all words |
90 |
Fuzzy |
Product descr stand WMP = 100 |
P003 Product description stand all words out of order |
85 |
Fuzzy |
Product descr stand WMP OOO= 100 |
P004 Product description stand all words relating to shorter, >= 4 matching words, WMP > 70 |
84 |
Fuzzy |
Product descr stand WMP shorter = 100, Product descr stand WMC >= 4, Product descr stand WMP OOO >= 70 |
P005 Product description stand all words relating to shorter out of order, >= 4 matching words, WMP > 70 |
82 |
Fuzzy |
Product descr stand WMP shorter OOO= 100 Product desc WMC OO >= 4 Product descr stand WMP OOO >= 70 |
P006 Product description stand all words relating to shorter, >= 4 matching words |
80 |
Fuzzy |
Product descr stand WMP shorter = 100 Product WMC OOO >= 4 |
P007 Product description stand all words relating to shorter out of order, >= 4 matching words |
79 |
Fuzzy |
Product descr stand WMP shorter OOO = 100 Product descr WMP OOO >= 4 |
P008 Product description stand all words relating to shorter, >= 2 matching words |
77 |
Fuzzy |
Product descr stand WMP shorter = 100 Product descr WMC OOO >= 2 |
P009 Product description stand all words relating to shorter out of order, >= 2 matching words |
75 |
Fuzzy |
Product descr stand WMP shorter OOO = 100 Product descr WMC OOO >=2 |
P010 Product description abbr - non number words 1 typo, number words exact |
70 |
Fuzzy |
Product descr abbr non number words CED <= 1 Product descr abbr number words exact = true |
P011 Product description abbr - all words out of order, number words exact, non number words typos |
68 |
Fuzzy |
Product descr abbr has multi tokens = true product descr abbr non number words WMP OO tolerant = 100 Product descr abbr number words WMP OOO = 100 |
P012 Product description abbr - all words out of order, number words no data, non number words typos |
67 |
Fuzzy |
Product descr abbr has multitokens = true product descr abbr non number words WMP OO tolerant = 100 Product descr abbr number words WMP OOO = no data |
P013 Product description abbr - all words out of order, number words exact, non number words shortened exact |
66 |
Fuzzy |
Product descr abbr has multitokens = true Product descr abbr shortened WMP OOO = 100 |
P014 Product description abbr - all words out of order relating to shorter, number words exact, non number words typos >= 4 matching words WMP> 70 |
65 |
Fuzzy |
Product descr abbr WMP shorter OOO tolerant Product descr abbr has multitoken = true Product descr abbr WMP OOO Tolerant >= 70 Product descr abbr WMC abbr tolerant >= 4 Product descr abbr non number words WMP shorter OOO tolerant = 100 Product descr abbr number words WMP shorter OOO = 100 |
P015 Product description abbr - all words out of order relating to shorter, number words no data, non number words typos >= 4 matching words WMP> 70 |
64 |
Fuzzy |
Product descr abbr WMP shorter OOO tolerant Product descr abbr has multitoken = true Product descr abbr WMP OOO Tolerant >= 70 Product descr abbr WMC abbr tolerant >= 4 Product descr abbr non number words WMP shorter OOO tolerant = 100 Product descr abbr number words WMP shorter OOO = no data |
P016 Product description abbr - all words out of order relating to shorter, number words exact, non number words typos >= 4 matching words |
63 |
Fuzzy |
Product descr abbr WMP shorter OOO tolerant Product descr abbr has multitoken = true Product descr abbr WMC abbr tolerant >= 4 Product descr abbr non number words WMP shorter OOO tolerant = 100 Product descr abbr number words WMP shorter OOO = 100 |
P017 Product description abbr - all words out of order relating to shorter, number words no data, non number words typos >= 4 matching words |
62 |
Fuzzy |
Product descr abbr WMP shorter OOO tolerant Product descr abbr has multitoken = true Product descr abbr WMC abbr tolerant >= 4 Product descr abbr non number words WMP shorter OOO tolerant = 100 Product descr abbr number words WMP shorter OOO = no data |
P018 Product description abbr - all words out of order relating to shorter, number words exact, non number words typos |
61 |
Fuzzy |
Product descr abbr WMP shorter OOO tolerant Product descr abbr has multitoken = true Product descr abbr non number words WMP shorter OOO tolerant = 100 Product descr abbr number words WMP shorter OOO = 100 |
P019 Product description abbr - all words out of order relating to shorter, number words no data, non number words typos |
60 |
Fuzzy |
Product descr abbr WMP shorter OOO tolerant Product descr abbr has multitoken = true Product descr abbr non number words WMP shorter OOO tolerant = 100 Product descr abbr number words WMP shorter OOO = no data |
P020 Product description abbr - all words out of order relating to shorter, number words exact, shortened non-number words exact |
59 |
Fuzzy |
Product descr abbr has multitokens = true Product descr abbr shortened WMP OOO relating to shorter = 100 |
P021 Product description stand all words relating to shorter, one record only has one word |
58 |
Fuzzy |
Product descr stand WMP shorter OOO |
P022 Product description stand one typo |
55 |
Fuzzy |
Product descr stand CED <= 1 |
P023 Product description stand two typos |
50 |
Fuzzy |
Product descr stand CED <= 2 |
P024 Product description stand all words out of order, typos |
40 |
Fuzzy |
Product descr stand WMP OOO tolerant = 100 |
P025 Product description stand all words out of order, relating to shorter, typos |
30 |
Fuzzy |
Product descr stand WMP shorter OOO tolerant = 100 |
P026 Product description abbr CMP > 90, all number words |
25 |
Fuzzy |
Product descr abbr CMP >= 90 Product descr abbr number words WMP shorter OOO = 100 |
P027 Product description abbr CMP > 90 , number words no data |
25 |
Fuzzy |
Product descr abbr CMP >= 90 Product descr abbr number words WMP shorter OOO = no data |
P028 Product description stand LCSP 90 relating to shorter, all number words |
23 |
Fuzzy |
Product descr stand Longest Common Substring Percentage >= 90 Product descr abbr number words WMP shorter OOO = 100 |
P029 Product description stand LCSP 90 relating to shorter, number words no data |
23 |
Fuzzy |
Product descr stand Longest Common Substring Percentage >= 90 Product descr abbr number words WMP shorter OOO = no data |
P030 Product description abbr CMP > 80, all number words |
22 |
Fuzzy |
Product descr abbr CMP >= 80 Product descr abbr number words WMP shorter OOO = 100 |
P031 Product description abbr CMP > 80, number words no data |
22 |
Fuzzy |
Product descr abbr CMP >= 80 Product descr abbr number words WMP shorter OOO = no data |
P032 Product description stand LCSP 80 relating to shorter, all number words |
21 |
Fuzzy |
Product descr stand Longest Common Substring Percentage >= 80 Product descr abbr number words WMP shorter OOO = 100 |
P033 Product description stand LCSP 80 relating to shorter, number words no data |
21 |
Fuzzy |
Product descr stand Longest Common Substring Percentage >= 80 Product descr abbr number words WMP shorter OOO = no data |
P034 Product description abbr CMP > 70, all number words |
20 |
Fuzzy |
Product descr abbr CMP >= 70 Product descr abbr number words WMP shorter OOO = 100 |
P035 Product description abbr CMP > 70, number words no data |
20 |
Fuzzy |
Product descr abbr CMP >= 70 Product descr abbr number words WMP shorter OOO = no data |
P036 Product description stand LCSP 70 relating to shorter, all number words |
19 |
Fuzzy |
Product descr stand Longest Common Substring Percentage >= 70 Product descr abbr number words WMP shorter OOO = 100 |
P037 Product description stand LCSP 70 relating to shorter, number words no data |
19 |
Fuzzy |
Product descr stand Longest Common Substring Percentage >= 70 Product descr abbr number words WMP shorter OOO = no data |
P038 Product description abbr CMP > 90 |
18 |
Fuzzy |
Product descr abbr CMP >= 90 |
P039 Product description stand LCSP 90 relating to shorter |
18 |
Fuzzy |
Product descr stand Longest Common Substring Percentage >= 90 |
P040 Product description abbr CMP > 80 |
16 |
Fuzzy |
Product descr abbr CMP >= 80 |
P041 Product description stand LCSP 80 relating to shorter |
16 |
Fuzzy |
|
P042 Product description no data |
0 |
No data |
Product descr stand exact = no data |
P043 Product description conflict |
-3 |
|
* |