Hi
Unduplicate match used.
I have the following blocking column:
MatchPrimaryWord1NYSIIS : PALY
And the follwing match command:
MULT_UNCERT on field MatField1
0.9
0.1
900
It matches the following records in the MatField1:
SUBRAYAN PILLAY
SOOBRAMONEY PILLAY
The output I need is for QS to only match records when the order might differ but the word should be the same. Example : JOHN DOE and DOE JOHN should be a match but not JOHNN DOE and DOE JOHN
What am I doing wrong?
Thanks
Dale
Matching MULT_UNCERT
-
- Participant
- Posts: 20
- Joined: Tue May 13, 2008 1:24 am
- Location: Dale Cooper
-
- Premium Member
- Posts: 425
- Joined: Sat Nov 19, 2005 9:26 am
- Location: New York City
- Contact:
DaleCooper,
Take a look at your cut off value. Is the cut off value too low?
If the cut off value is properly set then, give it a try with a higher m probability, let's say 0.95 or 0.99
The column gets a higher penalty when disagreeing, the full disagreement weight will be assigned, resulting in a no match
Thanks
Take a look at your cut off value. Is the cut off value too low?
If the cut off value is properly set then, give it a try with a higher m probability, let's say 0.95 or 0.99
The column gets a higher penalty when disagreeing, the full disagreement weight will be assigned, resulting in a no match
Thanks
Julio Rodriguez
ETL Developer by choice
"Sure we have lots of reasons for being rude - But no excuses
ETL Developer by choice
"Sure we have lots of reasons for being rude - But no excuses
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
Try using a CHAR match. I suggest this because your requirement is that the words must be the same. An upstream Standardization will have placed main name and first name into the correct buckets, overcoming the order problem, and generated the NYSIIS value on which you're blocking.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
-
- Participant
- Posts: 20
- Joined: Tue May 13, 2008 1:24 am
- Location: Dale Cooper