Matching MULT_UNCERT

Infosphere's Quality Product

Moderators: chulett, rschirm

Post Reply
dalecooper
Participant
Posts: 20
Joined: Tue May 13, 2008 1:24 am
Location: Dale Cooper

Matching MULT_UNCERT

Post by dalecooper »

Hi

Unduplicate match used.

I have the following blocking column:
MatchPrimaryWord1NYSIIS : PALY

And the follwing match command:
MULT_UNCERT on field MatField1
0.9
0.1
900

It matches the following records in the MatField1:
SUBRAYAN PILLAY
SOOBRAMONEY PILLAY

The output I need is for QS to only match records when the order might differ but the word should be the same. Example : JOHN DOE and DOE JOHN should be a match but not JOHNN DOE and DOE JOHN

What am I doing wrong?

Thanks
Dale
JRodriguez
Premium Member
Premium Member
Posts: 425
Joined: Sat Nov 19, 2005 9:26 am
Location: New York City
Contact:

Post by JRodriguez »

DaleCooper,

Take a look at your cut off value. Is the cut off value too low?

If the cut off value is properly set then, give it a try with a higher m probability, let's say 0.95 or 0.99

The column gets a higher penalty when disagreeing, the full disagreement weight will be assigned, resulting in a no match




Thanks
Julio Rodriguez
ETL Developer by choice

"Sure we have lots of reasons for being rude - But no excuses
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Try using a CHAR match. I suggest this because your requirement is that the words must be the same. An upstream Standardization will have placed main name and first name into the correct buckets, overcoming the order problem, and generated the NYSIIS value on which you're blocking.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
dalecooper
Participant
Posts: 20
Joined: Tue May 13, 2008 1:24 am
Location: Dale Cooper

Post by dalecooper »

Thank you for the feedback! I'm marking this as resolved.
Post Reply