Page 1 of 1

Posted: Wed Aug 31, 2011 6:29 pm
by ray.wurlod
When you define a blocking field in a match pass specification, you can specify whether that field is character or numeric. You have left the default (character) when you should have changed to numeric.

Posted: Wed Aug 31, 2011 6:33 pm
by stuartjvnorton
When you specify a field to block on, it gives you the option to block as a number or as characters. So to fix the error, make sure it's set to compare them as numbers.

You might also want to block on something a bit looser than the actual value.
The matching part can be "fuzzy", but the blocking part is exact.
Maybe try NYSIIS of some of the names, part of the DoB (you can use a transformer to create some loose keys of your own in advance, remember) or something similar.

Re: Match specification Rules

Posted: Thu Sep 01, 2011 10:56 am
by vijaydasari
Thanks for the replies .

I am able solve this issue by moving the date of birth column to Match command section.

now my block commands are MatchFirstNameNYSIIS_USNAME, PrimaryName_USNAME, CityName_USAREA, StateAbbreviation_USAREA & ZipCode_USAREA.

Match commands colmuns are MatchFirstName_USNAME, MatchPrimaryName_USNAME, ZipCode_USAREA, CityName_USAREA, StateAbbreviation_USAREA,EMAIL_ADDRESS,AddressType_USADDR & DATE_OF_BIRTH.

Match specification identified master and duplicate records , but weight of master record is 0.91 and duplicate record weight is 0.07.

How can I utilize Cutoff values and weight overrides section ?

Posted: Thu Sep 01, 2011 5:11 pm
by ray.wurlod
Sounds like you're in need of some training. Why not work through the QualityStage tutorial, which ships with the software and has its own manual?

Cutoffs determine thresholds at which the composite weights calculated for records indicate a match, a non-match, or a "too close to call".
Weight overrides allow you to bias the calculation of agreement or disagreement weights based on some external knowledge (for example that some particular field, while not particularly selective - few distinct values - is important to the business in some way). You also need to look at variable special handling along the same lines.