Problem with name and address match
Posted: Wed Dec 23, 2009 7:40 am
Hi,
I am using the unduplicate stage to do the name(first, middle and last ) and address matching. I am getting 99% of the matches as per my requirement .
I have a problem when people who are residing in the same address and thier first name starts with the same initail. In this case quality stage considers them a match. In case we have only First initial of the one person then other persons name starts with the same initail then we like it to considered as match.
Example :
Jean Doe 123 Main Street , Warren, NJ, 09088
James Doe 123 Main Street , Warren, NJ, 09088
Quality stage considers them as match.
J Doe 123 Main Street , Warren, NJ, 09088
John Doe 123 Main Street , Warren, NJ, 09088
Quality stage considers them as match and we are fine with this result.
The column that is used in blocking is
CitynameNYSIIS_USAREA
ZIPCODE_USAREA
StreetName_NYSIIS_USAADDR
MatchPrimarywordNYSIIS_USNAME
MatchFirstnameNYSISS_USNAME
HouseNumber_USADDR
The coulmn that is used in Matching
HouseNumber_USADDR
StreetPrefixDirectional_USADDR
StreetPrefixtype_USADDR
StreetName_USADDR
StreetSuffixDirectional_USADDR
StreetSuffixtype_USADDR
UnitType_USADDR
UnitValue_USADDR
ZipCode_USAREA
Zip4AddonCode_USAREA
MatchFirstName_USNAME
MatchPrimaryName_USNAME
NameGeneration_USNAME
Thanks
jeesim
I am using the unduplicate stage to do the name(first, middle and last ) and address matching. I am getting 99% of the matches as per my requirement .
I have a problem when people who are residing in the same address and thier first name starts with the same initail. In this case quality stage considers them a match. In case we have only First initial of the one person then other persons name starts with the same initail then we like it to considered as match.
Example :
Jean Doe 123 Main Street , Warren, NJ, 09088
James Doe 123 Main Street , Warren, NJ, 09088
Quality stage considers them as match.
J Doe 123 Main Street , Warren, NJ, 09088
John Doe 123 Main Street , Warren, NJ, 09088
Quality stage considers them as match and we are fine with this result.
The column that is used in blocking is
CitynameNYSIIS_USAREA
ZIPCODE_USAREA
StreetName_NYSIIS_USAADDR
MatchPrimarywordNYSIIS_USNAME
MatchFirstnameNYSISS_USNAME
HouseNumber_USADDR
The coulmn that is used in Matching
HouseNumber_USADDR
StreetPrefixDirectional_USADDR
StreetPrefixtype_USADDR
StreetName_USADDR
StreetSuffixDirectional_USADDR
StreetSuffixtype_USADDR
UnitType_USADDR
UnitValue_USADDR
ZipCode_USAREA
Zip4AddonCode_USAREA
MatchFirstName_USNAME
MatchPrimaryName_USNAME
NameGeneration_USNAME
Thanks
jeesim