How does the source volume affects the frequency file generation process and in turn how it affects the matching process?
Match frequency file is generated for 100m records and the source data is having some 200M.
MatchFrequency file Vs. the weightage
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
Re: MatchFrequency file Vs. the weightage
Hi,
MatchFrequency file describe how often a value appears in source column.
If a name appears 100 times in a column. MatchFrequency file output for that name will be a single row with number of occurance and statistical weight.
So 200M records can generate lesser output records.
In Matching process , less weight will be given to the value which occured many times and more weight will be given to the value which occured few times.
Regards
MatchFrequency file describe how often a value appears in source column.
If a name appears 100 times in a column. MatchFrequency file output for that name will be a single row with number of occurance and statistical weight.
So 200M records can generate lesser output records.
In Matching process , less weight will be given to the value which occured many times and more weight will be given to the value which occured few times.
Regards
vimalvik wrote:How does the source volume affects the frequency file generation process and in turn how it affects the matching process?
Match frequency file is generated for 100m records and the source data is having some 200M.
vairamuthu