Whole name and salutation

Posted: Tue Sep 23, 2003 12:05 am
by zhaicaibing
Hi,

I have a salutation table that contains all the valid salutations, e.g.
Dr.
Dato Dr.
Tan Sri
Dato Dr. Tan Sri
YB
YB DATIN
Yang Berhormat

I have another table that contains people information, including the name in a column called Name and the name with salutation in a column called WholeName.

The Name column contains e.g.
Peter Goh
Anita Mui Yen Fong

The WholeName column contains e.g.
Tan Dri Peter Goh
YB Anita

I would like to find the salutations in the people table's WholeName column that cannot be found in the salutation table.

Please advise.

Posted: Wed Sep 24, 2003 4:50 am
by timwalsh
What tools do you have to work with?
DataStage?
INTEGRITY?

Partial String references a column in another table

Posted: Wed Sep 24, 2003 11:48 pm
by zhaicaibing
I am using Quality Manager.

Posted: Fri Sep 26, 2003 5:04 am
by timwalsh
I think that's going to be a little difficult with Quality Manager. That type of data profiling I would normally perform in INTEGRITY via pattern investigation.

The reason is that the "Salutation" has a varying length and a varying number of words within the "WholeName w/ Salutation" field. Therefore, Quality Manager cannot simply substring or parse out the value.

Option 1: Via Pattern Investigation
I would first classify each term as a salutation or a name, then parse the salutations out of the whole name, and then compare the parsed salutations to the list of salutations that you already have. You can then identify the salutations that are missing from your list.
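Outside of INTEGRITY, the classify-then-parse idea can be roughly sketched in Python. This is only an illustration under assumptions, not how a pattern investigation is actually configured: the function names are hypothetical, a word counts as a "salutation term" simply because it occurs in one of the known salutations, and matching is case-insensitive. A name that starts with a word that also occurs in a salutation (for example a surname "Tan") would be misclassified.

```python
# Known salutations, copied from the original post.
VALID_SALUTATIONS = [
    "Dr.", "Dato Dr.", "Tan Sri", "Dato Dr. Tan Sri",
    "YB", "YB DATIN", "Yang Berhormat",
]


def normalize(text):
    """Upper-case and collapse whitespace so comparisons ignore case and spacing."""
    return " ".join(text.upper().split())


VALID = {normalize(s) for s in VALID_SALUTATIONS}

# Assumption: any word that occurs in a known salutation is a salutation term.
SALUTATION_WORDS = {word for s in VALID for word in s.split()}


def parse_salutation(whole_name):
    """Return the leading run of tokens that are classified as salutation terms."""
    leading = []
    for token in normalize(whole_name).split():
        if token not in SALUTATION_WORDS:
            break  # first token classified as a name ends the salutation
        leading.append(token)
    return " ".join(leading)


def unknown_salutations(whole_names):
    """Return (whole_name, parsed_salutation) pairs whose salutation is not in the list."""
    result = []
    for whole_name in whole_names:
        salutation = parse_salutation(whole_name)
        # Rows with no leading salutation term are skipped rather than flagged.
        if salutation and salutation not in VALID:
            result.append((whole_name, salutation))
    return result
```

With the two WholeName values from the original post, "YB Anita" parses to a listed salutation, while "Tan Dri Peter Goh" parses only as far as "TAN" and is reported because "TAN" on its own is not a listed salutation.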

Option 2: Via Pattern Investigation
If you can identify names but not salutations, then I would use the above method, but parse out the names instead of the salutations.
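Since the people table also has a Name column, the names-first variant can be sketched in Python as well. Again this is a hedged illustration with hypothetical function names, not an INTEGRITY or Quality Manager feature. It assumes each WholeName is paired with the Name from the same row (the pairing of the example values below is an assumption), and that the salutation is everything in WholeName that comes before the first word of Name.

```python
# Known salutations, copied from the original post.
VALID_SALUTATIONS = {
    "Dr.", "Dato Dr.", "Tan Sri", "Dato Dr. Tan Sri",
    "YB", "YB DATIN", "Yang Berhormat",
}


def split_salutation(whole_name, name):
    """Return the part of whole_name before the first word of name.

    Returns None when the first word of name does not occur in whole_name,
    so the salutation cannot be separated with this simple rule.
    """
    whole_tokens = whole_name.split()
    name_tokens = name.split()
    if not name_tokens:
        return None
    first_name_word = name_tokens[0].upper()
    for index, token in enumerate(whole_tokens):
        if token.upper() == first_name_word:
            return " ".join(whole_tokens[:index])
    return None


def find_unlisted(rows):
    """rows is a list of (whole_name, name) pairs; return pairs with an unlisted salutation."""
    valid = {s.upper() for s in VALID_SALUTATIONS}
    unlisted = []
    for whole_name, name in rows:
        salutation = split_salutation(whole_name, name)
        # Skip rows with no salutation or where the name could not be located.
        if salutation and salutation.upper() not in valid:
            unlisted.append((whole_name, salutation))
    return unlisted
```

Because only the first word of Name is matched, a row such as WholeName "YB Anita" with Name "Anita Mui Yen Fong" still separates cleanly into the salutation "YB". Rows where the name cannot be located are skipped here and would need manual review.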

There are a few different methods that I can think of, but they all require pattern investigation.

Pattern investigation is time-consuming. I would be very interested in different methods that would accomplish the same thing!

Tim