Groups | Blog | Home
all groups > sql server data warehouse > february 2004 >

sql server data warehouse : Help-Name Standardization Problem


Paul Stanial
2/11/2004 5:31:30 PM
Is there an application out there that will allow us to cleanse/merge-purge
the name fields for a consumer as well as for a commercial data warehouse?

In other words, we want to take, for example, Walmart, Wal-Mart, Walmart
Corp, etc and boil it down to a single WalMart Corporation entity.

An example of the consumer side would be Jim Smith, James E. Smith, Jimmy
Smith, etc.


Thanks!


Ray Higdon
2/11/2004 10:37:11 PM
Welcome to the world of data scrubbing. There is no corporate lookup
function, you will have to do some string manipulation and parsing to clean
the data. If you can convince the users/business units to do this, the
better.

--
Ray Higdon MCSE, MCDBA, CCNA
---
[quoted text, click to view]

Paul Stanial
2/12/2004 2:12:10 PM
Thanks for the response. I guess I'm thinking that if we could go against a
table with all registered U.S. corporations (e.g. D&B?), then we could
relate the names in the records, along with their addresses and City to the
correct corporate entity and then aggregate the relevent fields from each
into the new "master record."

Thanks again...
[quoted text, click to view]

Ray Higdon
2/12/2004 7:30:28 PM
I don't think it exists or at least published anywhere, I could be wrong.

--
Ray Higdon MCSE, MCDBA, CCNA
---
[quoted text, click to view]

AddThis Social Bookmark Button