all groups > sql server new users > december 2004 >
You're in the

sql server new users

group:

Identifying possible matches


Identifying possible matches Johnny
12/30/2004 10:03:02 AM
sql server new users: I have a customer with two companies. Both companies stock the same items
but not all of the item numbers are the same. They would like to have the
same item numbers in both company's and are looking for some help in
identifying the possible matches. For example, company A has item 12345 and
company B has the same item but calls it 98765. The description assigned to
both of them are different as well but very close. For example, item 12345
has a description of "ITEM-NUMBER" and item 98765 has a description of "ITEM
NUMBER". How can I create a sql statement that looks at company A's item
numbers as the baseline and provides a list of the closest matches of company
Re: Identifying possible matches Ross Presser
12/30/2004 1:33:45 PM
[quoted text, click to view]

Look at the levenshtein distance function
http://www.merriampark.com/ldtsql.htm

It returns zero for identical strings, and in general, smaller integers for
string pairs that are more similar, and larger integers for those that are
larger.

Another possibility is the soundex algorithm, which groups together words
that are pronounced alike. In my experience this is only really useful for
Re: Identifying possible matches Johnny
12/30/2004 7:55:02 PM
Ross,

I implemented the function and tested it. I need to get a copy of the
customer's data to test out the algorithm but it looks like this is the best
answer for the problem. Thanks a lot for the help.

Johnny

[quoted text, click to view]
AddThis Social Bookmark Button