Sunday, February 19, 2012

Calculation of a degree of similarity of phrases

Hi!
i make The Extended stored procedure for MS SQL Server which
Calculation of a degree of similarity of phrases.

Purpose:
One of the most complex and important problems for the developer and the operator of a database is maintenance of uniqueness names in the most important references of system.
Offered function can be used in SQL inquiry as criterion of sorting of the directory according to similarity with a required phrase.

Features:
incredibly high speed of data processing
Unique algorithm analyzing similarity of phrases even at significant divergences in required phrases
is not required installation of additional libraries to each client - library DLL must benn installed only on a server.
Result is all the table sorted in decreasing order phonetic similarity (probably use of operator TOP for sample only the limited quantity of the most similar variants)
Use of user server function supposes use in Stored procedures, Views and any SQL expressions
Spends a minimum of server memory

-----------------------
If the decision of the given problem is interesting to you and there is an opportunity desire and an opportunity to test http://kozin1.narod.ru (http://kozin1.narod.ru/newsite/index.html?english.htm)
Dll and sample scripts in rar archive (2,5 KB)
I thank in advanceI'd be happy to take a look at your code, but I got nothing but junk from your download.|||It is offered to not analyse a code.
To estimate an acceptability of the given way for the decision of a specific target:

Speed of data processing ( length of phrase is 50 ) and average length of names in the table 30 - 50 symbols and 100 % to loading CPU:
2 000 - 10 000 records in a second

10000 records is clear time processing by code per second
7 sec per 10000 records for use in real view with use of UDF and sorting

all tested on low pefomance server (1proc *Intel PIII 900 MZ)

No comments:

Post a Comment