Groups | Blog | Home
all groups > sql server full text search > january 2007 >

sql server full text search : General Search Strategy Advice


Hilary Cotter
1/22/2007 2:16:08 PM
Ideally they will all be stored in the database or the file system. It
really depends on whether you are joining them against relational tables
whether you should store them in the database or not. Indexing Services
isn't as scalable as SQL FTS, the boundary occurs somewhere between 1
million and 3 million documents.

--
Hilary Cotter

Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html

Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com



[quoted text, click to view]

Larry Neylon
1/22/2007 5:30:06 PM
Hi there,

I'm looking for some advice on the best methods / tools to implement an
intranet search facility using ASP.NET and SQL2005.

I need the ability to search both SQL Server content and files stored on the
web servers file system (.doc, .xls, .pdf, etc).

Is there one tool that will perform both of these tasks for me, or am I
going to have to use a combination of fulltext search on the DB and Index
Server ?

I know I can store the files in the DB and use and IFilter to search the
contents, but if possible I'd want to keep the files in the file system
rather than move them all into the DB.

Any pointers would be most appreciated.

Regards,
Larry.

xkeops NO[at]SPAM gmail.com
1/22/2007 7:29:58 PM
Hi Larry,

You may try doing:
1. Using the index server on one hand as filesystem indexing tool and
read the result from SQLServer. To do this you must create a Linked
server (from your sql server) pointing to your indexing server.
And read the data from FileSystem with OPENQUERY (with select ....
sqlserver style)
2. Have a table with a varbinary column where you can upload the files
and index their content / based on the extension.


HPH,

xk



[quoted text, click to view]
Hilary Cotter
1/23/2007 7:35:27 AM
Using a linked server to Indexing Services (Index Server was an NT 4.0
Product) is simply not scalable.

--
Hilary Cotter

Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html

Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com



[quoted text, click to view]

Hilary Cotter
1/23/2007 1:02:53 PM
Definitely, especially if you are querying them relationally. However,
returning them from the file system especially through a web server which is
optimized for returning documents offers better performance than returning
them through the db.

Also if you store up to 100 documents per subdirectory and nest all
subdirectories in groups of 100 you get optimal file system performance.

--
Hilary Cotter

Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html

Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com



[quoted text, click to view]

Larry Neylon
1/23/2007 4:24:51 PM
Thanks for that Hilary,

Do you think actually storing the documents in SQL FTS would provide the
best performance ?

Are there any 3rd party tools that anyone knows about ?

Regards,
Larry.
[quoted text, click to view]

Larry Neylon
1/24/2007 12:16:19 PM
Thanks Hilary,

There's enough there for me to be working on for now.

Regards,
Larry.

[quoted text, click to view]

Simon Sabin
1/25/2007 2:02:32 PM
Hello Larry,

I breifly looked at SQL Turbo last year and was impressed. It allows the
filename to be stored in the DB rather than the whole document.

It could be worth a try.

Simon Sabin
SQL Server MVP
http://sqlblogcasts.com/blogs/simons


[quoted text, click to view]

AddThis Social Bookmark Button