Thank you Hilary for your quick reaction.
I'm a regular reader of this newsgroup and I frankly expected no less
from you...
I do have some comments about querying IS catalogs.
[quoted text, click to view] >> - Does anybody have experience on performance penalty of using
>> OPENQUERY to a remote machine?
> You could hit around a 3 orders of magnitude performance degradation using a
> linked server to a remote indexing services catalog as opposed to storing
> your data in your database and using SQL FTS. Your results may vary.
As I understand, you anticipate that my performance can suffer because
I don't save content of the files as image fields in the database?
That's good to know and can be understood. But, my dilemma was between
the 2 queries below:
SELECT *
FROM OPENQUERY(rISService,
'SELECT Path, Rank FROM SCOPE() WHERE CONTAINS(Contents, ''smth'')'
)
and
SELECT *
FROM OPENQUERY(rISService,
'SELECT Path, Rank FROM remoteMachine.catX..SCOPE() WHERE
CONTAINS(Contents, ''smth'')'
)
As you see, in the later query I instruct my SQL Server to query local
Indexing Service with the "remote" scope; the local IS recognizes
"remoteMachine.catX..SCOPE()" and distribute the query to
remoteMachine.
Since IS re-indexes the file system on idle, and my SQL Server's
machine is hardly ever "idle enough", I wanted to move the files and
the indexing of the files to a dedicated machine.
Ignoring the data transfer thru the LAN, I did not expect the
performance hit is significant. Or my scenario is not really complete?
[quoted text, click to view] >> - What is the advantage of "start_change_tracking" option over
>> "start_incremental"? (We do have a timestamp field per table) Does it
>> mean that "start_incremental" does not use some sort of change
>> tracking?
> Incremental means that every row is extracted and only changed/new rows are
> reindexed. Start change tracking kicks off a full population if none has
> been done, or an incremental population if a full population has been done
> and you have the time stamp column - which you do. After the incremental or
> full population is completed only changes to columns you are full text
> indexing will be extracted and indexed. You get much better performance and
> near real time indexing.
Thank you for clarifying this to me; I expected too much from the word
"incremental". "start_change_tracking" with update index in background
should indeed give me the desired performance.
Another trade-off is "one catalog" vs. "multiple catalogs", but this is
another subject and maybe another thread in this news group...
Thanks again.
Eric