[quoted text, click to view] On 5 Sep 2006 06:05:12 -0700, Pim75 wrote:
>Hello Hilary,
>
>Thanks for your reply!
>Indeed the search for PDP* returns the correct records.
>
>Do you know if it's possible to turn of the Dutch word breaker? Or has
>this any negative effects on performance?
>
>regards,
>Pim
Beste Pim,
Hillary asked me to have a look at this thread. I don't normally read
this group, since I have no experience with fulltext search - so please
take my answer with a huge pile of salt.
The issue here, if I understand the messages correctly, is that the
hyphen is considered to be part of a word, not a connector between two
seperate words. That is of course necessary to ensure that Dutch words
that include hyphens, such as kop-van-jut, kop-en-schotel, or
kop-hals-rompboerderij can be found. Unfortunately, this also means that
you won't get all the hits you want for the few words that, even under
new spelling rules, are still combined with a hyphen (such as
niet-roker, pianiste-componiste, zwart-Amerikaans, etc). And, as you
noted, hyphens in model numbers etc are also considered as part of a
single word.
If you use the neutral word breaker, you'll find the PDP-436 screen in
your search. But if you ever have to find a kop-van-jut, you'll probably
have to search for CONTAINS (..., '"kop" AND "van" AND "jut"').
Also, check out other uses of the word breaker. As I said, I have no
fulltext experience, but I do know that it also enables you to find
conjugations of words - for example, as I understand it, a search for
FORMSOF (INFLECTIONAL, vallen) should also match 'viel' {for the English
readers, "vallen" is "to fall" (infinitive), and "viel" is "fell" (past
tense, singular)}. If you also need searches like that, *and* if the
word breaker is used for this as well (ask Hillary - he knows), then you
definitely don't want to switch to a different word breaker.
Met vriendelijke groeten,
--