dotNetCoder,
As you're using UTF8 encoding with Arabic text, you might want to checkout
the new
"Microsoft Arabic Word-Breaker (Arabic Search Engine) - Beta" at
http://www.microsoft.com/middleeast/arabicdev/beta/search/ and download the
Installation Guide and the Microsoft Arabic Word-Breaker. Under the
"Installation Requirements for SQL Server 2000" section, you will find that
the Arabic Word-Breaker is "...the Full-Text Search service is enabled for
running queries using the new Arabic Word-Breaker."
Please, download it and let this newsgroup know if you find it effective in
resolving your FTS issues with Arabic HTML files, although, you most likely
will have to include the meta name="MS.Locale" content="AR">" (Arabic) html
tags.
You might find the following material useful - In addition to the BOL
documentation, there is now on MSDN - "Arabic
Language Support in Microsoft SQL Server 2000" at
http://msdn.microsoft.com/library/default.asp?url=/library/en-us/dnsql2k/html/sql_arabicsupport.asp
that might also be helpful to you.
Additionally, and assuming that the HTML documents that you are storing in
your IMAGE column PageText are in Arabic, could you confirm that all of the
HTML files have the correct language identifiers? Specifically, confirm if
there are any <meta> tags within the <header> tags and that they are
"<head><meta name='ms.locale' content='EN-US'></head>" vs. "<meta
name="MS.Locale" content="AR">" (the latter is Arabic). For more info see
the "HTML Filter" at
http://msdn.microsoft.com/library/default.asp?url=/library/en-us/indexsrv/html/ixufilt_2uuq.asp
as well as
http://www.otal.umd.edu/uupractice/non_english/
Thanks,
John
--
SQL Full Text Search Blog
http://spaces.msn.com/members/jtkane/ [quoted text, click to view] "dotNetCoder" <dotNetCoder@devdex.com> wrote in message
news:#anw3vtJFHA.2764@tk2msftngp13.phx.gbl...
> Hello.
> I'm trying to create a search engine for my Web Site using Indexing
> Service. The content is stored in text files in UTF8 Encoding (arabic
> text).
> The search utility works well on my local server but it throws an
> exception online: the query return only ignored words.
> Every file contains the following statement in its header:
> <meta http-equiv="Content-Type" content="text/html; charset=utf-8">.
> I will be thankful for your help .
>
>
>
> *** Sent via Developersdex
http://www.developersdex.com ***
> Don't just participate in USENET...get rewarded for it!