all groups > sql server full text search > december 2003 >
You're in the

sql server full text search

group:

On the fly conversion of word documents to html documents using SQL or something



On the fly conversion of word documents to html documents using SQL or something sharmauiuc NO[at]SPAM yahoo.com
12/7/2003 10:08:41 AM
sql server full text search: Hi,

My platform is SQL2000 Server and Windows 2000 Server. I am using JSP
to retrieve my documents. My FTS appears to be working fine. I am
storing
Word documents (.doc) extension on image column.

However, I would like to retrieve the word documents content as html
document. Is there a way to retrieve image data and perform on the fly
html conversion using some SQL functions?

Alternatively, I can create an extra image column and store the
converted WORD
documents into the extra column. I could then retrieve this image
column to display the contents of the word document. However, I would
like this to be
performed using store procedure that will convert word document to
html document automatically.

Any pointers to perform this work will be highly appreciated.

Thanks and regards,

Re: On the fly conversion of word documents to html documents using SQL or something John Kane
12/7/2003 1:07:58 PM
Sanjay,
Interesting question/request... What version of MS Word (2000, XP or 2003)
will you be using? If you're using XP or 2003, you can always save the
documents as HTML files and then upload those files into a TEXT column for
easier retrieval. See the "SaveAs Method" at
http://msdn.microsoft.com/library/default.asp?url=/library/en-us/vbawd10/html/womthSaveAs.asp
for more info on this.

I don't think you can read directly from the IMAGE column unless you want to
read byte-by-byte and do the conversion yourself. You might also look at the
newly released XML Schema for the MS Office 2003 file versions that
Microsoft has made public at
http://www.microsoft.com/downloads/details.aspx?FamilyID=fe118952-3547-420a-a412-00a2662442d9&DisplayLang=en

Regards,
John



[quoted text, click to view]

Re: On the fly conversion of word documents to html documents using SQL or something sharmauiuc NO[at]SPAM yahoo.com
12/8/2003 12:48:36 PM
Thanks for a quick reply. Unfortunately, I am storing word 97, 2000
as well XP versions.


[quoted text, click to view]
Re: On the fly conversion of word documents to html documents using SQL or something John Kane
12/8/2003 1:16:21 PM
You're welcome, Sanjay,
I'm not even sure if MS Word 97 and 2000 version can be saved as HTML as I
do not have those versions to test. You should consider 3rd party software
that can convert these older versions of MS Word documents to HTML. A quick
search using Google, turned up the following links:
http://www.w3.org/Tools/Word_proc_filters.html#MSWord
http://www.solutionsoft.com/w2w.htm
http://www.softinterface.com/Convert-File-Programs/Convert-File-Program.HTM
http://www.acii.com/fmn.htm
Note, the last software does "lets you convert to accurately formatted
Microsoft Word 97, 2000 or XP documents."
The Google search I use was:
http://www.google.com/search?hl=en&lr=&ie=UTF-8&oe=UTF-8&c2coff=1&q=%22microsoft+word+97%22++%2BHTML+%2Bconversion+%2Bsoftware

Regards,
John



[quoted text, click to view]

AddThis Social Bookmark Button