RoboHELP Tip - December 2000

* Tip published on eHelp website.

This month: converting large Microsoft Word documents to HTML.  We've explored this subject before, but it's worth looking into again since I've discovered a workaround for converting long Word "DOC" files into HTML topics.

This tip is useful for converting Word documents (any version) to HTML topics and is especially if you've been frustrated by Word 2000's use of XML. This technique removes the extraneous XML code produced by Word 2000 out of your topics automatically, saving you a lot of time.

For those unfamiliar with the problem, Microsoft Word 2000 uses eXtensible Markup Language (XML) in Word 2000 documents. When you attempt to import a DOC file into RoboHELP HTML, the XML tags can cause older versions of RoboHTML to literally "choke" without the use of Microsoft's HTML Filter 2.0. RoboHELP 9 has been re-engineered to allow DOC files to be converted to HTML files, but it does not filter the XML from the HTML. The resulting HTML becomes ladened with XML and embedded style sheets.  The result is a loss of formatting control of your documents, which makes it nearly impossible to control the appearance of your topics  On the surface the document looks and works fine, but the TrueCode reveals hundreds of lines XML tags and embedded style elements.

Here's a list of solutions to the Word 2000 DOC to HTML dilemma:

First, let's look at third party conversion utilities