[jdom-interest] <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML Basi c 1.0//EN" "http://www.w3.org/TR/xhtml-basic/xhtml-basic10.dtd"> error?

philip.nelson at omniresources.com philip.nelson at omniresources.com
Mon Nov 19 23:39:50 PST 2001


> 
> This is the only thing that is in the document.  These pages 
> come from another web application that my company will not 
> allow to be modifed.
> 
> <!DOCTYPE html
> PUBLIC "-//W3C//DTD XHTML Basic 1.0//EN" 
> "http://www.w3.org/TR/xhtml-basic/xhtml-basic10.dtd">
> 
> It JTidy has no problem building a DOM object with these 
> files, but JDOM throws errors because of the DOCTYPE 
> declaration. I need to know how build a JDOM document that 
> contains this type of DOCTYPE.  I would be happy if I could 
> just ingore it.

Jtidy is outputing a DOM with this doctype, correct?  If you look at this
doctype, it is full of parameter entities which are then expanded in the
dtd.  I don't know this dtd but you can probably expect that there are parts
of the result document that don't match this dtd.  So, as far as I can tell
from skimming this thread, Jtidy saw only the original html document,
according to whatever options you set, output a dom with this doctype.  In
this document, somehow I don't fully understand, something is interpeted as
a PI instead of an attribute or element.  Since this is DOMBuilder, a bug is
not out of the question but what we really need I think is the xml version
of the DOM document Jtidy output.  

It could turn out to be a parser bug, a Jtidy bug, a DOMBuilder bug or
something I haven't thought of ;-)



More information about the jdom-interest mailing list