[jdom-interest] Questions regarding implementation of DocTyp e.internalSubset[eg]

philip.nelson at omniresources.com philip.nelson at omniresources.com
Wed Jun 13 14:03:12 PDT 2001


> > * what to do (if anything) about character entities in the 
> source doc like
> >   <!ENTITY Ouml '&#214;'>
> > 
> > The parser turns this into a String from the parsed entity 
> and that is what
> > gets output.
> 
> Try to create a string as close to the original as possible.

I think all the other questions have been addressed and I agree with the
conclusions from Jason and Harry.  This one is not so easy.  The problem is,
we have no way of knowing whether the original data came from a character
entity or not.  The parser doesn't report this in any way I could find. we
just get a Unicode string. 

Harry, off the top of my head, I have modified DocType, XMLOutputter,
SAXBuilder and SAXHandler.  None of this is in CVS.  I'll send you what I
have tonight and if you could do the same, we may save a few steps.  My code
seemed to work pretty well except for the issues I have brought up. 

Could you send yours?



More information about the jdom-interest mailing list