[jdom-interest] Format problem?

Mike Kyle m_t_k_nospam at yahoo.co.uk
Wed Oct 22 08:56:27 PDT 2008


Because the formatter defaults to utf-8 if one doesn't specify it (which is what the XML document claims to be).

I'm certain that it's not the System console not showing it - the
characters in question would convert to 6 UTF-8 chars. The output spits
out 2 chars.

Cheers, Mark 

From: Michael Kay <mike at saxonica.com>
To: Mike Kyle <m_t_k_nospam at yahoo.co.uk>; jdom-interest at jdom.org
Sent: Wednesday, 22 October, 2008 13:47:42
Subject: RE: [jdom-interest] Format problem?

I think it's more likely that System.out is not displaying 
the Unicode string correctly - generally my experience is that the operating 
system console is not capable of handling full Unicode, though it no doubt 
depends on the operating system and its configuration.
I'm not sure why you would expect to see UTF-8 (as distinct 
from other representations of Unicode).
Michael Kay

 From: jdom-interest-bounces at jdom.org  [mailto:jdom-interest-bounces at jdom.org] On Behalf Of Mike  Kyle
Sent: 22 October 2008 11:35
To: jdom-interest at jdom.org
Subject: [jdom-interest] Format  problem?

The following code does NOT produce the UTF-8 that I had expected. As far  as I can tell the Text element only seems to work with ASCII text. I would  have expected it to work with non-ASCII text. Or am I doing something  dumb?

    private void jdomTest() throws  IOException
        Element element = new  Element("doc");
         element.addContent(new  Text("\u4E2D\u6587"));
        Document  document = new  Document(element);

         StringWriter out = new  StringWriter();
        Format f =  Format.getPrettyFormat();
        new  XMLOutputter(f).output(document,  out);
        System.out.println("XML:  "+out);

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.jdom.org/pipermail/jdom-interest/attachments/20081022/901c1f1f/attachment.htm

More information about the jdom-interest mailing list