[jdom-interest] Character escaping

Malachi de AElfweald malachi at tremerechantry.com
Mon Mar 17 08:35:20 PST 2003


On Sun, 16 Mar 2003 13:49:54 -0700, Alex Rosen <arosen at novell.com> wrote:
> The second issue is with characters wider than 16 bits, which I understand
> only partially. (Elliotte, you'll have to help me out here.)  It seems that
> Java doesn't fully support this now, since there's a JSR to add support
> for them in JDK 1.5. Presumably this support will use surrogate pairs,
> where it takes two Java chars to represent these new Unicode characters.
> But CharsetEncoder in 1.4 seems to take this into account; its
> documentation talks about surrogate pairs. I guess this API was written
> with the future in mind, for when Java does fully support them?

You can convert to UTF-8, in which case it supports Unicode 3 characters in
the document's content.  If you want Unicode 3 characters in element and
attribute names, however, you have to upgrade to XML 1.1.
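
For illustration, here is a minimal sketch of the surrogate-pair behaviour
discussed in the quoted message. It assumes only the java.nio.charset API that
shipped with JDK 1.4; the class name SurrogateDemo and the helper toUtf8 are
made up for this example and are not part of JDOM. It encodes U+10400, a
character outside the 16-bit range, which Java holds as two chars and UTF-8
writes as four bytes.

```java
import java.nio.ByteBuffer;
import java.nio.CharBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.Charset;
import java.nio.charset.CharsetEncoder;

public class SurrogateDemo {

    // Hypothetical helper: encode a string as UTF-8 with a CharsetEncoder.
    // A malformed string (for example a lone surrogate) is reported as an
    // IllegalArgumentException instead of a checked exception.
    static byte[] toUtf8(String s) {
        CharsetEncoder encoder = Charset.forName("UTF-8").newEncoder();
        try {
            ByteBuffer buf = encoder.encode(CharBuffer.wrap(s));
            byte[] out = new byte[buf.remaining()];
            buf.get(out);
            return out;
        } catch (CharacterCodingException e) {
            throw new IllegalArgumentException("cannot encode: " + e);
        }
    }

    public static void main(String[] args) {
        // U+10400 written as its surrogate pair (high D801, low DC00).
        String s = "\uD801\uDC00";

        System.out.println(s.length());        // 2: two Java chars, one character
        System.out.println(toUtf8(s).length);  // 4: one four-byte UTF-8 sequence
    }
}
```

The encoder treats the pair as a single character, which is why the output is
one four-byte sequence and not two three-byte ones.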

Malachi 