[jdom-interest] escape characters within a element text

River river at verat.net
Wed Oct 1 11:08:40 PDT 2003


Well, first about parsers, and option you want. Answer is very simple:
if parsers had that kind of option, witch disables "well formed"
property of document, than the document you are parsing is not any more
caled xml. That is since xml is very strict regarding format of the
document. This is not advanced theory, or philosophy of parser makers,
it is simply standard. 

And something about your problem: If string as you say, comes from
external application, and that app declares that that given string is
xml document (or fragment), external application have serious bug. So if
you can pass the problem as bug to app's manufacturer. 

And one tought for end: If you know that illegal chars ocures only in
specific elements of document you can try to wrap that string in
InputStream, and insert CDATA markers after start and just before end of
offending elements. This way you will avoid unnecessary string
manipulation step before parsing document, since insertion will occure
"while parsing".

Mit freundlichen Grüßen,
Best Regards,
River

-----Original Message-----
From: Niroshan [mailto:niroshan at markus.no] 
Sent: Wednesday, October 01, 2003 3:36 PM
To: River
Subject: RE: [jdom-interest] escape characters within a element text

hi,
	Well the problem i have is that the xml string that i mentioned,
comes out from another application which i have no control. In reality
the xml is larger than what i sent, so to wrap 	up with CDATA on my side
i would have to perform a string manupulation on my side. What i am
wondering is whether JDOM or any other xml parser can be told to treat
all the 	text within elements as unparsed(CDATA) or to say to the
parser using XPAth perhaps, to treat a particular nodes(elements) text
as CDATA.

	Wonder why JDOM/any other does not have that option, in the
SAXBuilder or some where.( dont know really the advanced theory behind
not having such a thing). 
	hope to hear from u on this issue.

regards  and thanks 
niroshan



-----Original Message-----
From: River [mailto:river at verat.net]
Sent: 01 October 2003 02:33
To: jdom-interest at jdom.org
Subject: RE: [jdom-interest] escape characters within a element text


You can try to wrap it up inside CDATA sections, ex:

<person>
  <name>jack</name>
  <description><![CDATA[height of jack is < 6 feet]]></description>
</person>

Mit freundlichen Grüßen,
Best Regards,
River


-----Original Message-----
From: jdom-interest-admin at jdom.org [mailto:jdom-interest-admin at jdom.org]
On Behalf Of Niroshan
Sent: Tuesday, September 30, 2003 3:31 PM
To: jdom-interest at jdom.org
Subject: [jdom-interest] escape characters within a element text

hi
	if i have a string which is a xml, but contains invalid
characters within the elements text, how can i create a xml document out
of it. 
	If the string is as below

	<person>
		<name>jack </name>
		<description> height of jack is < 6 feet </description>
	</person>

	here the description text contains the character '<', so if we
try to create a dom document using the SAXBuilder it gives an exception.
Is there a way in JDOM to say replace those characters with the required
escape characters?

	before creating the dom document?

regards
niroshan


	
_______________________________________________
To control your jdom-interest membership:
http://lists.denveronline.net/mailman/options/jdom-interest/youraddr@you
rhost.com

_______________________________________________
To control your jdom-interest membership:
http://lists.denveronline.net/mailman/options/jdom-interest/youraddr@you
rhost.com




More information about the jdom-interest mailing list