[jdom-interest] JDOM Performance
Mike Brenner
mikeb at mitre.org
Fri Jun 6 09:13:12 PDT 2003
Kevin wrote:
> http://www.sosnoski.com/opensrc/xmlbench/index.html
> It compares Java XML processing models for DOM, JDOM, DOM4J and a number of
> XML api's. It compares performance in terms of memory utilization, parsing
> performance, etc. It's a great analysis.
> Only problem is it's a couple years old and discusses Jdom b7
After it is updated for 2003, and since JDOM is built upon a parser,
I guess we would not be concerned about the DOM and SAX numbers,
because they are parsers, not trees built upon parsers like JDOM is.
What is EXML? A web search shows that General Dynamics, Eiffel,
Electric XML, and possibly others have a product built on top
of an xml parser whose abbreviation is exml? Oh, I see where
the answer it -- it is Electric XML.
Since EXML is similar to JDOM (built upon a parser),
and because it was faster than JDOM at the time of the posted benchmark,
perhaps that is the benchmark to compete against for performance?
> ... I suspect JDOM allocates too many temporary objects triggering many GC runs. ...
But, to avoid the memory leakage in StringBuffer, it's not just JDOM that has to
allocate many temporary objects. Since the JDOM solution posted on this list
was to replace
return myStringBuffer.toString();
with
return new String(myStringBuffer.toString());
we ourselves now have to allocate "too many" temporary objects also.
In addition to this memory leakage, I have learned that there is the
same problem when you use substring -- it does not make a copy, but
keeps the longer string around.
[ Why do so many languages mess up their implementations of variable
length strings, the most important data structure in software?
Perhaps someone could write a variable length string package that
has the kinds of methods that are in String and StringBuffer along
with the methods that always have to be added to it in every project
like:
- string-to-number parsers with defaults
- printf
- pixel length of a string given the font
- html/xml/sql/unix/dos/etc. encodings for special characters and quotings
- dictionary (i.e. a HashMap of Strings indexed by Strings requiring no casts)
- xml-ifying an array of strings or a HashMap of Strings, etc. ]
More information about the jdom-interest
mailing list