Hy Christian,
Currently i try the basex version 7.0.1. Now, the speed/performance isn't the problem (is very fast now :-) ) but i get an problem with my xml data and i think, it is an heisenbug.
In my xml file i have a structure with content (including entities) like this (in real with multiple articles):
<root> <article ID="xys"> <values> .... <value ..>.....</value> <value ..>.....</value> <value ID="abcdefg" refcontent="other article">&t;b>System requirements: </b></value> </values> </article> </root>
The problem: i import the same file multiple times and execute a query like for $n in //article[(@ID='Article' or @ID='Other')]/values return <pv><utid>{data($n/../@ID)}</utid>{$n}</pv>
Sometime, all is ok without any problem, but after the next import (same file) i get the error from java from my next processing step with sax/xpath: Error on line 115 of document : The entity "t" was referenced, but not declared. Nested exception: The entity "t" was referenced, but not declared. If looked into <pv>..</pv> part, which throws the exception i see, that the data is corrupt: basexserver returns: ..... <value ID="abcdefg" refcontent="other article">&t;b>System requirements: </b></value>.... (look at ">&t;" , there is a missing l from <) After the exception i looked into my basexserver with basexclient to check this data and see the correct data in the database. Also if i only query this specific article, i get the correct data.
I also started the basexserver with the apache commons xml resolver.jar in the classpath, but no effort.
What i'm doing wrong? Or is this a bug? Why this problem was never seen in the basex 6.7.1 version?
king regards Sven
On 27.10.2011 22:40, Christian GrĂ¼n wrote:
Hi Sven,
thanks for your mail. Usually, there shouldn't be any notable differences between Windows and Linux (only the Mac JVM is known to be slower). Before giving any further advices, it might be interesting to hear if the problems persist with the latest version of BaseX (Version 7.01) ?
Best, Christian ___________________________
On Thu, Oct 27, 2011 at 10:36 PM, Sven Regasven.rega@gmx.de wrote:
Hy,
i'm using the basexserver 6.7.1 on a windows (i7; 8GB; 2,8GHz) and on different linux machines (i7; 16GB, 3,4GHz). the same xml file (~2,5GByte) and one simple query has i big difference in speed between both machines and i don't know, where the bottleneck is coming from. on the windows machine i get througput of ~600 to 1000 xml entities per second. On the linux machine i only get max. 110 xml entities per second.
I use the same jvm setting on both machines: -server -Xmx3g I also use the index (text, path, attribtue) on both machines.
The query:
"declare function local:ProductValues() { for $n in //Product[(@ID='A' or @ID='D' or @ID='G')]/Values return <pv><utid>{data($n/../@ID)}</utid><id>{data($n/../@SecondID)}</id>{$n}</pv> }; local:ProductValues()"
I read from the basex server over lan with 1GBit.
Does anybody have a idea, what could be the reason for such speed difference between windows and linux?
king regards Sven _______________________________________________ BaseX-Talk mailing list BaseX-Talk@mailman.uni-konstanz.de https://mailman.uni-konstanz.de/mailman/listinfo/basex-talk