http://www.ins.cwi.nl/projects/xmark/Assets/xmlquery.txt Since there was not enough time to run them all, I chose queries 1, 3, 6, 8, 13 and 14 as a representative subset.
Should be fine. XMark queries 8-12 are the most expensive ones, and the MonetDB guys have put a lot of effort into optimizing these queries via loop lifting. BaseX uses its index structures to speed up queries 8-10; this is slower than MonetDB's approach, but it keeps memory usage constant.
In addition, I tried to run the queries on a 1 GB instance, and BaseX was the only engine that ran all of them successfully. For both eXist and MonetDB, my 3 GB of RAM was not enough to open such a large instance. Do you think I should report that? (I am still not sure that I configured MonetDB in the best way possible.)
That's worth mentioning; you can stress that you've run all database engines with their default settings. Don't forget to specify exactly which versions you've used for testing, so that the results are easier to reproduce.
It might be interesting to swap the order in which you're running the queries. In Java, a query might run faster if it's executed at the end of the test, because JIT compilation slows down the first runs; garbage collection might influence runtimes as well, etc. If you don't restart the database server every time you run new queries, the differences should be less significant.
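A minimal sketch of how the ordering effect could be checked, assuming a hypothetical run_query function that sends one XMark query to the engine under test (here it is only a placeholder, not a real client call). It shuffles the query order in every round, discards the first round as warm-up, and reports the median time per query:

```python
import random
import statistics
import time


def benchmark(query_ids, run_query, rounds=5, warmup=1, seed=42):
    """Time each query over several rounds, shuffling the order per round."""
    rng = random.Random(seed)  # fixed seed, so the orders are reproducible
    timings = {qid: [] for qid in query_ids}
    for round_no in range(warmup + rounds):
        order = list(query_ids)
        rng.shuffle(order)  # a different execution order in every round
        for qid in order:
            start = time.perf_counter()
            run_query(qid)
            elapsed = time.perf_counter() - start
            if round_no >= warmup:  # drop warm-up rounds (JIT, caches)
                timings[qid].append(elapsed)
    return {qid: statistics.median(times) for qid, times in timings.items()}


if __name__ == "__main__":
    def run_query(qid):
        # Hypothetical placeholder: replace with a real call to the
        # database engine you are testing.
        time.sleep(0.001 * qid)

    results = benchmark([1, 3, 6, 8, 13, 14], run_query)
    for qid, seconds in sorted(results.items()):
        print(f"XMark Q{qid}: {seconds * 1000:.2f} ms (median)")
```

The numbers of rounds and warm-up rounds are arbitrary choices here; if the medians still depend on the order, more warm-up rounds may be needed.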
If you want to review the benchmark results before the final version (maybe you will spot an anomaly, or some query will need to be rerun), just ask and I will send them to you as soon as possible.
Feel free to send me a preview.
Christian