Yeah, I get the same error using this command, too.
ThanksOn Thu, 27 Feb 2025 at 17:43, Christian Grün <christian.gruen@gmail.com> wrote:Just some quick feedback: Does it work if you specify the input along with CREATE DB?basex -c"CREATE DB taurus SPANYOLORSZÁG.xml"You can also specify a directory as input.Thanks,ChristianCsaba Fekete <feketecsaba@gmail.com> schrieb am Do., 27. Feb. 2025, 17:36:Hi ChristianSorry, I thought I was sending this to the mailing list. Thanks for answering anyway!Now I'm trying with a smaller dataset and I am adding the documents one by one. I also upgraded BaseX to the latest version.The largest document is 1151M in size and it can't be imported, even if I use attrindex and textindex.The file is actually publicly available: http://taurusreisen.hu/partner/v2/SPANYOLORSZAG.zipHere is my command and the output:/opt/basex/bin/basex -Oattrindex=true -Otextindex=true -v -V -c"OPEN taurus; ADD ./SPANYOLORSZÁG.xml"
Database 'taurus' was opened in 18.21 ms.
Out of Main Memory.I am thinking of solving the problem by splitting the file to several chunks, which will be CPU-demanding but could make it work.Any ideas are welcome.
Thank you again, and a million thanks for BaseX! It is a fantastic tool.
Regards,
CsabaOn Thu, 27 Feb 2025 at 15:52, Christian Grün <christian.gruen@gmail.com> wrote:Hi Csaba,It’s difficult to give a general advice; XML documents are just too different. In principle, a few GB or even MB can be sufficient to create databases for very large collections (10 GB and more), but sometimes namespaces are a showstopper. See [1] for some statistics.What’s the total size of your XML documents? Can you create the database if you enable the text and attribute index?Best,ChristianOn Tue, Feb 25, 2025 at 2:10 PM Csaba Fekete <feketecsaba@gmail.com> wrote:HiI have a web server that runs Basex 11.1. The server is a VPS with 18G of RAM.I have a directory of documents in various sizes, ranging from a few kilobytes up to 2G.I am trying to import these documents with the command
CREATE DB mydb /path/to/docsWith the default jvm max heap size (2GB) I get the error: Out of main memoryIf I raise the max heap size to 4GB, I get the same error.If I raise it to 8GB, the system becomes unresponsive.How can I determine how much system memory I need to be able to carry out this task?Thanks