We have set up a system in which we have about 17 million BaseX
databases, but my operating system does not allow 17 million
subdirectories in one directory ( the one set in the .basex file). In
order to resolve this, we had to set up four (4) BaseX servers on four
different machines so we could resolve this. Depending on the name of
the BaseX database, we know on which BaseX server we have to look.
As this is a work around, it would be nice to have either the
possibility to have a hierarchy of databases, so we can store them into
more manageable directories, or to allow to run more than one BaseX
server onto a single machine (for instance, listening on different
ports).
For more info on the reasons why we did this: Vandeghinste and
Augustinus (2014). Making Large Treebanks Searchable. The SONAR case. In
Marc Kupietz, Hanno Biber, Harald Lüngen, Piotr Bański, Evelyn
Breiteneder, Karlheinz Mörth, Andreas Witt, Jani Takhsha (eds.),
Proceedings of the 2nd workshop on Challenges in the management of large
corpora (CMLC-2) at the Ninth International Conference on Language
Resources and Evaluation (LREC). Reykjavik, Iceland. pp. 15-20.
Is there another way to solve this?
thanks,
v.