Hi Bridger,
Sorry for letting you wait.
> I'm able to create a database with the GUI from an XML document that
> contains an invalid character (U+0000) -- I guess BaseX does some character
> scrubbing, which is awesome :).
By default, XML documents with invalid characters should be rejected;
but if you turn on the internal parser in the parsing tab of the
Database Creation dialog, all invalid characters will be replaced with
FFFD. Maybe that’s what you have done?
> Here are my commands in standalone mode:
> BaseX 8.5.3 [Standalone]
> Try 'help' to get more information.
>> create db test-bad-char /usr/home/bridger/src/another-test.xml This should work:
> SET INTPARSE on
> CREATE DB ...
I have slightly extended our Wiki entry for the INTPARSE option [1];
hope this helps,
Christian
[1] http://docs.basex.org/wiki/Options#INTPARSE