Hil Christian,

On Fri, Sep 30, 2016 at 8:10 AM, Christian Grün <christian.gruen@gmail.com> wrote:
Hi Bridger,

Sorry for letting you wait.

No trouble at all. 

> I'm able to create a database with the GUI from an XML document that
> contains an invalid character (U+0000) -- I guess BaseX does some character
> scrubbing, which is awesome :).

By default, XML documents with invalid characters should be rejected;
but if you turn on the internal parser in the parsing tab of the
Database Creation dialog, all invalid characters will be replaced with
FFFD. Maybe that’s what you have done?

 
That's exactly what I've done! :) I've habitually used the internal parser and didn't realize that I needed to add it as an option.
 
> Here are my commands in standalone mode:
> BaseX 8.5.3 [Standalone]
> Try 'help' to get more information.
>> create db test-bad-char /usr/home/bridger/src/another-test.xml

This should work:

> SET INTPARSE on
> CREATE DB ...

I have slightly extended our Wiki entry for the INTPARSE option [1];
hope this helps,
Christian


Absolutely yes. 
Thank you kindly.
Best,
Bridger
 
[1] http://docs.basex.org/wiki/Options#INTPARSE