Unicode codepoints would be fine, Sometimes data providers adopt very strange data formats ...
Done [1]; feel free to check out the latest snapshot [2]. Christian
[1] http://docs.basex.org/wiki/Parsers#CSV_Parser [2] http://docs.basex.org/wiki/Releases