unicode characters missing in the output

9 Jun 2026

      Hi,

I'm trying to figure out a character display issue when using basex.

I have a document originally created by using a unicode font called 
fedorovsk.otf (https://sci.ponomar.net/fonts.html)

When I open the file inside a text editor, I can see all characters and 
ligatures (since the font is installed on the system), and it also works 
in the browser when using the font with @font-face.

When I load the document with basex (either by the specifying the path 
or after indexing it), ir returns the word with certain characters missing.

You can see the difference here : https://gprt.fr/unicode/test.html

Apparently, the codes for the first entity are  or 

When adding those codes to the XML document, they disappear in the output.

Do you have any idea on what's going on and how to fix it?

Regards

Guillaume

Guillaume Porte

Graydon Saunders

Gunther Rademacher

tags

participants (3)