[basex-talk] full text search collation

22 Jun 2012


      Hi
I have a question relating to full text search. I am trying to use it with
Greek texts and the default collation assumes that ά != α
In short, accented vowels are treated as different letters.
Is this something that has to do with the collation used not being Greek,
or does it have something to do with the tokenizer?
As a side question i noticed that the stemmers used from lucne are quite
outdated. 3.6.0 also includes a Greek stemmer. I tried to include the 3.6.0
stemmers  instead but language codes seem to be hardcoded in
util/ft/Language.java  Any chnce of that part of the code being updated to
use the latest stemmers so more languages can be integrated? (i am not
proficient in java myself unfortunately so i can't directly help)
thanks,
Alex

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

[basex-talk] full text search collation