Hi I have a question relating to full text search. I am trying to use it with Greek texts and the default collation assumes that ά != α In short, accented vowels are treated as different letters. Is this something that has to do with the collation used not being Greek, or does it have something to do with the tokenizer?
As a side question i noticed that the stemmers used from lucne are quite outdated. 3.6.0 also includes a Greek stemmer. I tried to include the 3.6.0 stemmers instead but language codes seem to be hardcoded in util/ft/Language.java Any chnce of that part of the code being updated to use the latest stemmers so more languages can be integrated? (i am not proficient in java myself unfortunately so i can't directly help)
thanks, Alex