Re: [basex-talk] Using stemming via the BaseX API

28 Mar 2012


      Dear Michael,
thanks for your mail. Just to get sure: did you have a look at our
Wiki page on Japanese tokenization [1]?
I believe there may be quite different reasons why the files are not
found. Could you provide us with a simple, self-contained example that
allows us to reproduce the problem?
Christian
[1] http://docs.basex.org/wiki/Full-Text:_Japanese
___________________________
On Wed, Mar 28, 2012 at 1:21 AM, Michael Wilson wilsonma77@gmail.com wrote:
...
I’m working on a hobbyist project of mine, and am having trouble with
stemming when running queries from a java app (I’m using the BaseX API to
search XML japanese dictionary files via a custom swing/GUI application).
Specifically, I receive the error message: “[FTST0009] No tokenizer
available for language 'Japanese'.”  I do have the etc/ja folder present.
This problem does not occur when I run the same query from the BaseX GUI.  I
replicated the same class path I observed in the basexgui.bat file: I added
basex.jar, igo-0.4.3.jar, lucene-stemmers.3.4.0.jar, xml-resolvers-1.2.jar,
tagsoup-1.2.jar to the Path environment variable, but it didn’t help.
Does anyone have any ideas on how I can make stemming work from my
application?  I appreciate any insights or solutions.

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

Re: [basex-talk] Using stemming via the BaseX API