Dear Michael,
thanks for your mail. Just to get sure: did you have a look at our Wiki page on Japanese tokenization [1]?
I believe there may be quite different reasons why the files are not found. Could you provide us with a simple, self-contained example that allows us to reproduce the problem?
Christian
[1] http://docs.basex.org/wiki/Full-Text:_Japanese ___________________________
On Wed, Mar 28, 2012 at 1:21 AM, Michael Wilson wilsonma77@gmail.com wrote:
I’m working on a hobbyist project of mine, and am having trouble with stemming when running queries from a java app (I’m using the BaseX API to search XML japanese dictionary files via a custom swing/GUI application). Specifically, I receive the error message: “[FTST0009] No tokenizer available for language 'Japanese'.” I do have the etc/ja folder present.
This problem does not occur when I run the same query from the BaseX GUI. I replicated the same class path I observed in the basexgui.bat file: I added basex.jar, igo-0.4.3.jar, lucene-stemmers.3.4.0.jar, xml-resolvers-1.2.jar, tagsoup-1.2.jar to the Path environment variable, but it didn’t help.
Does anyone have any ideas on how I can make stemming work from my application? I appreciate any insights or solutions.