27 Apr
2022
27 Apr
'22
9:43 p.m.
"men" is stemmed to both "man" and "cocksman"--shaking my head on that one!
Reads like a subversive Easter egg :·) Thanks for the efforts, Tim. The results (memorandum → memo, others) indicate to me that a dictionary-based solution has been chosen by Bitext. I have decided to enhance our Porter algorithm with a simple dictionary that contains the most common irregular plural forms, including the ones you analyzed and various others. A new snapshot is available [1,2]. Best, Christian [1] https://github.com/BaseXdb/basex/issues/2097 [2] https://files.basex.org/releases/latest/