Fuzzy full-text query
Hello, I am trying to match a local dataset against entries in a full-text index. I ran into the following case and was confused about the fuzzy querying performance: <x>Baxter, Richard, 1615-1691</x>[text() contains text "Baxter, Richard, 615-1691" using fuzzy] The BaseX documentation says that the default fuzzy search will preserve a minimum of 1 error. However, the query above returns no result. If I add the "errors" parameter with a value of 1, it does work: <x>Baxter, Richard, 1615-1691</x>[text() contains text "Baxter, Richard, 615-1691" using fuzzy 1 errors] Is this expected behavior? Thanks in advance, Tim -- Tim A. Thompson Librarian for Applied Metadata Research Yale University Library
Hi Tim, Thanks for the hint, I have updated the documentation. The computed error value is 0 if a token has 0 to 3 characters. All the best, Christian On Tue, Jul 6, 2021 at 9:24 PM Tim Thompson <timathom@gmail.com> wrote:
Hello,
I am trying to match a local dataset against entries in a full-text index. I ran into the following case and was confused about the fuzzy querying performance:
<x>Baxter, Richard, 1615-1691</x>[text() contains text "Baxter, Richard, 615-1691" using fuzzy]
The BaseX documentation says that the default fuzzy search will preserve a minimum of 1 error. However, the query above returns no result. If I add the "errors" parameter with a value of 1, it does work:
<x>Baxter, Richard, 1615-1691</x>[text() contains text "Baxter, Richard, 615-1691" using fuzzy 1 errors]
Is this expected behavior?
Thanks in advance, Tim
-- Tim A. Thompson Librarian for Applied Metadata Research Yale University Library
participants (2)
-
Christian Grün -
Tim Thompson