On February 2, 2016 at 12:13:14 PM, Christian Grün (christian.gruen@gmail.com) wrote:
Hi Ron,
I’m pretty sure that the default tokenizer discards the ampersand and
doesn’t pass it on as token at all.
Hope this helps (…at least for understanding the query result),
Christian
On Tue, Feb 2, 2016 at 6:10 PM, Ron Katriel <rkatriel@mdsol.com> wrote:
> Hi,
>
> Given this thesaurus entry
>
> <thesaurus xmlns="http://www.w3.org/2007/xqftts/thesaurus">
> <entry>
> <term>&</term>
> <synonym>
> <term>and</term>
> <relationship>USE</relationship>
> </synonym>
> </entry>
> </thesaurus>
>
> I was expecting the following query to return true (file path omitted for
> clarify)
>
> 'Frontier Science and Technology Research Foundation, Inc.' contains text
> 'Frontier Science & Technology Research Foundation, Inc.' using
> thesaurus at "thesaurus.xml”
>
> but it returns false. Switching the order of the term and synonym makes no
> difference.
>
> I tried getting around this using a stop word file (which includes ‘and’,
> ‘&’, and '&’, just in case) but it does not work either.
>
> Am I missing something?
>
> Thanks,
> Ron
>