Hi Wiard,
you could use the base-uri function of XQuery, like (probably can be done easier):
for $i at $pos in db:open("DB")//* where $i[text() contains text 'xml'] return <hit>{base-uri($i)}<score>{ft:score($i[text() contains text 'xml'])}</score></hit>
-- Andreas
Am 03.04.2011 um 12:42 schrieb Wiard Vasen:
Dear Christian and Andreas,
Thanks for your great help! I used Christians solution: ft:score(db:open("DB")//*[text() contains text 'xml']) And it works fine.
The next step is that I want to get the associated documents with these scores.
Could you help me with this step?
The results I get now is a list with frequencies, without the references to the particular documents. I think what is needed is the tf/idf score.
Regards,
Wiard
2011/3/31 Christian Grün christian.gruen@gmail.com Hi Wiard,
the tf/idf scoring is only available if you are working with full-text index structures. If you have built a full-text index for your database "DB", the following query will yield different scoring results, depending on the chosen scoring model:
ft:score(db:open("DB")//*[text() contains text 'xml'])
As Andreas indicated, however, you may either set the SCORING property with a db command or explicitly choose the type of scoring in the GUI's database creation dialog (Database -> New -> Fulltext -> TF/IDF Scoring).
Christian
On Thu, Mar 31, 2011 at 8:33 AM, Andreas Weiler andreas.weiler@uni-konstanz.de wrote:
Dear Wiard Vasen, you just need to set the scoring property once. If you work in the GUI: Go to the top input bar, choose command and type: set scoring *
as * set the scoring algorithm you like. In the console just type: set scoring * After setting this you can use the score function, like in the 8th query of our online demo (basex.org/products/live-demo): let $names := ('Jack London', 'Jack', 'Jim Beam', 'Jack Daniels') for $name at $pos score $score in $names[. contains text 'Jack'] order by $score descending return <name pos='{ $pos }'>{ $name }</name> Don't hesitate to ask for more, Andreas Am 30.03.2011 um 22:02 schrieb Wiard Vasen:
Dear sirs of Basex, I am doing my Master thesis on the letters of Vincent van Gogh at the University of Amsterdam. For that purpose I use BaseX to analyze the letters. I wonder, is there the possibility to generate a tf/idf score automatically? In your faq I noticed there needs to be a special term like 'SET SCORING 0' to be able to get a tf/idf score. This information I get from the following page: http://docs.basex.org/wiki/Full-Text Could you help me with this? I would be very grateful. Kind regards, _______________________________________________ BaseX-Talk mailing list BaseX-Talk@mailman.uni-konstanz.de https://mailman.uni-konstanz.de/mailman/listinfo/basex-talk
BaseX-Talk mailing list BaseX-Talk@mailman.uni-konstanz.de https://mailman.uni-konstanz.de/mailman/listinfo/basex-talk