As the data is quite huge around 2GB, which creates problems while
creating FT Index, I have divided it into 2 parts. For a better picture of
what I have done, I am explaining it with a suitable example.
Lets say I have articles on Physics in one single DB. So, I divided them
into 2 DBs, viz; short description and long description. So, now I have 2
DBs -
PhysicsSD
PhysicsLD
By doing so, I am not getting OOM now while creating FT Index. (Wow!!).
But now, I am facing another issue...
Say, I want to search for words "emf" and "waves" into the Physics
DBs, I would do this -
for $dbname in ('physicsSD', 'physicsLD')
for $x in doc($dbname)//Doc[SD/Info/
text() contains text {"emf waves"} all words or
LD/Info/Para/text() contains text {"emf waves"} all words]
order by xs:integer($x/Details/Year) descending
return $x/Doc
This query retrieves data in approx. 83000 ms (83 Sec)
But when executed on INDIVIDUAL DBs, the total time is very very less
compared to the above. Its only 4500 ms (4.5 sec) for BOTH !!!
--
Have a nice day
JBest