As the data is quite huge around 2GB, which creates problems while
creating FT Index, I have divided it into 2 parts. For a better picture
of
what I have done, I am explaining it with a suitable example.
Lets say I have articles on Physics in one single DB. So, I divided
them
into 2 DBs, viz; short description and long description. So, now I
have 2 DBs -
PhysicsSD
PhysicsLD
By doing so, I am not getting OOM now while creating FT Index. (Wow!!).
But now, I am facing another issue...
Say, I want to search for words "emf" and "waves" into the Physics
DBs, I would do this -
for $dbname in ('physicsSD', 'physicsLD')
for $x in doc($dbname)//Doc[SD/Info/text() contains text {"emf waves"} all words or
LD/Info/Para/text() contains text {"emf waves"} all words]
order by xs:integer($x/Details/Year) descending
return $x/Doc
This query retrieves data in approx. 83000 ms (83 Sec)
But
when executed on INDIVIDUAL DBs, the total time is very very less
compared to the above. Its only 4500 ms (4.5 sec) for BOTH !!!
--
Have a nice day
JBest