Hello all,

I need to merge all XML files contained in about 500 GB of ZIP archives into a single database for further analysis. Is there a faster or more efficient way to do this in BaseX than the following query?

 

for $zip in file:list($src, false(), '*.zip')
  let $arch := file:read-binary(concat($src, '\', $zip))
  for $a in archive:entries($arch)[ends-with(., 'xml')]
  return db:add('my_db', archive:extract-text($arch, $a), $a)

 

 

TIA,

Constantine

 


