lWrote test harness to load 1000 CVs then do some searches
lTried about 5 CPAN modules
lPLucene search speed okay for small volumes but
exponential increase in insert time
>60 seconds per
insert
lWhy? Tokenises doc,
multi-lingual word stemming, adds doc id to reverse lookup index for each stem token
lOther modules faster but search options weak
lNeed to look further