09/06/2006
Perl XS and SWIG interface to CLucene C++ text search engine
9
Investigating Perl Options
lWrote test harness to load 1000 CVs then do some searches
lTried about 5 CPAN modules
lPLucene search speed okay for small volumes but exponential increase in insert time
>60 seconds per insert
lWhy? Tokenises doc, multi-lingual word stemming, adds doc id to reverse lookup index for each stem token
lOther modules faster but search options weak
lNeed to look further