[Date Prev] [Date Next] [Thread Prev] [Thread Next] | Indexes: Main | Date | Thread | Author |
Peter Jones wrote: (01) > The other way to do things parallels (I think) some of the > stuff that Chris Dent has done. > 1- Parse the existing archive for terms, recording locations of terms > 2- Cull out anything useless like stop-words (e.g. 'the', 'and', etc.) > 3- Parse any new mails against this growing index, recording locations > of terms (02) Second the motion! (03)