Project Members
The following list only gives the function in regard to the
corpus preprocessing.
Some of them and other people have contributed to DCU CLEF 2004
in other ways as well.
- Gareth J. F. Jones
- coordinator
- Michael Burke
- stemming and integration of modules
- John Judge
- tokenisation, stopword removal and decapitalisation
- Anna Kasin
- Russian expertise and character set conversion
- Adenike Lam-Adesina
- restrictions of OKAPI system
- Joachim Wagner
- word encoding, punctuation,
character set conversion and
decapitalisation
Tuesday, 17-May-2005 09:56:15 GMT