Index of /2008/thesis

Name                       Last modified      Size
Parent Directory                              -
thesis.pdf                 2008-05-31 18:06   805K
re2c.patch                 2008-05-31 18:06   18K
python-optimisations.patch 2008-05-31 18:06   16K
lucille-r111.tar.bz2       2008-05-31 18:06   1.2M
indexinput.patch           2008-05-31 18:06   29K
clucene-search.patch       2008-05-31 18:06   1.3M
20_newsgroup.tar.gz        2008-05-31 18:06   17M

Contents of this CD-ROM
-----------------------

thesis.pdf
	Thesis document, as submitted electronically on 28/5/2008

lucille-r111.tar.bz2
	The original version of Lucille, with none of the performance
	improvements discussed in the thesis applied

indexinput.patch
	_store extension module and related changes (see Chapter 2)

re2c.patch
	analysis._standard extension module using RE2C, and related changes (see Chapter 4)

python-optimisations.patch
	Python-level optimisations (see Chapter 5)

clucene-search.patch
	CLucene search module using Boost.Python, and related changes (see Chapter 6)

20_newsgroup.tar.gz
	The 20 Newsgroups test corpus, available from 
	<http://www-2.cs.cmu.edu/afs/cs.cmu.edu/project/theo-20/www/data/news20.tar.gz>


Other notes
-----------

Lucille requires Python 2.5.