Natural Language Computing
Resources for Assignment 1
Check the 401 home page and the course bulletin board regularly for course announcements, assignment hints, etc.
- The Brown Corpus (gzipped tar file for Unix / Linux, 2.9 MB)
- The Brown Corpus (zipped file for PC, 3.2 MB)
- The Brill tagger (gzipped tar file for Unix / Linux, 1.3 MB)
- Brill tagger for Windows - you have a few options here:
- Scripts:
- Word lists:
- C4.5 package, release 8 (approx. 145 KB). The link is at the bottom of Professor Quinlan's home page; you get a gzipped tar file with source code and installation instructions. Note the restrictions on distribution! This package runs under Unix / Linux only, not under Windows.
- BrownStats.cases
Python
Python is available for all platforms from the Python website. Remember to test your scripts on CDF before submitting: your assignment must work on CDF in order to get any marks!
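Several of the downloads listed above are gzipped tar files. As a minimal sketch, assuming you prefer to unpack them from Python rather than with the command-line tar tool, the standard tarfile module can do it. The function name extract_archive and the archive name in the usage note are hypothetical and are not part of the assignment materials.

```python
import tarfile
from pathlib import Path


def extract_archive(archive_path, dest_dir):
    """Unpack a gzipped tar file into dest_dir and return its member names.

    extract_archive is a hypothetical helper written for illustration.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive_path, "r:gz") as archive:
        names = archive.getnames()
        # Newer Python versions provide extraction filters that reject
        # unsafe member paths; use the "data" filter when it is available.
        if hasattr(tarfile, "data_filter"):
            archive.extractall(dest, filter="data")
        else:
            archive.extractall(dest)
    return names
```

For example, extract_archive("brown.tar.gz", "brown") would unpack a downloaded archive into a directory named brown; substitute the actual name of the file you downloaded. The version check is there because the Python installed on CDF may differ from the one on your own machine.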
Other resources
Last modified by Gerald Penn, 21 January 2008
This web page was adapted from the web page for another course, created by Graeme Hirst.