This page is no longer being maintained (as of 11 September 2012). Please check out my Melbourne Uni webpage instead!
McKenzie Postdoctoral Fellow, Language Technology Group, Department of Computing and Information Systems, University of Melbourne
Until June 2011 I was a MITACS Elevate postdoctoral fellow in Computational Linguistics in Computer Science at the University of Toronto.
Contact info
Paul Cook
Department of Computing and Information Systems
University of Melbourne
Victoria 3010, Australia
email: paulcook at unimelb dot edu dot au
Twitter: @cpaulcook
ALTA 2012
I'm co-organising the Australasian Language Technology Association (ALTA) 2012 Workshop.
Publications
BibTeX entries for my publications
My Google Scholar profile
To appear
Bo Han, Paul Cook and Timothy Baldwin. Lexical Normalisation of Short Text Messages. To appear in ACM Transactions on Intelligent Systems and Technology.
2012
Paul Cook. 2012. Using social media to find English lexical blends. In Proceedings of the 15th EURALEX International Congress (EURALEX 2012), pages 846–854, Oslo, Norway. .pdf code Sample output
Bo Han, Paul Cook and Timothy Baldwin. 2012. Automatically Constructing a Normalisation Dictionary for Microblogs. In Proceedings of the Conference on Empirical Methods in Natural Language Processing and Natural Language Learning (EMNLP-CoNLL 2012), pages 421–432, Jeju, Korea. .pdf
The normalisation dictionary we produced in this paper is available.
Paul Cook and Graeme Hirst. 2012. Do Web corpora from top-level domains represent national varieties of English? In Actes des 11es Journées Internationales d'Analyse Statistique des Données Textuelles / Proceedings of the 11th International Conference on Textual Data Statistical Analysis, pages 281–293, Liège, Belgium. .pdf
Jey Han Lau, Paul Cook, Diana McCarthy, David Newman and Timothy Baldwin. 2012. Word Sense Induction for Novel Sense Detection. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2012), pages 591–601, Avignon, France. .pdf
Timothy Baldwin, Paul Cook, Bo Han, Aaron Harwood, Shanika Karunasekera, and Masud Moshtaghi. 2012. A Support Platform for Event Detection using Social Intelligence. In Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics, pages 69–72, Avignon, France. .pdf
2011
Paul Cook and Graeme Hirst. 2011. Automatic identification of words with novel but infrequent senses. In Proceedings of the 25th Pacific Asia Conference on Language Information and Computation (PACLIC 25), pages 265–274, Singapore. .pdf
Paul Cook. 2011. Book review of “A Way with Words: Recent advances in lexical theory and analysis: A Festschrift for Patrick Hanks.” Gilles-Maurice de Schryver (editor). Computational Linguistics 37(2). .pdf
2010
Paul Cook. 2010. Exploiting linguistic knowledge to infer properties of neologisms. Ph.D. thesis, University of Toronto, November. .pdf
Paul Cook and Suzanne Stevenson. 2010. No sentence is too confusing to ignore. In Proceedings of the ACL 2010 Workshop on NLP and Linguistics: Finding the Common Ground, pages 61–69, Uppsala, Sweden. .pdf
Paul Cook and Anna Feldman, editors. 2010. Proceedings of the NAACL HLT 2010 Second Workshop on Computational Approaches to Linguistic Creativity (CALC-10), Los Angeles, California. .pdf
Paul Cook and Suzanne Stevenson. 2010. Automatically identifying changes in the semantic orientation of words. In Proceedings of the 7th International Conference on Language Resources and Evaluation, pages 28–34, Valletta, Malta. .pdf
Paul Cook and Suzanne Stevenson. 2010. Automatically identifying the source words of lexical blends in English. Computational Linguistics 36(1):129–149. .pdf
The dataset of blends used in this study is available. Please contact me if you're interested.
2009
Paul Cook and Suzanne Stevenson. 2009. An unsupervised model for text message normalization. In Proceedings of the NAACL HLT 2009 Workshop on Computational Approaches to Linguistic Creativity, pages 71–78, Boulder, Colorado. .pdf
Afsaneh Fazly, Paul Cook, and Suzanne Stevenson. 2009. Unsupervised type and token identification of idiomatic expressions. Computational Linguistics 35(1):61–103. .pdf
2008
Paul Cook, Afsaneh Fazly, and Suzanne Stevenson. 2008. The VNC-Tokens Dataset. In Proceedings of the LREC Workshop: Towards a Shared Task for Multiword Expressions (MWE 2008), pages 19–22, Marrakech, Morocco. .pdf
The VNC-Tokens dataset is available from the Multiword Expressions Web.
2007
Paul Cook and Suzanne Stevenson. 2007. Automagically inferring the source words of lexical blends. In Proceedings of the 10th Conference of the Pacific Association for Computational Linguistics (PACLING 2007), pages 289–297, Melbourne, Australia. .pdf
Paul Cook, Afsaneh Fazly, and Suzanne Stevenson. 2007. Pulling their weight: Exploiting syntactic forms for the automatic identification of idiomatic expressions in context. In Proceedings of the ACL Workshop on A Broader Perspective on Multiword Expressions (MWE 2007), pages 41–48, Prague, Czech Republic. .pdf
2006
Paul Cook. 2006. Automatically Classifying English Verb-Particle Constructions by Particle Semantics. M.Sc. thesis, University of Toronto, August. .pdf
Paul Cook and Suzanne Stevenson. 2006. Classifying particle semantics in English verb-particle constructions. In Proceedings of the ACL/COLING Workshop on Multiword Expressions: Identifying and Exploiting Underlying Properties (MWE 2006), pages 45–53, Sydney, Australia. .pdf
Places to spot Paul when he's not in his office
At home in Brunswick
Hanging out with Hannah