This page is no longer being maintained (as of 11 September 2012). Please check out my Melbourne Uni webpage instead!
McKenzie Postdoctoral Fellow, Language Technology Group, Department of Computing and Information Systems, University of Melbourne
Until June 2011 I was a MITACS Elevate postdoctoral fellow in Computational Linguistics in Computer Science at the University of Toronto.
Department of Computing and Information Systems
University of Melbourne
Victoria 3010, Australia
email: paulcook at unimelb dot edu dot au
I'm co-organising the Australasian Language Technology Association (ALTA) 2012 Workshop.
BibTex entries for my publications
My Google scholar profile
Bo Han, Paul Cook and Timothy Baldwin. Lexical Normalisation of Short Text Messages. To appear in ACM Transactions on Intelligent Systems and Technology.
Paul Cook. 2012. Using social media to find English lexical blends. In Proceedings of the 15th EURALEX International Congress (EURALEX 2012), pages 846–854, Oslo, Norway. .pdf code Sample output
Bo Han, Paul Cook and Timothy Baldwin. 2012. Automatically Constructing a Normalisation Dictionary for Microblogs. In Proceedings of the Conference on Empirical Methods in Natural Language Processing and Natural Language Learning (EMNLP-CoNLL 2012), pages 421–432, Jeju, Korea. .pdf
The normalisation dictionary we produced in this paper is available.
Paul Cook and Graeme Hirst. 2012. Do Web corpora from top-level domains represent national varieties of English? In Actes des 11es Journées Internationales d'Analyse Statistique des Données Textuelles / Proceedings of the 11th International Conference on Textual Data Statistical Analysis, pages 281–293, Liège, Belgium. .pdf
Jey Han Lau, Paul Cook, Diana McCarthy, David Newman and Timothy Baldwin. 2012. Word Sense Induction for Novel Sense Detection. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2012), pages 591–601, Avignon, France. .pdf
Timothy Baldwin, Paul Cook, Bo Han, Aaron Harwood, Shanika Karunasekera, and Masud Moshtaghi. 2012. A Support Platform for Event Detection using Social Intelligence. In Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics, pages 69–72, Avignon, France. .pdf
Paul Cook and Graeme Hirst. 2011. Automatic identification of words with novel but infrequent senses. In Proceedings of the 25th Pacific Asia Conference on Language Information and Computation (PACLIC 25), pages 265–274,Singapore. .pdf
Paul Cook. 2011. Book review of “A Way with Words: Recent advances in lexical theory and analysis: A Festschrift for Patrick Hanks.” Gilles-Maurice de Schryver (editor). Computational Linguistics 37(2). .pdf
Paul Cook. 2010. Exploiting linguistic knowledge to infer properties of neologisms. Ph.D. thesis, University of Toronto, November. .pdf
Paul Cook and Suzanne Stevenson. 2010. No sentence is too confusing to ignore. In Proceedings of the ACL 2010 Workshop on NLP and Linguistics: Finding the Common Ground, pages 61–69, Uppsala, Sweden. .pdf
Paul Cook and Anna Feldman, editors. 2010. Proceedings of the NAACL HLT 2010 Second Workshop on Computational Approaches to Linguistic Creativity (CALC-10), Los Angeles, California. .pdf
Paul Cook and Suzanne Stevenson. 2010. Automatically identifying changes in the semantic orientation of words. In Proceedings of the 7th International Conference on Language Resources and Evaluation, pages 28–34, Valletta, Malta. .pdf
Paul Cook and Suzanne Stevenson. 2010. Automatically identifying the source
words of lexical blends in English. Computational
Linguistics. 36(1):129–149. .pdf
The dataset of blends used in this study is available. Please contact me if you're interested.
Paul Cook and Suzanne Stevenson. 2009. An unsupervised model for text message normalization. In Proceedings of the NAACL HLT 2009 Workshop on Computational Approaches to Linguistic Creativity, pages 71–78, Boulder, Colorado. .pdf
Afsaneh Fazly, Paul Cook, and Suzanne Stevenson. 2009. Unsupervised type and token identification of idiomatic expressions. Computational Linguistics 35(1):61–103. .pdf
Paul Cook, Afsaneh Fazly, and Suzanne Stevenson. 2008. The VNC-Tokens Dataset. In Proceedings of the LREC Workshop: Towards a Shared Task for Multiword Expressions (MWE 2008), pages 19–22, Marrakech, Morocco. .pdf
The VNC-Tokens dataset is available from the Multiword Expressions Web.
Paul Cook and Suzanne Stevenson. 2007. Automagically inferring the source words of lexical blends. In Proceedings of the 10th Conference of the Pacific Association for Computational Linguistics (PACLING 2007), pages 289–297, Melbourne, Australia. .pdf
Paul Cook, Afsaneh Fazly, and Suzanne Stevenson. 2007. Pulling their weight: Exploiting syntactic forms for the automatic identification of idiomatic expressions in context. In Proceedings of the ACL Workshop on A Broader Perspective on Multiword Expressions (MWE 2007), pages 41–48, Prague, Czech Republic. .pdf
Paul Cook. 2006. Automatically Classifying English Verb-Particle Constructions by Particle Semantics. M.Sc. thesis, University of Toronto, August. .pdf
Paul Cook and Suzanne Stevenson. 2006. Classifying particle semantics in English verb-particle constructions. In Proceedings of the ACL/COLING Workshop on Multiword Expressions: Identifying and Exploiting Underlying Properties (MWE 2006), pages 45–53. Sydney, Australia. .pdf
Places to spot Paul when he's not in his office
At home in Brunswick
Hanging out with Hannah