This page presents datasets and resources for public use and research in natural language processing, computational linguistics and cognition.Child overextension dataset described here includes over 230 word-referent pairs of overextended noun usages (e.g., ball → "balloon") recorded in young children, with an accompanying code repository.
- Citation: Ferreira Pinto Jr., R. and Xu, Y. (2021) A computational theory of child overextension. Cognition, 206, 104472.
- Citation: Xie, J. Y., Hirst, G., and Xu, Y. (2020). Contextualized moral inference. arXiv preprint arXiv:2008.10762.
- Citation: Tanchip, C., Yu, L., Xu, A., and Xu, Y. (2020) Inferring symmetry in natural language. In Findings of the Conference on Empirical Methods in Natural Language Processing.
- Citation: Zinin, S. and Xu, Y. (2020) Corpus of Chinese dynastic histories: Gender analysis over two millennia. In Proceedings of the 12th International Conference on Language Resources and Evaluation.