Rare Word

Description

Minh-Thang Luong et al. (2013) introduced a word similarity dataset focusing on rare words. Its 2034 word pairs are morphologically more complex than those in other well-established word similarity datasets; one example pair is (crudeness, impoliteness). Details can be found in the paper listed under Reference.
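Word similarity datasets of this kind are commonly used by correlating a model's similarity scores with the human ratings. The sketch below illustrates that idea using only the standard library. It rests on several assumptions that this page does not confirm: that each line holds `word1<TAB>word2<TAB>score`, that model similarity is the cosine of two word vectors, and that Spearman rank correlation is the metric. The names `parse_pairs`, `evaluate`, the toy words, and all toy numbers are hypothetical and are not taken from the dataset.

```python
import math


def parse_pairs(lines):
    """Parse lines assumed to look like 'word1<TAB>word2<TAB>score[...]'."""
    pairs = []
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        pairs.append((fields[0], fields[1], float(fields[2])))
    return pairs


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation (0.0 if either input has no variance)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the rank vectors."""
    return pearson(ranks(x), ranks(y))


def evaluate(pairs, vectors):
    """Correlate gold scores with cosine similarity.

    Pairs with a word missing from `vectors` are skipped, which matters
    for rare words. Returns (correlation, number of pairs actually used).
    """
    gold, predicted = [], []
    for w1, w2, score in pairs:
        if w1 in vectors and w2 in vectors:
            gold.append(score)
            predicted.append(cosine(vectors[w1], vectors[w2]))
    return spearman(gold, predicted), len(gold)


# Toy, invented input: NOT real dataset content.
toy_lines = ["alpha\tbeta\t8.0\n", "alpha\tgamma\t2.0\n", "beta\tdelta\t5.0\n"]
toy_vectors = {
    "alpha": [1.0, 0.0],
    "beta": [0.9, 0.1],
    "gamma": [0.0, 1.0],
    # "delta" is deliberately missing to show the skip behaviour
}
rho, used = evaluate(parse_pairs(toy_lines), toy_vectors)
print(f"Spearman correlation over {used} pairs: {rho:.3f}")
```

Reporting the number of pairs actually scored alongside the correlation is a deliberate choice here: with rare words, many pairs may fall outside a model's vocabulary, and the correlation alone would hide that.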

Reference

Minh-Thang Luong, Richard Socher, and Christopher D. Manning. 2013. Better word representations with recursive neural networks for morphology. In Proceedings of the Seventeenth Conference on Computational Natural Language Learning, pages 104–113. Association for Computational Linguistics.

Download

rw.zip