Rare Word


Minh-Thang Luong et al. introduced a dataset focused on rare words. Its 2034 word pairs are more morphologically complex than those in other well-established word similarity datasets, e.g. (crudeness, impoliteness). Details can be found in the paper cited below.


Minh-Thang Luong, Richard Socher, and Christopher D. Manning. 2013. Better word representations with recursive neural networks for morphology. In Proceedings of the Seventeenth Conference on Computational Natural Language Learning, pages 104–113. Association for Computational Linguistics.
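
Word similarity datasets of this kind are commonly used by comparing human similarity scores against a model's scores with Spearman rank correlation. The following is a minimal sketch of that procedure, not the authors' evaluation code. It assumes each line of the data file holds word1, word2 and a score separated by tabs (an assumption; check the actual layout of the file in rw.zip). The function names, the toy word pairs, their scores and the toy vectors are all hypothetical and made up for illustration.

```python
import math


def parse_pairs(lines):
    """Parse lines ASSUMED to look like 'word1<TAB>word2<TAB>score[<TAB>...]'."""
    pairs = []
    for line in lines:
        fields = line.strip().split("\t")
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        pairs.append((fields[0], fields[1], float(fields[2])))
    return pairs


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y) if sd_x and sd_y else 0.0


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    if len(x) < 2:
        return float("nan")
    return pearson(ranks(x), ranks(y))


def evaluate(pairs, vectors):
    """Return (Spearman correlation, number of pairs covered by `vectors`)."""
    gold, predicted = [], []
    for w1, w2, score in pairs:
        if w1 in vectors and w2 in vectors:  # skip out-of-vocabulary pairs
            gold.append(score)
            predicted.append(cosine(vectors[w1], vectors[w2]))
    return spearman(gold, predicted), len(gold)


if __name__ == "__main__":
    # Invented toy data, NOT taken from the dataset.
    toy_lines = ["cat\tkitten\t8.0", "cat\tcar\t1.5", "car\ttruck\t7.0"]
    toy_vectors = {
        "cat": [1.0, 0.1], "kitten": [0.9, 0.2],
        "car": [0.1, 1.0], "truck": [0.2, 0.9],
    }
    rho, covered = evaluate(parse_pairs(toy_lines), toy_vectors)
    print(f"Spearman rho = {rho:.3f} over {covered} pairs")
```

Pairs containing a word missing from the model's vocabulary are skipped here, which matters for a rare-word dataset; reporting the number of covered pairs alongside the correlation makes results easier to compare.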


rw.zip