Word Similarity 353

Description

WordSim 353 is a standard dataset for evaluating vector-space models. It consists of 353 word pairs. Each pair is presented without context and rated by 13 or 16 human annotators for similarity or relatedness on a scale from 0 (totally unrelated words) to 10 (very closely related or identical words).

Details of the test set can be found on this page.
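
As a rough illustration of how a rated word-pair set like this is typically used (the procedure is not described on this page, so treat it as an assumption): score each pair with the cosine similarity of the model's word vectors, then report the Spearman rank correlation between those scores and the human ratings. The sketch below uses only the Python standard library. The function names, the toy vectors, and the toy ratings are all hypothetical and are not taken from the dataset.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def evaluate(pairs, vectors):
    """Return (Spearman rho, number of pairs scored).

    pairs   -- iterable of (word1, word2, human_rating)
    vectors -- dict mapping word -> list of floats
    Pairs containing a word missing from `vectors` are skipped.
    """
    human, model = [], []
    for w1, w2, rating in pairs:
        if w1 in vectors and w2 in vectors:
            human.append(rating)
            model.append(cosine(vectors[w1], vectors[w2]))
    return spearman(human, model), len(human)


# Toy data for illustration only; these are NOT real WordSim 353 ratings.
toy_pairs = [
    ("cat", "dog", 8.0),
    ("cat", "car", 2.0),
    ("dog", "car", 1.0),
    ("cat", "moon", 0.5),  # "moon" has no vector, so this pair is skipped
]
toy_vectors = {
    "cat": [1.0, 0.0],
    "dog": [0.8, 0.6],
    "car": [0.0, 1.0],
}

rho, used = evaluate(toy_pairs, toy_vectors)
print(f"Spearman rho = {rho:.3f} over {used} pairs")
```

Skipping out-of-vocabulary pairs is only one possible policy. Whichever policy is chosen, it helps to report the number of pairs actually scored alongside the correlation, since results computed over different subsets are not directly comparable.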

Reference

Lev Finkelstein, Evgeniy Gabrilovich, Yossi Matias, Ehud Rivlin, Zach Solan, Gadi Wolfman, and Eytan Ruppin. 2002. Placing search in context: The concept revisited. ACM Trans. Inf. Syst., 20(1):116–131, January.

Download

wordsim353.zip