Measuring Semantic Similarity Between Words and Improving Word Similarity by Augmenting PMI
Measuring the semantic similarity between words is an important component of many web tasks, such as relation extraction, community mining, document clustering, and automatic metadata extraction. Despite the usefulness of semantic similarity measures in these applications, accurately measuring the semantic similarity between two words remains a challenging task. Pointwise Mutual Information (PMI) is a widely used word similarity measure, but it lacks a clear explanation of how it works. PMImax augments PMI by estimating the maximum correlation between two words, i.e., the correlation between their closest senses.
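
For readers unfamiliar with the baseline measure, standard PMI is defined as PMI(x, y) = log( p(x, y) / (p(x) p(y)) ). The sketch below is only an illustration of that standard definition, not the PMImax method described here. It assumes probabilities are estimated from document-level co-occurrence counts, and the function name `pmi_table` and the toy corpus are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations


def pmi_table(documents):
    """Build a pmi(x, y) function from tokenized documents.

    Assumption: two words co-occur when they appear in the same document,
    and probabilities are plain relative frequencies over documents.
    """
    n = len(documents)
    word_count = Counter()   # number of documents containing each word
    pair_count = Counter()   # number of documents containing both words

    for doc in documents:
        words = set(doc)
        word_count.update(words)
        pair_count.update(frozenset(p) for p in combinations(sorted(words), 2))

    def pmi(x, y):
        # Only meaningful for two distinct words.
        joint = pair_count[frozenset((x, y))]
        if joint == 0:
            # The words never co-occur, so the log ratio diverges.
            return float("-inf")
        # log2( p(x, y) / (p(x) * p(y)) ) with p(.) = count / n
        return math.log2(joint * n / (word_count[x] * word_count[y]))

    return pmi


# Toy corpus (hypothetical data, for illustration only).
docs = [
    ["bank", "money"],
    ["bank", "river"],
    ["money", "loan"],
    ["bank", "money", "loan"],
]
pmi = pmi_table(docs)
print(pmi("money", "loan"))
print(pmi("bank", "money"))
```

Because this estimate pools all occurrences of a word regardless of sense, a polysemous word such as "bank" has its count spread across unrelated contexts, which is the limitation that a sense-aware variant is meant to address.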