You Are What You Like! Information Leakage Through Users' Interests
Suppose that a Facebook user, whose age is hidden or missing, likes Britney Spears. Can one guess the user's age? Knowing that most Britney fans are teenagers, a human can answer this question fairly easily. Users' interests (or "Likes") are among the most widely available pieces of online information. In this paper, the authors show how these seemingly harmless interests (e.g., music interests) can leak privacy-sensitive information about users. In particular, they infer a user's undisclosed (private) attributes from the public attributes of other users who share similar interests. To compare user-defined interest names, they extract the semantics of those names using an ontologized version of Wikipedia and measure their similarity by applying a statistical learning method.
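
The core idea, guessing a hidden attribute from the public attributes of users with similar interests, can be illustrated with a minimal sketch. This is not the authors' method: the paper compares interest names semantically through an ontologized Wikipedia and a statistical learning method, whereas the sketch below swaps in plain Jaccard overlap on raw interest names as a stand-in. The function names, the age buckets, and the toy profiles are all hypothetical and made up for illustration.

```python
from collections import Counter


def jaccard(a, b):
    """Overlap between two interest sets, in [0, 1]."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def infer_attribute(target_interests, public_profiles, k=3):
    """Guess a hidden attribute by a similarity-weighted vote.

    public_profiles is a list of (interests, attribute_value) pairs taken
    from users who disclose the attribute. Only the k most similar
    profiles vote, and profiles with zero overlap are ignored.
    Returns None when no profile shares any interest with the target.
    """
    scored = [
        (jaccard(target_interests, interests), value)
        for interests, value in public_profiles
    ]
    top = sorted(scored, key=lambda sv: sv[0], reverse=True)[:k]

    votes = Counter()
    for score, value in top:
        if score > 0:
            votes[value] += score
    if not votes:
        return None
    return votes.most_common(1)[0][0]


# Toy data (invented): interests and a disclosed age bucket per user.
profiles = [
    ({"Britney Spears", "Justin Bieber"}, "13-17"),
    ({"Britney Spears", "Katy Perry"}, "13-17"),
    ({"Miles Davis", "John Coltrane"}, "35-44"),
    ({"Britney Spears", "Miles Davis"}, "25-34"),
]

guess = infer_attribute({"Britney Spears"}, profiles, k=3)
print(guess)
```

Exact string matching on interest names is the main weakness of this stand-in: two users who write the same interest differently would not be matched, which is the gap the paper's semantic comparison of user-defined interest names is meant to close.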