Association for Computing Machinery
The real-time information on news sites, blogs and social networking sites changes dynamically and spreads rapidly through the Web. Developing methods for handling such information at a massive scale requires that the authors think about how information content varies over time, how it is transmitted, and how it mutates as it spreads. They describe the News Information Flow Tracking, Yay! (NIFTY) system for large scale real-time tracking of \"Memes\" - short textual phrases that travel and mutate through the Web. NIFTY is based on a novel highly-scalable incremental meme-clustering algorithm that efficiently extracts and identifies mutational variants of a single meme.