“The anonymity of the Netflix Prize dataset has been broken by a pair of computer scientists from the University of Texas[…]. It turns out that […] it’s straightforward to find a match by comparing the anonymized data against publicly available ratings on the Internet Movie Database (IMDb)[…] in the process possibly working out their political affiliation, sexual preferences and a number of other personal details”
I suppose I should consider this a privacy violation of some sort, since I’m almost certainly represented in the data set, but whatever.
For that matter, my ratings data may well have been especially useful in helping the computer scientists crack the anonymization.
Whoops. Sorry I’m such a movie geek.
In honor of my likely indirect contribution to this situation, I offer my first-ever LOLcat.