[Lccd-internal] Yahoo releases largest Machine Learning dataset
Srijan Kumar
srijan at cs.umd.edu
Fri Jan 22 17:27:30 EST 2016
New dataset:
http://yahoolabs.tumblr.com/post/137281912191/yahoo-releases-the-largest-ever-machine-learning
"The dataset stands at a massive ~110B events (13.5TB uncompressed) of
anonymized user-news item interaction data, collected by recording the
user-news item interactions of about 20M users from February 2015 to May
2015. The Yahoo News Feed dataset is a collection based on a sample of
anonymized user interactions on the news feeds of several Yahoo properties,
including the Yahoo homepage, Yahoo News, Yahoo Sports, Yahoo Finance, Yahoo
Movies, and Yahoo Real Estate."