Looking to do research based on data gathered from across the web? That's one of the purposes of Common Crawl, and the group has just released new data, as well as a contest to encourage use of that data
The 2012 data, which contains 3.8 billion web documents, shows stats such as 63% of top level domains being .com or there being 61 million domains overall.
Common Crawl is also currently running its first-ever Common Crawl Code Contest challenging developers to do something innovative using the data relating to job trends or social impact analysis. Three winners will each get $1,00 [...]
Related Topics: Channel: Consumer | Common Crawl | Top News