Today we are adding a new tool that reports to users which sites they have visited while walking the link graph of … More
Category: Human-Computer Interaction
The Human and Civil Rights Collection Miner
The Human and Civil Rights Collection is a new miner built with the Minerazzi platform. It is available at http://www.minerazzi.com/hcrc. … More
Mining SEO conference sites and their speakers with Minerazzi
The SEOMiner (http://www.minerazzi.com/seominer) can be used to illustrate the power of recursively crawling sites and social networks with … More
Mining Stanford, Cornell, and MIT Universities
Yesterday we launched the US University Sites Collection (http://www.minerazzi.com/usc). This is a miner built with the Minerazzi platform that allows … More
The US University Sites Collection
We have launched a new miner: The US University Sites Collection (US). Available at http://www.minerazzi.com/usc, this miner lets you mine … More
Mining Hubs with Minerazzi
This morning we added an entirely new data set to the Puerto Rico Collection (http://www.minerazzi.com/prbusca). The set added was … More
The Information Retrieval Collection (IRC)
A new miner is available at Minerazzi.com: The Information Retrieval Collection (http://www.minerazzi.com/irc). What can you do with it? Use this … More
Building topic-specific collections, the easy way
We have improved the Minerazzi platform (http://www.minerazzi.com) by adding new features. These include an internal filter for deduplicating URLs, which … More
Unveiling Link Honey Pots with Minerazzi
In Web Spam Taxonomy, Gyongyi and Garcia-Molina describe several web spam techniques, one being honey pots. They describe these as … More
Minerazzi: Allowing Users to Recrawl Search Results
Effective immediately, Minerazzi (http://www.minerazzi.com) allows users to recursively recrawl search results. Why is recrawling so important? The purpose of allowing … More