Wednesday, August 15, 2007

YaCy crawl of this site

At the suggestion of some IRCers, I tried YaCy, the open-source, P2P web search and crawling tool. I have to say I am impressed with its features and the speed of its web crawl.

Here is the visual result of a web crawl of hotwigati.blogspot.com:



And an action shot of the crawl:

The exciting new feature in YaCy is the ability to distribute web crawling among many peers, resulting in fast and extensive crawls.
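To give a rough feel for what splitting a crawl among peers can look like, here is a minimal Python sketch. It is not YaCy's actual algorithm or API: the function names `assign_peer` and `partition` and the peer labels are made up for illustration, and the hash-by-host scheme is just one simple way to divide the work so that every URL from the same site lands on the same peer.

```python
import hashlib
from collections import defaultdict
from urllib.parse import urlsplit


def assign_peer(url: str, peers: list[str]) -> str:
    """Pick a peer for a URL by hashing its host name (illustrative scheme only)."""
    host = urlsplit(url).hostname or ""
    digest = hashlib.sha256(host.encode("utf-8")).digest()
    # Use the first 8 bytes of the digest as an integer and map it onto the peer list.
    index = int.from_bytes(digest[:8], "big") % len(peers)
    return peers[index]


def partition(urls: list[str], peers: list[str]) -> dict[str, list[str]]:
    """Group URLs into one crawl queue per peer."""
    queues = defaultdict(list)
    for url in urls:
        queues[assign_peer(url, peers)].append(url)
    return dict(queues)
```

Calling `partition(urls, ["peer-a", "peer-b", "peer-c"])` returns a dictionary mapping each peer that received work to its list of URLs. Hashing on the host rather than the full URL keeps one site's pages together, which would make it easier for a single peer to rate-limit its requests to that site.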

Of course, there are privacy and security concerns. The software is Java-based, which reduces the risk of some kinds of exploitation (memory-corruption bugs such as buffer overflows, for instance), but there still remains the 'risk' of having your IP address show up in the logs of sites that anonymous peers crawl through your machine.