people using blo.gs ping data
if you create a new blog, and only ping blo.gs, you'll soon be visited by robots from: radio vox populi (apparently twice: once from a server at the mit media lab, and another from the media lab europe), mercubot, que pasa corporation (related to this?), ibm/sequent (which will try to fetch various filenames looking for rss or atom feeds), feedster (which will also try various filenames), and blogshares.
the only one which will fetch robots.txt
is the one from the quepasa.com. the robots from radio vox populi and ibm/sequent only identify themselves as libwww-perl.
popdex and hostmon are two more. the robot from hostmon only identifies itself as "Jakarta Commons-HttpClient/2.0final"