blo.gs growth
some people have been interested in the growth of the data that blogs collects. here’s some numbers. this is from a snapshot of the database from sunday morning, so it isn’t to-the-minute.
of the 6,602,676 entries in the blo.gs database snapshot i used, rss/atom feeds were known for 2,512,959 of them. that doesn’t mean more didn’t have rss/atom feeds, just that blo.gs didn’t know about them.
of the blogs that were updated in the last 30 days of the snapshot data, 71% had known rss/atom feeds.
this is the number of blogs added to the database each month. i wasn’t tracking this before september 2003, so the earlier number is all the blogs prior to that.
| month | total |
|---|---|
| earlier | 671455 |
| 2003-09 | 60205 |
| 2003-10 | 136789 |
| 2003-11 | 134431 |
| 2003-12 | 88575 |
| 2004-01 | 131432 |
| 2004-02 | 126280 |
| 2004-03 | 124106 |
| 2004-04 | 123790 |
| 2004-05 | 185993 |
| 2004-06 | 203727 |
| 2004-07 | 319432 |
| 2004-08 | 273679 |
| 2004-09 | 304170 |
| 2004-10 | 294377 |
| 2004-11 | 290778 |
| 2004-12 | 380422 |
| 2005-01 | 1472004 |
| 2005-02 | 789753 |
| 2005-03 | 491278 |
the big jump in january is from getting a feed that includes the livejournal data from pubsub.com.
here’s the count of blogs last updated in each given month:
| month | total |
|---|---|
| 2000-04 | 3 |
| 2000-09 | 2 |
| 2001-03 | 1 |
| 2001-05 | 2 |
| 2001-06 | 1 |
| 2001-07 | 3 |
| 2001-08 | 14 |
| 2001-09 | 3 |
| 2001-10 | 76 |
| 2001-11 | 56 |
| 2001-12 | 98 |
| 2002-01 | 2814 |
| 2002-02 | 4562 |
| 2002-03 | 3322 |
| 2002-04 | 2851 |
| 2002-05 | 2512 |
| 2002-06 | 2519 |
| 2002-07 | 2945 |
| 2002-08 | 3229 |
| 2002-09 | 3464 |
| 2002-10 | 3743 |
| 2002-11 | 3476 |
| 2002-12 | 3501 |
| 2003-01 | 5836 |
| 2003-02 | 7145 |
| 2003-03 | 9628 |
| 2003-04 | 14953 |
| 2003-05 | 68570 |
| 2003-06 | 71683 |
| 2003-07 | 84950 |
| 2003-08 | 93637 |
| 2003-09 | 105079 |
| 2003-10 | 116137 |
| 2003-11 | 121437 |
| 2003-12 | 88687 |
| 2004-01 | 103237 |
| 2004-02 | 108730 |
| 2004-03 | 109480 |
| 2004-04 | 106468 |
| 2004-05 | 142024 |
| 2004-06 | 157521 |
| 2004-07 | 262064 |
| 2004-08 | 254421 |
| 2004-09 | 264621 |
| 2004-10 | 271386 |
| 2004-11 | 278655 |
| 2004-12 | 339761 |
| 2005-01 | 694421 |
| 2005-02 | 1021597 |
| 2005-03 | 1661351 |
and based on the ip address, here’s the top hosts:
| host | count |
|---|---|
| unknown | 1942533 |
| blogspot.com | 1873034 |
| livejournal.com | 1445296 |
| spaces.msn.com | 307079 |
| persianblog.com | 64654 |
| journals.aol.com | 26333 |
| blogdrive.com | 23335 |
| blog.pt | 22388 |
| blogdrive.com | 18817 |
| blogware.com | 16109 |
| 20six.fr | 15900 |
| buzznet.com | 15748 |
| spammer | 15369 |
| canalblog.com | 14950 |
| blogdrive.com | 14695 |
| typepad.com | 13706 |
| blogdrive.com | 12686 |
| blogfa.com | 11585 |
| cocolog-nifty.com | 9902 |
needless to say, the entries for the spammer are gone now. blogdrive.com shows up multiple times because they use multiple ip addresses.
here is where i would insert all sorts of caveats about how these numbers are derived if i cared to hold people’s hands when dealing with numbers like this. these are free numbers, and you’re getting what you paid for them.
Add a comment
Sorry, comments on this post are closed.