Entries tagged 'blogging'
Or maybe flambéd
In Chris Ferdinandi’s article about different static site generators (SSGs), he says that they “store content in flat markdown files instead of a database” and that immediately brought to mind “Bake, Don’t Fry,” a blog post from 2002 by Aaron Swartz where he started exploring the concept of baked sites as compared to fried sites.
At the time, you had blogging systems like Blogger and Movable Type that stored their data in a database but generated static pages for viewing, and systems like b2/cafelog (this was pre-WordPress) and OpenACS that stored their data in a database but generated pages on the fly. The first blogging system that I remember that just used the file system to store all of its files was blosxom.
I started this blog on Blogger but quickly switched over to a homegrown PHP/MySQL system in October 2000. I have rewritten it from scratch once or twice since then.
I have been thinking about moving to a setup that baked and published a static version of my site. A reason I haven’t done it is because doing it all fried hasn’t been a performance problem so far.
I’m not convinced that I want all of the content in flat files and not a database. I guess I just don’t see the advantages of working with flat files for how I like to write, and I can’t imagine expecting less technical users deal with things like text files with front-matter and tools to manage those files.
It is strange that all of the popular static-site generators I’ve looked at have filesystem storage baked into their design. It should be rather straightforward to abstract that out a bit, and then you’d be able to pull your content from a database but generate static output. Like Blogger did twenty-five years ago.
Is this Twig or Jinja? Maybe both!
A project I have been playing around with the last couple of weekends has been making a Python version of this site. The code, which is very rough because I barely know what I’m doing and I’m in the hacking-it-together phase as opposed to trying to make it pretty, is in this GitHub repository.
I am using the Flask framework with SQLAlchemy and Jinja.
I was interested to see if I could just use the same templates as my PHP version, which uses Twig, but there have been a few sticking points:
- The Twig
escape
filter takes an argument to more finely control the context it is being used in so it knows how to escape within HTML, or a URI, or an HTML attribute. Jinja’sescape
doesn’t take an argument. I was able to override it take an extra argument, but mostly ignore it for now. - Jinja doesn’t have Twig’s ternary
?:
operator. Not surprising, Python doesn’t either. I rewrote those bits of templates to use slightly more verboseif
blocks. - Jinja doesn’t have Twig’s string comparators like
matches
andstarts with
. Looks like I can get rid of the need for them, but I just punted on those for now. - Jinja doesn’t have a
block()
function. I think I can also avoid needing it. - Jinja’s
url_for()
method expects a more Python-ic argument list, likeurl_for('route', var = 'value')
but Twig uses a dictionary likeurl_for('route', { 'var' : 'value' })
. I was able to override Jinja’s version to handle this, too. - I’ll need to implement versions of Twig’s
date()
function and filter.
I had cobbled together a way on the Twig side to let me store some templates (side navigation, the “Hire me!” message on the front page) in the database, so my next trick is going to implement template loaders for both the PHP and Python versions so that is more cleanly abstracted. I have the Python side of that done already.
I hope to eventually create a Rust version of this, too, and it will be interesting to see what new complications using Tera will bring.
Taste the rainbow
When I added the support for dark mode, I had to add a little support to the syntax highlighting styles so they looked okay in both light and dark modes.
Today I went in and worked on the JavaScript code a bit to make it a little more modern and refined the styles a little bit more. I also added the language file for YAML, and built it out so it does a better job of highlighting some of YAML’s more special syntax.
I am sticking with this JavaScript-based syntax highlighting for now, mostly because it works.
Here is a YAML sample pulled from the 1.2.2 spec plus a couple of minor additions to show off additional syntax that is handled.
%YAML 1.2
--- !<tag:clarkevans.com,2002:invoice>
invoice: 34843
date : 2001-01-23
bill-to: &id001
given : Chris
family : Dumars
address:
lines: |
458 Walkman Dr.
Suite #292
city : Royal Oak
state : MI
postal : 48046
ship-to: *id001
product:
- sku : BL394D
quantity : 4
description : Basketball
price : 450.00
- sku : BL4438H
quantity : 1
description : Super Hoop
price : 2392.00
tax : 251.42
total: 4443.52
shipped:
- false
comments:
Late afternoon is best.
Backup contact is Nancy
Billsmer @ 338-4338.
Embrace the dark
I finally took some time to implement a “dark mode” style for this site. It will default to whatever you have your system set to, but there is also a set of radio buttons that you can use to force it to a particular mode. (It doesn’t save the setting, so you will have to do it on every page if you are browsing around.)
This talk by Sara Joy about light mode and dark mode was part of the inspiration for getting around to this, and this article about light-dark
from Bramus is where I stole borrowed the idea for handling the mode switching.
Because I am using the relatively new CSS light-dark()
function to implement this, it is possible (likely?) it will degrade less than gracefully on older browsers.
Now with Markdown, finally
Since the beginning (almost 24 years ago), my entries here have always been written in HTML with just a simple expansion of blank lines to create paragraphs.
A little over 20 years ago, a syntax for marking up plain text called Markdown was released and it has proven to be a fairly persistent and resilient success. And technically, my HTML with blank lines is compatible with Markdown.
That means that now I have plugged in a Markdown parser where my paragraph expansion was, and it just works and I can start using other syntax.
This is all heading down the path where I can plug in more text transformations, like server-side syntax highlighting for code.
<?php
echo "That would be nice.";
?>
(That won’t be highlighted yet, but whenever I do plug in syntax highlighting it should just happen.)
I’m using the Parsedown PHP library for turning Markdown into HTML, but it seems to be unmaintained and I may swap in something else.
It is possible that the Markdown parsing will mangle some old entries, but a quick scan of entries from this year looks okay, and I can easily fix stuff as I run across it. (I did already fix one very old entry that had some mangled HTML which just kinda worked before but got escaped with the new parser.)
Who is getting the ping data?
The excellent journalists at 404 Media uncovered information about Automattic selling data to “AI” companies from WordPress.com and Tumblr. A number of people have jumped in to say that “WordPress.com != WordPress” which is a fair point, but this is where I touch the onion on my belt and point out that even WordPress sends pings to Ping-o-Matic by default and does anyone know who gets a real-time feed of that data?
Making a smaller footprint
I have been working on shutting down some of the (virtual) servers that we used for the store and other things. There were three: one for the store’s website, one for some WordPress sites/blogs, and one for this site.
The server hosting the store was the most expensive one, but now the site has been reduced to just a couple of static pages, I set those up to run off Bunny.net (that’s a referral link). That should cost a few pennies per months, so the $40 credit I have should cover that indefinitely.
I haven’t figured out what to do with the WordPress blogs, but none of them are active and I’ll probably just turn them into static sites and host them on Bunny.net, too.
The last thing is this server which could be a static site but I like tinkering my little blog engine and think I want to keep it dynamic. I may move the hosting over to Oracle Cloud Infrastructure (OCI) because they offer some “always” free services that would be more than adequate. I have been really curious to play with their Arm-based compute servers but there has been no availability in the region I picked when signing up.
I need to decide how I want to set up the server on OCI to run this. I’m currently using a pretty simple docker-compose.yml
setup to handle running a php-fpm
container, a MySQL database server, and Sphinx, with the system nginx
fronting things. I already know I want to replace Sphinx, and I think I want to try using Caddy on the front. I guess if I really wanted to be extra and take advantage of the free resources from Oracle that I’d spin up an Autonomous Database instead of using MySQL.
Between credits I have with Bunny.net and Oracle’s free offering, I’ll be paying almost nothing for hosting all of this except for the domain name renewals. That soothes my cheapskate soul.
Now with a different style
It has been almost six years since I did much to the design of this place, and I figured it was time to pay a little attention to it. Nothing major, just some tweaks to the colors, finally applying a little more style to the forms, and generally just cleaning things up. I will keep tweaking things. I have an SVG version of the “Barrel of Monkeys” monkey that I want to incorporate into the logo but I want to try and simplify it and get it down to a size where I am okay with putting it inline into the HTML.
I am using CSS nesting for the form styles which should be okay with recent browsers. Everything should degrade pretty sensibly, or maybe abruptly. It is great that the state of the art for CSS is advancing fairly quickly again and it is possible to do fun stuff without having to adopt big frameworks or tools that require a build step.
I used Adobe’s (free) Color Contrast Analyzer to pick the colors and (try to) make sure they were accessible.
Not sure what’s next, but dark mode is on the list somewhere.
Same as it ever was
As I add (or re-add) features to this blog again, I find myself spelunking in some of the archives and hitting a lot of dead links. I was a little curious to see what the oldest still-alive link was, and it brought me to this very short link post from the year 2000 to this article by Greg Beato about a media startup called Verde.com that made me think of recent stories about The Messenger, a media website that went bust in under a year.
I have to hold back from digging into this sort of thing too deeply, or I start to wonder what happened to Scient, or Peek Garlington, and suddenly a day is gone chasing geese.
I also came across this old post about the Los Angeles Times partnering with a technology company to launch an RSS reader application and got a little sad that I have never managed to work my way into a job that is journalism-adjacent. Not because I want to take the money and run, but because I think it is a space where I could tremendously effective given enough of a mandate to do so.
Titles are where I can be abstruse
When I added auto-posting of entries here to my Mastodon account, it just took the title of the entry and posted that along with the link. But those end up being kind of cryptic, so now I made it possible to actually write the Mastodon post alongside the blog entry.
Just a small step in building out this online presence more fully and making sure it’s connected to other places more thoughtfully.
The next thing I want to do is build a new home for my photos. I stopped renewing my Flickr Pro subscription. I’ve thought about setting up a Pixelfed instance but that seems like overkill. I may use it as an excuse to build something in a language other than PHP because my resume could use that. Nearly all of my non-PHP work has been lost to me because it wasn’t open source.
New year, who dis?
With the store now mostly behind me, I’m planning to get back to writing more, baking and cooking more, playing with more technology, and just generally casting off the shackles of small business ownership and all that has entailed (and derailed).
I have also finally decided to find the shift key and get past that all-lowercase-affectation that I stuck with for the first twenty-three-ish years of this site.
No resolutions, just some hopes about directions.
(Also, eschewing corporate social media for now even as a consumer. If the internet is going to get weird again, I want to swim in that.)
now with new comments
not that i think that there is anyone reading this, but you can now comment on entries for seven days after they have been posted.
you can throw html in your comment, but it will get filtered by html purifier.
spam will be deleted promptly if it wasn’t already blocked or sequestered by akismet.
now with old comments
slowly rebuilding the scaffolding beneath this blog and i re-enabled the display of old comments, which led me to re-discover the time when random people would get mad at me because i mistakenly said redman was a member of the wu tang clan.
what really cracks me up in the comments is my wife celia casually noting that she had dinner with odb and had talked to him about being confused as to wu tang clan’s membership.
you can’t post new comments. i am still deciding if i want to implement that again and deal with all the nonsense that will come with it.
now affiliated with bookshop.org
one of the features i have had in this blogging software since almost the beginning is some simple expansion of pseudo-urls, one of them being isbn:###
. until recently, that would take you to amazon with an (old, expired) affiliate tag.
bookshop is a service that lets you order from one of the major book distributors (ingram) and kicks a portion to affiliates and independent bookstores. because they’re just connected with the one distributor, not every book is available, but i’m happier with broken links for those books than directing any traffic towards amazon.
and if you happen to buy a book that i have mentioned here using one of those links, i get a little commission. maybe i’ll be able to buy a potato with the proceeds some day.
(i do virtually all of my book reading using the libby app with my los angeles public library membership.)
tooting while blogging
i figured that i should do something clever like automatically post to my mastodon account when i posted here, but i was surprised to find that the state of mastodon api clients for php is pretty sad. php-mastodon was what i used to get it working, but it's really an incomplete implementation and the error handling is pretty much non-existent so it took way longer than it should have to get going.
(and put me down as someone who is glad that “tooting” is being pushed into the background as the term of art for posting on mastodon, but couldn’t resist using it this time.)
made a new saddle
while being able to write entries and send them via email seemed like fun, the reality is that the setup was fragile. so it was enough of a hurdle to writing anything here that i rarely wanted to deal with it.
i do want to write more here, so i knocked together a basic web interface that will allow me to do that.
the biggest thing that i still haven't figured out is how i want to handle is images. i could go back to using flickr and embedding from there, or i could implement a basic media library. i think the long-term solution is probably doing it myself because that's kind of the reason for this place.
migrated to slim framework 4
a couple of weeks ago i finally took some time to upgrade the code for this blog to the latest major version of the slim framework. it is still a strange mash-up of framework and hand-coded sql queries and old php code but this should make it easier to for me to tinker with going forward. the main thing i need to do is add a way to post images again.
another three years
another three years between entries. some stuff has happened. the store is still going, and i am still finding excuses to code and learn new things.
i wrote before about how i was converting scat from a frankenstein monster to a more modern php application built on a framework, which has more or less happened. there’s just a little bit of the monster left in there that i just need to work up the proper motivation to finish rooting out.
i also took what was a separate online store application built on a different php framework and made it a different face of scat. it is still evolving and there’s bits that make it work that aren’t really reflected in the repository, but it’s in production and seems to sort of work, which has been gratifying to get accomplished. the interface for the online store doesn’t use any javascript or css frameworks. between that and running everything behind cloudflare, it’s much faster than it used to be.
now i’ve done it
i noodled around a little more, and came up with an ugly way to supply and extract tags in my postings via email. the next step will be extracting attachments so i can post images again.
progress
yes, this is still on. and now it is on a new server with new code, even though it looks the same.
i still need to knock together the "writing a post" bit of code so that i can post without using manual SQL queries. details.
stirrings
it turns out that this thing is still on. it is kind of funny to me that it still just chugs along, but such is the joy of writing your own software running on your own (virtual) server. it is long past time to rebuild the infrastructure here, which may or may not happen. but it is something that i am thinking about again.
comments closed
i get tired of dealing with spam on very old entries here, so the comments now automatically close on posts after one week. you know, on the off chance that i ever post anything of interest again.
plus c'est la même chose
you might remember a little over a year ago, when automattic acquired blo.gs. no signs of life yet.
blo.gs lives?
so today i happened to stumble across the news from last week that yahoo “transferred” blo.gs to automattic.
i guess i can finally let the domain names lapse that i had planned to create a service like blo.gs on.
the march of time continues
i haven’t had much to say. very out of character, i know.
or maybe i’ve had a lot to say and don’t know how to say it.
i am looking forward to the dr. horrible’s sing-along-blog session at paley fest 2009.
maybe i’ll write about that next month.
boring from another continent
as celia wrote earlier, we are in riga, latvia for a meeting of the mysql developers. she is holed up in the hotel room working on a screenplay (or maybe in the atrium where the wifi is better), and i am in a presentation about blogging.
celia already posted pictures from our excursion day on sunday (the day we didn’t sit around in the meeting rooms at the hotel). i took some video which i will figure out how to deal with once we are back home.
don’t ask too many questions
chyrp is a nice looking piece of blog software. individual posts can have different styles, something it borrowed from the hosted tumblr service. i was interested to read about “the sql query massacre of january 19th, 2008” but the numbers gave me pause — 21 queries to generate the index page? that is down from an astounding 116, but that still seems ridiculous to me.
the number of queries to generate the index of this site? two. one of them is SET NAMES utf8
. i could see boosting that to three or four if i moved some things like the list of links in the sidebar into the database, or added archive links. call it five if i had user accounts.
but right now, the number of queries used to load the index page on a chyrp site grows with the number of posts displayed on the front page. not only that, it grows by two times the number of posts on the front page.
chyrp could use a security audit, too.
new voices
if downtown los angeles was the 3rd bloggiest neighborhood six months ago, things are really exploding now. walter has become frighteningly prolific at scribeskidrow. joe, of joe and ruby fame, has started a blog at big city poz. the guys at angelenic are doing a fantastic job of digging up the news on the downtown development front. white boy/white dog has all the news from 2nd and main. nearby, the lofty dog is one of the first businesses to start up a downtown blog (disclaimer: with my wife and dog’s help). bert green also has a blog as a business owner, resident and player. and dodger is another downtown dog with a blog.
that is a whole lot of new blood in the downtown blog-la-sphere. but i think my favorite part about is that i have been lucky enough to meet all of these new bloggers in person, and they are all fantastic people (and dogs!).
at a party the other night, i was talking with another of our new downtown friends (not yet a blogger) who mentioned how small-town that downtown feels. it is amazing to see how this community is growing up, and to be a part of it while it happens. and i know that the little network of people i know is just one of many such networks downtown.
but enough about me, what do you think of me?
in his infoworld blog, zack urlocker (vp products at mysql) passes on a good link about smaller software teams. and says very kind things about me, since he read the article after i posted it to our internal business-intelligence list. i used to report directly to zack, but i have managed to shoehorn in three other people between us on the orgchart since then.
that business-intelligence list is kind of a funny beast. it is mostly industry news (who bought who), with some interesting mentions of mysql in the press and blogs, and my ongoing implicit criticism of our development processes. it would make a pretty good blog. i should at least start posting the things i have been sending to the list.
and so the puppyblogging begins
5 things you didn’t know about me
i knew that dave would get me back for tagging him eventually.
- i am a published poet.
- i have appeared on a local newscast.
- i did telemarketing (cold-calling) for an insurance agent.
- i used to work the weekend opening shift at burger king.
- i failed two classes in my major in college.
now for step 3
over at franklin avenue, mike coined the ridiculous term blog-la-sphere and a few misguided domain-name look-ups later, i ended up registering blog-la-sphere.com
(and variants). now i’m proud to launch blog-la-sphere. or relieved. or mildly amused. or maybe i just ate something bad for breakfast.
i don’t like “better feed”
i finally saw someone make mention of the plugin that adds the load of awful crap to the end of some people’s blog entries which clutters things up on sites like planet mysql. it’s called “better feed.”
clearly, “better” is in the eye of the beholder. i find it to be an eyesore.
i asked, and she said yes
so we started a new blog.
not quite formless
last night was the ambassador hotel wake, which mack of lavoice.org has already covered (although our paths did not cross). there was a large number of los angeles bloggers there: mike of franklin avenue, will and jillian of blogging.la, josh of curbed la (no, i don’t know which one, but i’m just “a reader”), carolyn of laist, kevin of la observed, joseph of martini republic, shannon, and my girlfriend celia.
there may have been more we met and i forgot, and almost certainly more that we just didn’t meet.
toast a dead building
next january 24, there will be a wake for the now-gone ambassador hotel at the hms bounty, across the street from the hotel (or where it was).
unfortunately, this is the same night as craig “craigslist” newmark’s appearance at the los angeles central library. maybe i’ll try to make both, but it seems more likely that i’ll ditch the library thing.
update: or not. the date for the wake will likely be shifting, as the event may combine with another that the los angeles conservancy was planning.
déjà vu
laist used on of my photos on a recent entry, which was kind of strange. i thought the picture looked very familiar, and then saw the shout-out at the bottom of the entry and it made sense. here’s the original.
and to be clear, there’s no problem at all that they used the picture like this. all my pictures on flickr are tagged with the creative commons attribution license, and that’s only because there’s no “public domain” option in the flickr licensing widget.
i bet you think this song should be about you
this “outside the tent” piece whining that the recent los angeles times blogging article didn’t feature political bloggers is colossally dumb. “wah, the times didn’t write about me and my friends. they suck!”
lack of a discernible soul
if you read any los angeles blogs, expect to see a lot of mentions of this los angeles times article about los angeles blogs. no, i’m not mentioned.
someone had to be last
okay, since everybody else has finally tossed a link their way, now i have to point out the new curbed la, a blog about los angeles real estate and neighborhoods. they’ve gotten off to a great start.
it finally dawned on me what the building was in their header — it’s city hall. you’d think i would have picked up on that sooner considering i can see city hall from my desk.
what color is my parachute?
evan williams (of blogger and odeo) says that most people read blogs via their web sites, and not an rss reader. (he doesn’t cite any evidence, but i believe it is probably true.) this was in response to information from bloglines about how many feeds “matter.”
i’m one of those — i like reading personal blogs via their web pages, not in an rss aggregator. it is why i built blo.gs. it is why i might have to build something to replace blo.gs, now that blo.gs has become nearly useless for tracking updated blogs.
why do i prefer to read blogs via their sites and not rss? i like to see jason kottke’s remaindered links, reviews, and longer posts in their overall flow. i like to see the random pictures and latest comments on blogdowntown. i like to see what picture of herself shannon has decided to feature now. i like to look at the comments left on recent posts i found interesting by jeremy zawodny.
and for all of the blogs i read, the design of their site gives me a little context, and a little reminder of who they are. (not in the “who the hell is that?” sense, but the “hey, shannon likes pink!” sense.)
five years and two days ago, i wrote my first blog entry. i started out using blogger. four days later, i made my first entry with my own blog software.
mostly the truth
jakob nielsen’s list of the top ten design mistakes in weblog usability are generally reasonable advice, but i think it is funny that cory at boing boing decided to call out the one i probably pay the least heed to: #3, nondescript posting titles.
it’s not that my titles are nondescript so much as non-sequitors, or inside jokes so inside that i’m the only one who could possibly understand them. some recent highlights:
- “mostly the truth” came about because i noticed i had several strategic uses of the weasel-word mostly in this entry. and now i’ve applied it to the whole thing.
- “not bursting into tears” is a reference to the fact that there were numerous times during the day where i was literally on the verge of bursting into tears. (mostly because of my back, which appears to be fine now. knock on wood.)
- “don’t get the wrong idea” is the antidote to the content, which is really deliberately crafted to feed various people’s suspicions that shannon and i are involved.
- “let’s all go to the lobby” is a pop culture reference, of course, to the classic movie theatre advertisement.
- “mmm, rabbits,” besides being a part of the quote i was calling attention to, is sort of an implied simpsons reference.
- “solid gold” is what the students in the film say the professor is made of, because he is so pure.
- “i say hello” is about as obscure as i can get — i originally wrote a bit about how my back pain had flared up to the oh-my-god-where’s-the-vicodin level, and this is a line from a song called “vicodin” by the trainables, a band you’ve never heard of (unless you’re marcus, who was in it).
- “tax that fellow behind the tree” is a fragment of a quote from russell long: “tax reform means “don’t tax you, don’t tax me, tax that fellow behind the tree.’”
- “the soul of this man is in his clothes” is a quote from william shakespeare.
- “more on me and friends” is actually a bit of a play on words — say it out loud, and it is “moron me and friends.”
- “she sets the summer sun on fire” is a line from “i want candy” by good charlotte.
- “looking minnesota, feeling california” is a reversal of a line from “outshined” by soundgarden.
- “flat denial” is one of those titles that i look back on and don’t understand. as i said there, it was in reference to what i started writing about, but i don’t remember what that was any more.
and then there are all the entries i don’t even title.
at the end of the day, i’m (mostly) writing for me, so i like to think that gives me license to flaunt jakob’s list.
weblogs.com was acquired by verisign. congrats to dave!
feedtree is a peer-to-peer system for distributing weblog updates, bearing at least a passing resemblance to what i described once upon a time.
i wonder how well it would handle having the livejournal firehose hooked up to it, and if it is of any use to a service like pubsub that would want to get everything.
she sets the summer sun on fire
the los angeles times interviews christine moore of the little flower candy company, who makes what sound like delicious caramels.
speaking of candy, candy blog is one of the coolest blogs i’ve found through lablogs.com. but after my bad reaction to the last candy bar i had, i’ve been staying away from chocolate bars. very sad.
new design
in the spirit of just getting it done, i kicked out the new design. as usual, it is best viewed from my desk. (second to that, using safari to get the ever-changing color effect. or possibly the latest firefox beta releases.)
i also “upgraded” the feeds to almost-valid atom 1.0, and got rid of the rss feeds. it is almost-valid atom 1.0 because apparently the atom 1.0 standard requires a non-empty titles. i wrote what i think about that three years ago.
atom 1.0 is even worse than other formats in this regard, if the complaints of the feed validator are to believed. not only is the <title> element required on each entry, but it can’t even be empty. nuts to that.
other things are undoubtedly broken. some day i may fix them.
i feel like i’ve already seen this discussion about elevator/metro etiquette.
kaj arnö is the latest mysql executive to get bitten by the blogging bug. he’s going to be taking up the mantle of vp/community relations soon. zack urlocker, our vp/marketing, was the first executive to jump on the bandwagon.
doc searls has sometimes explained how he blogs as just answering emails in public.
it has come up in some recent conversations with people i know that they keep up with what i’m up to by reading this site. (which, it frightens me to say, is probably pretty effective. there’s not much that goes on behind the curtains here.)
in conversation, i ask few questions and give short answers. i wonder if blogging has become a sort of conversational crutch for me. by blogging about something, i don’t have to talk about it.
or maybe by blogging about something, i figure out a good answer for the questions that nobody needs to ask because they’ve already read the answer.
cnet news.com reports on the results of the pew internet study that says for nearly half of the bloggers surveyed, blogging is a form of therapy.
that doesn’t surprise me. a lot of what i’ve been blogging recently has been motivated by that.
i felt like i’ve been posting a lot lately — here’s a graph of posts per month since i started (way back in october 2000).
for the 1792 days i’ve been blogging, i have averaged 1.38 posts per day. my longest entry was my notes from the vsda home entertainment conference, the shortest entry was the percentage chance i gave to moving out of my then-current apartment.
trying to read through my archives can be a bit of a chore. it’s one of the things i’m looking to fix with the redesign that i’m slowly working on. so far, it’s not really too radical of a change — but i’ve just been focused on removing things.
businessweek’s blogspotting noticed in a wall street journal article that technorati is making deals to get ping data first, or at least that’s the claim in the article. it would be interesting to know who those deals are with. i can’t imagine it’s with either blogger (google) or six apart.
of course, i’m not sure it helps for technorati to have the data first if nobody can ever actually get at it because of their persistent inability to handle their search load.
jeremy zawodny says that “trackback is dead” and jason kottke says “so long, technorati”, which are both observations i find fairly interesting.
but not so interesting that i have much of anything to add. i don’t get into the stats/links/whatever navel-gazing side of blogging very often. it’s been nearly a couple of years since i’ve even bothered to run stats on the logs for this site.
the only stat that really matters most to me are the number of cool people i’m connecting with, and if they’re just reading, it’s not a very meaningful connection.
three of a perfect pair
metafilter is a funny place. i find myself generally more interested in ask metafilter and metatalk than the main site. some of the questions/answers on ask can be fascinating, and metatalk is good for entertainment value.
coverage of the blo.gs purchase
i think the dialpad acquisition stole my thunder. (that’s not a bad thing.)
blo.gs has been acquired by yahoo!
the sale of blo.gs has been completed, and i'm proud to announce that yahoo! has acquired the service. as of right now, give or take a few minutes, yahoo! is running blo.gs.
this is the sort of good home that i was looking for — yahoo! obviously has the resources to run and improve blo.gs in pace with the incredible growth of blogs (and syndication in general), and in talking with them it was also clear that we had some of the same vision for the future of the service and the ping/notification infrastructure.
for users of the website and the cloud interface, nothing much is changing. the service will continue to be completely open, and both yahoo! and i hope you continue to use it and help it grow.
even though i’ll no longer be operating blo.gs, i'm not going to disappear from the community. i’m still very interested in blogging and syndication, and believe that blo.gs will continue to have a major impact as a key player in the evolving ping and blogging infrastructure.
some people have asked about the privacy policy during the transition. yahoo! is keeping the blo.gs privacy policy. the data collected on blo.gs will continue to be subject to that privacy policy and you will be given the opportunity to consent to future changes.
boing boing corrects error
in a posting about a comic strip by charles schultz featuring adults characters, mark frauenfelder wrote that peanuts never showed grown-ups, which i knew to be false thanks to the fantastic complete peanuts books.
i forgot to mention that the reason i ever got the books to know this is when mark blogged about the complete peanuts books last year.
in fact, i just ordered li’l beginnings after seeing that in the archives to dig up the other posting.
last night i went to an event at the central library put on by the greater los angeles chapter of the society of professional journalists, a panel discussion on “ethics and entertainment journalism.”
there were only about two dozen people in the audience, and five on stage, but it was a good panel discussion, and managed to go on for nearly two hours. there were five panelists: a news director from e! entertainment television (who admitted that sometimes she thought about getting out of the business because of the direction it is headed), the west coast editor for playboy (responsible for booking celebrity covers and centerfolds), a former reporter for us weekly, an associate los angeles bureau chief for people, and a reporter for the los angeles times.
something that struck me as i left the event was that on the whole, the ethical issues the panel talked about were being handled pretty responsibly (at least as they told it). while it seems like entertainment and especially celebrity journalism is where you would see the most compromises, i can’t think of instances where it has failed as utterly as the business press did in covering the dotcom bubble or the national press does in covering just about everything political (like the justification for the war in iraq, for example).
something that didn’t really come up at all is how online journalism fits into the picture, although the woman from people made a comment about how today’s writers for gawker and defamer may be the editors of tomorrow’s spy and then eventually an editor at time. i’d say that as things are going, tomorrow’s spy and time are going to be online. it remains to be seen whether those publications will “come down” from the printed world, or be born on the net.
(i originally spotted the event over on la observed. i’m such a sucker for excuses to go to the library.)
not journalists
boing boing passes on ross mayfield’s report that the number of books you can carry on to planes will be (and is) limited. this is the sort of case where a journalist would do something tedious like contact the tsa and actually find out what the reality is, rather than relying on second-hand information that originated from a screener.
i think it is pretty clear that the policy is in regard to books of matches, regardless of ross’s impression of what the screener was talking about.
this is the sort of thing that undermines the bloggers-as-journalists argument.
pinhead
there’s a very persistent trackback spammer who i have managed to complete shut down. unfortunately, my trap for this pinhead is currently set up to email the blocked trackbacks to my email. so every time he does one of his 150 trackback spam runs, i get 150 of these dumb emails.
i need to flesh out my comment/tracback spam blocking a little more so i can flag some of these to just get dropped entirely.
does anyone else find it odd that nobody who is participating y!q for publishers beta has disclosed that they’re getting paid to put it on their site?
(and why do the results pop up in-page on the page i’ve linked to, but not on anybody’s blogs?)
andy baio, of upcoming.org fame, writes about “wordpress website’s search engine spam”. it certainly adds a new flavor to this entry about how many sites link to wordpress.org.
this isn’t entirely without precedent. they’re not there now, but for a while there were small text-links at the bottom of the phorum website that were obviously capitalizing on their pagerank. i don’t know how the phorum developers may have disclosed that.
the intersection of open source and business is always pretty messy, and i believe even more so when you’re starting with a community project. (as opposed to something entirely or mostly developed by those trying to start a business around it.)
that’s another way to go
more fun with stats
the number of unique blogs added to or updated in the blo.gs database, per day:
date | count |
---|---|
2005-01-18 | 395240 |
2005-01-19 | 391968 |
2005-01-20 | 407524 |
2005-01-21 | 383478 |
2005-01-22 | 355494 |
2005-01-23 | 364662 |
2005-01-24 | 402927 |
2005-01-25 | 323980 |
2005-01-26 | 387421 |
2005-01-27 | 418325 |
2005-01-28 | 307999 |
2005-01-29 | 316423 |
2005-01-30 | 327232 |
2005-01-31 | 353976 |
(the data for this is a couple of months old because i stopped logging this information. the way i was logging it was having trouble with the volume of data, and i lacked the interest to do it right.)
blo.gs growth
some people have been interested in the growth of the data that blogs collects. here’s some numbers. this is from a snapshot of the database from sunday morning, so it isn’t to-the-minute.
of the 6,602,676 entries in the blo.gs database snapshot i used, rss/atom feeds were known for 2,512,959 of them. that doesn’t mean more didn’t have rss/atom feeds, just that blo.gs didn’t know about them.
of the blogs that were updated in the last 30 days of the snapshot data, 71% had known rss/atom feeds.
this is the number of blogs added to the database each month. i wasn’t tracking this before september 2003, so the earlier number is all the blogs prior to that.
month | total |
---|---|
earlier | 671455 |
2003-09 | 60205 |
2003-10 | 136789 |
2003-11 | 134431 |
2003-12 | 88575 |
2004-01 | 131432 |
2004-02 | 126280 |
2004-03 | 124106 |
2004-04 | 123790 |
2004-05 | 185993 |
2004-06 | 203727 |
2004-07 | 319432 |
2004-08 | 273679 |
2004-09 | 304170 |
2004-10 | 294377 |
2004-11 | 290778 |
2004-12 | 380422 |
2005-01 | 1472004 |
2005-02 | 789753 |
2005-03 | 491278 |
the big jump in january is from getting a feed that includes the livejournal data from pubsub.com.
here’s the count of blogs last updated in each given month:
month | total |
---|---|
2000-04 | 3 |
2000-09 | 2 |
2001-03 | 1 |
2001-05 | 2 |
2001-06 | 1 |
2001-07 | 3 |
2001-08 | 14 |
2001-09 | 3 |
2001-10 | 76 |
2001-11 | 56 |
2001-12 | 98 |
2002-01 | 2814 |
2002-02 | 4562 |
2002-03 | 3322 |
2002-04 | 2851 |
2002-05 | 2512 |
2002-06 | 2519 |
2002-07 | 2945 |
2002-08 | 3229 |
2002-09 | 3464 |
2002-10 | 3743 |
2002-11 | 3476 |
2002-12 | 3501 |
2003-01 | 5836 |
2003-02 | 7145 |
2003-03 | 9628 |
2003-04 | 14953 |
2003-05 | 68570 |
2003-06 | 71683 |
2003-07 | 84950 |
2003-08 | 93637 |
2003-09 | 105079 |
2003-10 | 116137 |
2003-11 | 121437 |
2003-12 | 88687 |
2004-01 | 103237 |
2004-02 | 108730 |
2004-03 | 109480 |
2004-04 | 106468 |
2004-05 | 142024 |
2004-06 | 157521 |
2004-07 | 262064 |
2004-08 | 254421 |
2004-09 | 264621 |
2004-10 | 271386 |
2004-11 | 278655 |
2004-12 | 339761 |
2005-01 | 694421 |
2005-02 | 1021597 |
2005-03 | 1661351 |
and based on the ip address, here’s the top hosts:
host | count |
---|---|
unknown | 1942533 |
blogspot.com | 1873034 |
livejournal.com | 1445296 |
spaces.msn.com | 307079 |
persianblog.com | 64654 |
journals.aol.com | 26333 |
blogdrive.com | 23335 |
blog.pt | 22388 |
blogdrive.com | 18817 |
blogware.com | 16109 |
20six.fr | 15900 |
buzznet.com | 15748 |
spammer | 15369 |
canalblog.com | 14950 |
blogdrive.com | 14695 |
typepad.com | 13706 |
blogdrive.com | 12686 |
blogfa.com | 11585 |
cocolog-nifty.com | 9902 |
needless to say, the entries for the spammer are gone now. blogdrive.com shows up multiple times because they use multiple ip addresses.
here is where i would insert all sorts of caveats about how these numbers are derived if i cared to hold people’s hands when dealing with numbers like this. these are free numbers, and you’re getting what you paid for them.
the wealthy blogger is new blog about money-management issues by my coworker mike hillyer, and non-coworker jeremy c. wright. a couple of canadians talking aboot money, eh.
on a similar note, i ran across liquid ledger today, which looks like nice money management software for mac os x. the current version doesn’t do any portfolio tracking, but it is promised for the next major version.
matt haughey noticed a bogus blogspot site in his spam. when i was on the suicide mission known as homepage.com (second-generation geocities), it was software pirates that caused us huge problems with their automated signups. i can imagine it’s only worse with the spammers and scam artists these days.
i’m surprised that blogger doesn’t seem to have done much to prevent this. you can see the automated crap via searches at blo.gs for things like “herbal” and “hilton”.
it would be helpful if services like blogspot published information on sites that are deleted as well as updated, so services like blo.gs, technorati, feedster, pubsub.com, etc could drop the sites from their databases, too.
more than 75 emails
i always like when someone throws out a number that is apparently designed to impress, but really seems rather pathetic in comparison to something else. like this washington post ombudsman piece about the response to an article about declining circulation. over 75 emails, letters, and phone calls! here’s a /. posting about apple getting a favorable ruling in its case against mac daily news that has nearly 700 comments.
geourl, a real-location-to-web-location service that used to be run by joshua schachter (also infamous for del.icio.us), has been taken over by ask, who is breathing renewed life into it.
24weblog, a weblog about the television show 24, is the closest weblog of those near me. but that weblog is actually from someone in england, so apparently the address they happened to choose to correspond to the show’s los angeles setting is just around the corner from me.
congrats to mark fletcher and the bloglines team on being acquired by ask jeeves.
once upon a time, a business development person at ask jeeves called me to talk about blo.gs. as i recall, the gist of the conversation from my side was that the site made no money, i spent no time on it, and there really wasn’t much to it.
this is another of those obvious-in-retrospect services that i can now kick myself for not building. having the vision, courage, and patience to execute on the idea is the hard part, of course.
at last year’s foo camp, when the limping feedmesh thing kicked off, someone suggested that we set up a yahoo group for discussion, and i made some comment about preferring something “more real.” a funny thing to say when one of the guys who built it (mark) was in the room. but in hindsight, i’m glad nobody wasted the time trying to create anything more real.
who’s next? i would think pubsub.com would be a likely acquisition for someone.
i also find this acquisition funny because there was a time when i almost ended up working at ask jeeves because i knew the ceo at the time. i got an early-morning call (or what was early-morning for me those days) where i agreed to fly up to interview, but i turned around a few hours later and cancelled, once i was awake and realized that i had no desire to work at a place using iis and asp, or relocate to the bay area.
typepad moderation and spf
i left a comment on a typepad-hosted blog, and got the bounce message resulting from the email that typepad sends to the blog owner with the comment. typepad forges the SMTP envelope sender of the message to be the email address used in the comment, and that failed the spf check on the owner’s mail system.
so if you are a typepad blog owner that uses a mail system protected by spf (or other spam measures that make decisions based on the SMTP envelope sender), you can get comments posted to your blog for which you will not see the notification.
(and to be clear, typepad is in the wrong here. it has no reason to forge my email address as the envelope sender. and remember, the SMTP envelope sender is not the same thing as the From header.)
rumor du jour
rumor has it that six apart (makers of movable type blogging software and typepad blogging service) are going to buy live journal (and by live journal, i think they mean danga interactive). it seems like it would be a good fit from what i know of the people involved in both companies and their development platforms. (they’re both perl shops.) and six apart would be getting some of the folks doing the most interesting low-budget, open-source web scalability work that i’ve seen.
rumors that anyone is about to buy blo.gs are completely untrue. unless they aren’t.
2005 prediction?
preshrunk is a new blog dedicated to cool t-shirts you can buy online. i bet we’ll see a whole lot of these blogs focused on little niches like this. (with the existing ones like pvrblog and the gizmodo and weblogs, inc. empires being the pioneers here.) here’s one i thought up over pancakes this morning: a blog dedicated to american animation. anime is well-covered, i think, and i don’t have any interest in anime.
oh, and many of these will be obvious-in-retrospect sorts of ideas. i pointed out the up-swing in online t-shirt retailers back in october.
pingback client
to kill time (see previous entry), i’ve implemented pingback 1.0 client code for when i post. if it works, this entry will ping the spec and ian hickson’s initial announcement of pingback. what are the odds it will work on the first try? answer: not good. it took two tries.
next time i need to kill a few moments, i’ll do the server side.