Local boy makes good

It’s always interesting to hear about (or from!) people you know but have lost touch with.

The beauty of the synchrotron is the fact that it’s not a specialized tool anymore, said Ian Coulthard, a staff scientist at the facility who moved to Saskatchewan from Ontario.

Ian was one of the long-time members of my Reach for the Top team in high school, and along with Murray (now a priest), Jason (now a doctor) and several others was a powerhouse in the competitive trivia game. His specialties were science, as you might guess from the quote above, and… British royalty (!).

(For those of you who are only familiar with the current wannabe-hip RFTT, it used to be a lot better. I should know: I have the tapes.)

Anyway, it was good to hear Ian’s voice again tonight via the national airwaves. It’s been too long….

Greatness thrust upon them

Nominally via Tim Bray (it’s the next article down from the one he links to):

In your heart, you know we have to get Don Cherry elected as The Greatest Canadian…. It would annoy everyone worth annoying in this country.

I’ve never been a fan of Cherry’s, but Cosh points out some of his accomplishments that are often glossed over. Do they make him the greatest Canadian ever? Hardly… but he’s not as bad as he gets the rap for being, either.

Super-Size This!

Reported by my parents this afternoon (emphasis mine):

While I was waiting to pay at Wal-mart the clerks were discussing the movie Super-Size Me. They had just taken it off the shelves. Wal-mart (or McDonald’s) had ordered it off the shelves because of the relationship with McDonald’s.

E-mail

I received an interesting e-mail yesterday. It’s a good thing I check my spam folder from time to time.

Bad bots (what you gonna do?)

(Written a month ago, but not posted until now due to lack of editing. I figure if it’s public I’ll be shamed into updating it.)

The following list is gathered from observations of two somewhat-related events. The first event is that I inadvertently sent out Trackback pings from a local copy of my weblog, which set the blogbots all a-twitter; when I realized what I’d done, I began to block access to the URLs in question with HTTP status code 410 (Gone). The second event is an examination of my access logs, in which I discovered (and blocked, generally with code 403) spambots and crawlers that I felt were poorly-behaved.

Each criterion for badness is a violation of an element in Mark Pilgrim’s aggregator behaviour specification. I’ve interpreted SHOULD as MUST, as I’m unaware of any good reason for these agents to not follow the suggestion. This isn’t a complete list of user-agents or errors, it just lists the most common ones I’ve seen. Additions and updates are welcome.

User-AgentFlails?Ignores 403?Ignores 404?Ignores 410?Referrer spam?Unconditional retrieval?Notes
Googlebot/2.1yes?yes?noyes
Technoratibot/0.6no???yesno
Popdexter/1.0yes???yesyesNote: unthrottled spider, no robots.txt; adds / to URLs without extension
PubSub.com RSS readernoyesno?yesnono?403 and 410 processing fixed; if no more issues, will move to good bots list
Syndic8/1.0noyes?yesnonoignores 301
Feedster Crawler/1.0yesyesyesyesnonoignores 301
BlogPulse (ISSpider-3.0)yes
BlogLines/2.0ignores 301
Yahoo! Slurpignores 301
kinjabot beta2yes, every 6 hoursno
fastbuzz.comyes
Slower, Friendlier Spiders (BlogShares V1.36)ignores 301
FeedRover 1.0; Headlines Archiveyes
Twisted PageGetteryes
everyfeed-spider/1.0no??yesyes
IconSurf/2.0yes (by design!)yes??yes?/favicon.ico is stupid in the first place

Good bots

Giving credit where it’s due, here are the bots (and aggregators) that have been well-behaved so far:

  • NewsMonster
  • nntprss/0.4-beta-6
  • SharpReader/0.9.5.1