I’ve decided not to follow my dreams. Instead, I’m going to ask where they’re going and catch up with them later.
Mitch Hedberg
I’ve decided not to follow my dreams. Instead, I’m going to ask where they’re going and catch up with them later.
Mitch Hedberg
I’m in a really silly mood right now. Could be because I have tomorrow (well, today, now) and Friday off, due to some fortuitous timing and effort at work, but I prefer to think it’s something else.
This is rather disturbing: Lenni Jabour’s website (and e-mail address, and hosting site, and her agent) have disappeared from the face of the web. Cohorts that I’ve been able to contact haven’t heard much from her for several months; if anyone knows what’s up, please add a comment with a heads-up or contact me directly!
Update: Lenni’s webmaster/brother Joe just e-mailed to say it’s a temporary glitch while he moves hosts. Even better, he notes that she’s got a show in L.A. on the 13th at 9:00pm at The Gardenia Room, 7066 Santa Monica Blvd.
This word [SHOULD], or the adjective RECOMMENDED, mean that there may exist valid reasons in particular circumstances to ignore a particular item, but the full implications must be understood and carefully weighed before choosing a different course.
RFC 2119, section 3
The User-Agent request-header field contains information about the user agent originating the request. This is for statistical purposes, the tracing of protocol violations, and automated recognition of user agents for the sake of tailoring responses to avoid particular user agent limitations. User agents SHOULD include this field with requests.
RFC 2616, section 14.43, User-Agent
SHOULD means SHOULD. From now on, unconfigured web browsing libraries can take a hike: as of this evening I’m blocking bogus user agents, which will be sent to this page. Mark Pilgrim points out in a comment on his recent article on blocking spambots and spybots that there is no legal requirement for bots to identify themselves… most users won’t bother [changing their User-Agent], because they don’t understand how the web works or why it would matter. And not a lot of people block by User-Agent, so it really doesn’t matter all that much.
Perhaps there’s no legal requirement, but there is an explicit expectation of proper identification given by the RFCs quoted above. (RFCs are the de facto protocol standards, much as the W3C’s TRs are the de facto content standards.) Because it’s the default
is hardly a valid reason not to set a user-agent string.
This isn’t about getting rid of spambots or spybots, although that may be a corollary of the idea. It’s about following the rules–in societal terms, it’s about being polite. I know my piddly little site isn’t going to change the world; if someone can’t retrieve a page they want they can contact me or move on somewhere else and I won’t mind a bit. But if one person looks at the bogus-bot page and says hey, I should do that
then I’ll be a happy camper.
Without further ado, here’s the mod_rewrite magic. Share and enjoy (and suggest more!).
RewriteCond %{HTTP_USER_AGENT} ^$ [OR] RewriteCond %{HTTP_USER_AGENT} ^Java1 [OR] RewriteCond %{HTTP_USER_AGENT} ^Java/ [OR] RewriteCond %{HTTP_USER_AGENT} ^libwww-perl/ [OR] RewriteCond %{HTTP_USER_AGENT} ^Python-urllib/ RewriteRule .* /bot-redirect [R]
Update: Hmm. I should probably include RewriteCond %{HTTP_USER_AGENT} !.*/[0-9]
as well, perhaps in place of the Java1
rule that it supersets. Interesting to note that a couple of the blogbots (obidos-bot and the myelin ecosystem bot in my logs from the last week) would then show up bogus.
About 12 minutes later: Or not. Re-reading section 3.8, I see that product = token ["/" product-version]; product-version = token
, i.e. the /product-version
is optional and needs not start with a digit anyway. I’m going to claim to have been thrown because both bots misuse the Referer field too.
And another few minutes: Re-reading 14.43. The User-Agent field itself should be included, but additional tokens are only present by convention, listed in order of their significance for identifying the application.
So the letter of the RFC shoots down most of my rationale, minus the check for an empty string. Still, I think I’m still following the spirit of 2616:14.43, so I’m going to keep the rules and modify the verbiage of the redirection page slightly.
This is just too weird.
I’ve noticed hits on my site in the last couple of days from people searching for Paula Skimin, the improvising tapdance geek
who opened for Lenni Jabour in September. Being a good little glory-seeking weblogger, any time I see a search engine hit, I run the same query to see where I am… and, amazingly, I discovered that I’m the world’s number one authority
on Paula.
That’s not the weird part, although I do think it’s kinda cool.
The weird part starts here (or, more specifically, here), with a video of Paula, produced by Andrew Fedosov, on CBC’s ZeD site.
It’s not weird because of Paula; she, and the video, are quite interesting and not weird, in fact. It’s not weird because some of the footage is from her performance at the aforementioned concert, although that’s getting a little warmer. It’s not weird because I didn’t even notice (until now) that there was a video camera there. It’s not weird because I recognized David Peters and Drew Birston in the video either. It’s not weird because you can see Andrew and Sarah in one of the later shots… but that’s white-hot. Nope, the weird thing is that the very last few frames of the video zoom in on the audience, and right there (well, a little off to the left) is my happy little face.
I’m ready for my closeup, Mr. DeMille.
Update 1: OK, so apparently the video is quite dark on some monitors, to the point where they can’t even be adjusted high enough to see the last few frames. Guess I’ll be anonymous for a while longer.
Update 2: Another odd coincidence: my friend and officemate Mike actually watched Paula’s video when it first aired on CBC.
At least one good thing came out of my Apache problems: I’ve learned enough about mod_rewrite
to transparently poison my site to the address harvesters out there. I’m still going to leave the basic address-munging that Movable Type does enabled, but it should be even harder to scan these pages in the first place.
As usual, Mark Pilgrim has a more comprehensive description of how to block spambots et al. than I’ve been fumbling with. I’d say that great minds think alike, but, well, look around Dive Into Mark and look around here and make your own judgement.
A small advantage to not having After some simple changes, the Nice Titles on this site should now work in HTML or X(HT)ML. I did find a bug that I’ve been too preoccupied to fix in his original JavaScript, too: it can’t handle the case where there’s an embedded tag, like the Adrants link in this post.application/xhtml+xml
working is that Stuart Langridge’s Nice Titles will work on this page. That is, until I can figure out how to make them work in the X(HT)ML DOM, and then they’ll work everywhere in the weblog.
In other news, Eric Meyer has yet another brilliant line of CSS.
The three best baseball comedy routines ever: Abbott and Costello’s Who’s On First?, Bob Newhart’s Nobody Will Ever Play Baseball, and George Carlin’s Baseball and Football. Nothing else is remotely close.
New WaSP member Mark Pilgrim continues to (correctly, IMHO) take issue with NewsMonster, a new RSS aggregator. I’ve seen similar behaviour to this for several months–since November 27, in fact–from a user-agent purporting to be Mozilla/4.0 (compatible; MSIE 5.15; Mac_PowerPC)
. It seems unlikely that this is an incarnation of Burton’s software–if it is, why’s he looking at my site, of all things–which leads me to believe that there’s something else out there. (The other thing is that at least half of these requests are coming from doorstop.webtv.net
–i.e. WebTV Networks, i.e. Microsoft–which is kind of unlikely to be a Mac house.) The user-agent string isn’t listed in the über-user-agent list Mark cites, but I’ve seen other interactive requests from the agent so I’m presuming it’s just something spoofing the agent.
So, whatever you are–to continue what’s turning into today’s theme–Yer outta there!
It won’t be spring for another month, but major league baseball’s spring training season started earlier this week. With it comes a veritable flood of interesting, odd and disturbing stories.
What would Ted Williams have thought if he knew his body would be hanging upside down in a nitrogen-filled tank with perhaps four other full bodies and five heads at a cryogenics lab inside a strip mall in Scottsdale, Ariz.
Ed Janes, instant message, Feb. 19, 2003
My brother was talking about The Splendid Splinter, of course, and quoting ESPN’s recent story (based on an article by Bill Madden in the New York Daily News) on the horrifying
post-mortem treatment of the baseball great’s remains.
Later, he mentioned that Baltimore Orioles’ pitcher Steve Bechler collapsed from heatstroke on Sunday and died the next day; a Washington Post article quotes the team’s doctor blaming a dietary supplement
containing ephedra, which has been linked to several high-profile deaths involving athletes.
Two days later, fellow Oriole pitcher Jason Johnson was rushed off a practice field… after a diabetic episode.
This led Ed to share a trivia fact: Only one man has ever been killed by a thrown ball in MLB history, Ray Chapman by Carl Mays in 1920. I countered with a question triggered by memories of Dave Dravecky breaking his arm while throwing a pitch to Tim Raines of the Expos in 1989: how many players have broken limbs while pitching? We came up with four: Dravecky (who later had the arm amputated), John Smiley, Tom Browning, and Tony Saunders (who, incredibly, had it happen twice).
The day after my conversation with Ed, Rebecca Blood posted about Mamie Peanut Johnson, the first female pitcher in the Negro Leagues. Which led to another question: there was more than one woman in the league? My knowledge of the Negro Leagues in general is woefully inadequate, as is my knowledge of other alternative
leagues (like the AAGPBL, featured in A League Of Their Own).
And finally, something that I’ve meant to do for weeks is to pick up tickets for the new Canadian Baseball League team, the London Monarchs. On a related note, I’m sad to say I’ve never even been inside historic Labatt Park, even though it’s only a 15-minute walk away.
To coin a phrase, I love this game.