{"id":434,"date":"2005-02-21T02:18:41-05:00","date_gmt":"2005-02-21T07:18:41+00:00","guid":{"rendered":"http:\/\/peterjanes.ca\/blog\/archives\/2005\/02\/21\/google-off"},"modified":"2005-02-21T03:02:56-05:00","modified_gmt":"2005-02-21T08:02:56+00:00","slug":"google-off","status":"publish","type":"post","link":"https:\/\/peterjanes.ca\/blog\/2005\/02\/21\/google-off\/","title":{"rendered":"Google&nbsp;off"},"content":{"rendered":"<div class='e-content'><p class=\"lyrics\">Google on!  Google off!  Google on, Google off&#8230; the Googler!<\/p>\r\n\r\n<p>But seriously.<\/p>\r\n\r\n<p><cite class=\"name\">ckaminski<\/cite> of the <a href=\"http:\/\/www.webstandards.org\/\">Web Standards Project<\/a> <a href=\"http:\/\/webstandards.org\/buzz\/archive\/2005_02.html#a000492\">recently highlighted<\/a> the redesign of <a href=\"http:\/\/www.att.com\/\">AT&amp;T&#8217;s home page<\/a> and developer Joe D&#8217;Andrea&#8217;s <a href=\"http:\/\/www.joesapt.net\/archive\/2005\/02\/13\/00.18.35\/\">informative discussion of the project<\/a>.  One thing D&#8217;Andrea doesn&#8217;t mention, however, is the presence of the two comment-style directives <code>&lt;!--googleoff: index--&gt;<\/code> and <code>&lt;!--googleon: index--&gt;<\/code>.<\/p>\r\n\r\n<p><ins datetime=\"2005-02-21T02:41:00-05:00\">Question answered: <q>Those directives are not used by Google proper but rather our own slice-o-Google*, a mighty spiffy search appliance.<\/q>  Thanks Joe!  Wonder why this is only an option on the Google Appliance and isn&#8217;t turned on web-wide?<\/ins><\/p>\r\n\r\n<p>I&#8217;ve seen <code>googleoff\/on<\/code> on the aforementioned AT&amp;T site and on several <a href=\"http:\/\/www.cbc.ca\/story\/arts\/national\/2004\/12\/30\/Arts\/web-tsunami041230.html\">CBC news items<\/a> (which also use the argument <code>all<\/code>).  It&#8217;s also visible on <a href=\"http:\/\/www.google.com\/search?q=googleoff\">a number of disparate sites<\/a> because they have improperly-formatted comments; however, I believe this indicates that there are many more sites that use it properly.<\/p>\r\n\r\n<p>I wrote about the benefits of <a href=\"\/blog\/archives\/2004\/08\/24\/xmdp-style-robot-profile\">a content-level robot exclusion scheme<\/a> in August.  <code>googleoff\/on<\/code> is quite similar in concept, although it&#8217;s outside of the markup and thus requires a separate parser for XML-based content.  (On the tag-soup web, though, it&#8217;s just another special case.)  There&#8217;s also some comparison to be made to <a href=\"\/blog\/archives\/2005\/02\/21\/relnofollow-broken\"><code>rel=\"nofollow\"<\/code><\/a>; it&#8217;s interesting to note that <code>googleoff\/on: follow<\/code> could be used to accomplish exactly the same thing.<\/p>\r\n\r\n<p><del datetime=\"2005-02-21T02:50-05:00\">Whether it works or not is another question.  I&#8217;m inclined at the moment to say it doesn&#8217;t: <a href=\"http:\/\/www.google.com\/search?q=%22tsunami+survivors+and+anxious+relatives%22+science+business\">a search for the CBC article above<\/a> including a term that is only present in the <code>googleoff: index<\/code> or <code>googleoff: all<\/code> blocks still finds the page, while a <a href=\"http:\/\/www.google.com\/search?q=%22tsunami+survivors+and+anxious+relatives%22+foobar\">control request that includes a word not in the page at all<\/a> returns no hits.<\/del><\/p>\r\n\r\n<p><del datetime=\"2005-02-21T02:50-05:00\">Still, it&#8217;s worth an experiment, so<\/del> <ins datetime=\"2005-02-21T02:50-05:00\">In the hopes that this will someday make its way to the web at large,<\/ins> I&#8217;ve added the directives to the comment forms, navigation links, and other non-content sections of my weblog pages.  <del datetime=\"2005-02-21T02:50-05:00\">Currently a search for <a href=\"http:\/\/www.google.com\/search?q=petroglyphs+wordpress+site:peterjanes.ca\">occurrences of <cite class=\"term\">petroglyphs<\/cite> <cite class=\"term\">wordpress<\/cite> on this domain<\/a> returns only 3 results, and it&#8217;s my hope that that number won&#8217;t increase as Google respiders all the pages that now announce the presence of <a href=\"http:\/\/www.wordpress.org\/\">WordPress<\/a>.<\/del><\/p>\r\n\r\n<p>If by some chance the content exclusion <del datetime=\"2005-02-21T02:50-05:00\">works<\/del> <ins datetime=\"2005-02-21T02:50-05:00\">is ever turned on<\/ins>, it might be interesting to try several other tests.  The <code>index<\/code> argument is a keyword from the <a href=\"http:\/\/www.robotstxt.org\/wc\/meta-user.html\">robots meta tag<\/a>, so perhaps <code>follow<\/code> would also work.  (I&#8217;m presuming that <code>all<\/code> takes the place of <code>index,follow<\/code> and that the mere presence of <code>googleoff<\/code> implies the <code>no<\/code> prefix to those arguments.)  Also, because the comments are external to the markup, it should be possible to nest or otherwise intertwine them.<\/p><\/div><div class=\"syndication-links\"><\/div>","protected":false},"excerpt":{"rendered":"Google on! Google off! Google on, Google off&#8230; the Googler! But seriously. ckaminski of the Web Standards Project recently highlighted the redesign of AT&amp;T&#8217;s home page and developer Joe D&#8217;Andrea&#8217;s informative discussion of the project. One thing D&#8217;Andrea doesn&#8217;t mention, however, is the presence of the two comment-style directives &lt;!&#8211;googleoff: index&#8211;&gt; and &lt;!&#8211;googleon: index&#8211;&gt;. Question&hellip;","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"mf2_syndication":[],"venue_id":0},"categories":[3],"tags":[],"kind":false,"_links":{"self":[{"href":"https:\/\/peterjanes.ca\/blog\/wp-json\/wp\/v2\/posts\/434"}],"collection":[{"href":"https:\/\/peterjanes.ca\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/peterjanes.ca\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/peterjanes.ca\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/peterjanes.ca\/blog\/wp-json\/wp\/v2\/comments?post=434"}],"version-history":[{"count":0,"href":"https:\/\/peterjanes.ca\/blog\/wp-json\/wp\/v2\/posts\/434\/revisions"}],"wp:attachment":[{"href":"https:\/\/peterjanes.ca\/blog\/wp-json\/wp\/v2\/media?parent=434"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/peterjanes.ca\/blog\/wp-json\/wp\/v2\/categories?post=434"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/peterjanes.ca\/blog\/wp-json\/wp\/v2\/tags?post=434"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}