{"id":216,"date":"2003-03-24T21:56:37-05:00","date_gmt":"2003-03-25T05:56:37+00:00","guid":{"rendered":"http:\/\/peterjanes.ca\/wordpress\/?p=216"},"modified":"2003-03-24T21:56:37-05:00","modified_gmt":"2003-03-25T05:56:37+00:00","slug":"spyce-up-your-life","status":"publish","type":"post","link":"https:\/\/peterjanes.ca\/blog\/2003\/03\/24\/spyce-up-your-life\/","title":{"rendered":"Spyce up your&nbsp;life"},"content":{"rendered":"<div class='e-content'><p>Good lord, I just quoted the <a href=\"http:\/\/www.spicegirlsforever.co.uk\/\">Spice Girls<\/a>.  I need to sit down for a second.<br \/><br \/><br \/><\/p>\n\n<p>OK, better now.  Still, this will just be a jumble of thoughts more than a proper entry.<\/p>\n\n<p>I&#8217;ve been playing some more with <a href=\"http:\/\/spycyroll.sourceforge.net\/\">Spycyroll<\/a>, and I think I&#8217;m making headway on adapting it for my purposes.  Until today I&#8217;ve just been letting it continue to accumulate posts without removing any; as a result, my aggregate page was well over 400<abbr title=\"kilobytes\">K<\/abbr> and growing.  (The reason for this is related to the date issues I <a href=\"http:\/\/peterjanes.ca\/blog\/archives\/2003\/03\/21\/mmm_spycy.html\">referred to earlier<\/a>.)  Tonight I got (slightly) brighter and realized all I had to do is to put <cite>read<\/cite> files in one directory and <cite>unread<\/cite> files in another.  Through the magic of Python, that took all of about five lines of code, and my aggregate page is a much healthier 16K.<\/p>\n\n<p>Holding on to all of those deleted files is still an issue.  Because I can&#8217;t tell what items are no longer in a feed, it&#8217;s necessary to hold on to all of them.  I&#8217;m thinking a database will be necessary, probably of <cite>MD5<\/cite> checksums for each post, but I&#8217;m not comfortable enough with Python to start messing around with its database support yet.<\/p>\n\n<p>I&#8217;ve also realized that although <code>rssparser.py<\/code> does nice resource retrieval (using <code>If-Modified-Since<\/code>\/<code>Last-Modified<\/code>, <code>If-None-Match<\/code>\/<code>ETag<\/code>, and <code>Accept-Encoding: gzip<\/code>), Spycyroll doesn&#8217;t take advantage of it.  I&#8217;ll probably use the filesystem to store that information too in the interim.<\/p>\n\n<p>A few things to consider there, and there&#8217;s more, but I&#8217;m getting antsy to try some of this out&#8230;.<\/p>\n\n<p><ins datetime=\"2003-03-24T23:15:00-05:00\">It works, mostly.  Now that I&#8217;m taking advantage of smart retrieval, the site links in the <cite>blogroll<\/cite> part of the page aren&#8217;t being filled in, because they&#8217;re pulled from the channel.  Two steps forward&#8230;.<\/ins><\/p><\/div><div class=\"syndication-links\"><\/div>","protected":false},"excerpt":{"rendered":"Good lord, I just quoted the Spice Girls. I need to sit down for a second. OK, better now. Still, this will just be a jumble of thoughts more than a proper entry. I&#8217;ve been playing some more with Spycyroll, and I think I&#8217;m making headway on adapting it for my purposes. Until today I&#8217;ve&hellip;","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"mf2_syndication":[],"venue_id":0},"categories":[3],"tags":[],"kind":false,"_links":{"self":[{"href":"https:\/\/peterjanes.ca\/blog\/wp-json\/wp\/v2\/posts\/216"}],"collection":[{"href":"https:\/\/peterjanes.ca\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/peterjanes.ca\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/peterjanes.ca\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/peterjanes.ca\/blog\/wp-json\/wp\/v2\/comments?post=216"}],"version-history":[{"count":0,"href":"https:\/\/peterjanes.ca\/blog\/wp-json\/wp\/v2\/posts\/216\/revisions"}],"wp:attachment":[{"href":"https:\/\/peterjanes.ca\/blog\/wp-json\/wp\/v2\/media?parent=216"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/peterjanes.ca\/blog\/wp-json\/wp\/v2\/categories?post=216"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/peterjanes.ca\/blog\/wp-json\/wp\/v2\/tags?post=216"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}