My workspace from this evening. Now all I have open is this browser window :)
Well after several long hours of debugging, cursing and several tall cups of coffee, my Perl script to import static pages from Google's cache, convert them to a WordPress WXR/RSS file and import here worked! This means all the blog posts from February 2009 to April 2009 that weren't part of my last good backup are now available again.
Had it not been for Google's cache, over 120 posts would have been lost.
I've started a cron job to make automatic backups every evening in both WXR/RSS and SQL formats and send them to a secret email address so this mess never happens again. I'll also be looking into checksumming/hashing said files before and after their transfer so I don't end up with the same invalid, messed up export files I had to deal with and piece back together this time around.
Aside from the changed webhost and URI, everything is now back to the way it was before. The only things missing are comments; if you left a comment on a post over the last two months feel free to write another if you could be bothered.
Next step is to finally implement my new theme I've been kicking around for a few months. As I said in the last post I've tried my best to set up correct redirects for search engines and people subscribed, as well as changing the URIs on all the posts here to speed things up. If you see something amiss, feel free to leave a comment.
UPDATE: If you see repeated posts in Google Reader, Bloglines or other blog readers, I’m sorry for bombarding you! I attempted to maintain the permalinks so this wouldn’t happen, but clearly Google Reader and Bloglines still thought they were new posts.
UPDATE: It seems a couple of posts are still missing because they were uploaded too recently to have been put into Google’s cache. Tomorrow morning I’ll go to trusty Google Reader and copy them back here. Google has been a real pal over these last few days!