Log changes made due to website changes here! (want to get an idea of how often the newspapers change their systems...) 2007-07-23 Mirror: new url format broke scraper 2007-07-26 Guardian moving to a new backend 2007-10-?? Sun: totally new website format 2008-01-23 Independent: new website 2008-03-13 ft: started directing their blog feeds through feedburner.com 2008-04-25 times: they added a module to their output which broke our scraper 2008-04-25 scotsman: fixes to handle minor website changes 2008-05-30 dailymail: new website 2008-07-05 times: minor change on "our papers" page borked up the scraper 2008-07-15 dailymail: dailymail rss page changed format 2008-07-20 sun: minor tidy up of website (much more consistant now) 2008-07-23 telegraph: new-look website comes online (some sections still in old format) 2008-08-21 guardian: started moving blogs onto new their in-house CMS 2008-08-21 independent: new rss feeds, removed old ones 2008-09-16 guardian: moved the rest of their blogs over to new CMS 2008-10-15 guardian: changed date format on blogs 2008-10-15 dailymail: minor markup changes 2008-11-05 bbcnews: embedded video causing scraper headaches. 2008-11-11 guardian: minor pubdate fix (some articles have date in the "publication" bit)