Because I used it successfully before, I decided to use scrAPI to scrape the entries from the old Aihato guestbook. After preprocessing the HTML a bit, I finally got beyond an endless debugging sessions (which cumulated in me discovering a whole collection of nested <html> tags, which forbad any type of sensible parsing of the page).
By Rowan Rodrik, 7 years ago, on June 21, 2010, at 17:06 |
Thursday evening: so, I had written myself a nice little script using Aaron Patterson's WWW::Mechanize and why's Hpricot to extract some data from a popular web-based airport directory.
By Rowan Rodrik, 10 years ago, on May 02, 2007, at 13:05 |