-
Categories
-
Tags/Keywords
nutrition zimbra VIM CLI HTTP Subversion MediaWiki bash WLB07051 Apache samba HTML SSH Screen Family svn RAID Javascript smb blog.bigsmoke.us Linux PHP T61 xen plugin van der Molen WWW MySQL mod_rewrite metabolism postfix ssl WordPress Windows X shell Ruby XTerm CSS DNS RuG Firefox Debian Gentoo Ubuntu -
Recent Posts
-
Recent Comments
Tag: scrAPI
RubyGems nuisances
Because I used it successfully before, I decided to use scrAPI to scrape the entries from the old Aihato guestbook. After preprocessing the HTML a bit, I finally got beyond an endless debugging sessions (which cumulated in me discovering a whole collection of nested <html> tags, which forbad any type of sensible parsing of the page). Read More »
Web scraping in Ruby: why I had to use scrAPI instead of WWW::Mechanize and Hpricot
Thursday evening: so, I had written myself a nice little script using Aaron Patterson's WWW::Mechanize and why's Hpricot to extract some data from a popular web-based airport directory. Read More »