I’ve posted the slides from my webcast on February 27. If you weren’t able to make it, I gave an introduction to what web scraping is, basic details of the HTTP protocol, available resources for developing web scraping applications, and best practices. I know there are plans to make the audio from the webcast and I will update this post with a link once it becomes available.
If the slides and audio aren’t enough for you, I will in all likelihood be giving an extended version of the presentation that includes both retrieval and analysis as part of the Unconference event at php|tek. Look forward to seeing you there!