<pedrocorreia.net ⁄>
 

<Web scraping with PHP and XPath ⁄ >




clicks: 492 492 2009-02-18 2009-02-18 goto programacao myNews programacao  Bookmark This Bookmark This


When I was writing about how I use web scraping, I was still hadn't tried using Xpath (shame on me). sssscripting blog responded to my article with very good and rich post about all sorts of different techniques for scraping (with Ruby examples) and after reading this post in Kore Nordmann blog I finally decided to try making something with Xpath.

It turned out, that using Xpath is extremely easy, really. When you master it, you can do everything in seconds. Yes, you need to know how XML works and how to write correct Xpath queries (brief explanation of Xpath syntax is available at W3Schools), but hey - these topics are in 1st year of university.

Also, there are good tools like XPath checker for Firefox which allows you to debug and test your queries without writing any code. Stupid to say, but XPath queries looks a lot like CSS selectors, but with much more power and flexibility. Without further talking, lets look at example (idea from Kore's article):



este é só um excerto do artigo, para aceder ao artigo completo, clique no link em baixo:
this is just a small excerpt from the article, to access the full article please click in the link below:

http://dev.juokaz.com/php/web-scraping-with-php-and-xpath




Subscribe News RSS  Subscribe News Updates by E-mail





myNews <myNews show="rand" cat="programacao" ⁄>

Android Intents - Tutorial

Intents allow the application to request and / or provide services . For example (...)

clicks: 42 42 2010-07-30 2010-07-30 goto url (new window) vogella.de/arti... goto myNews programacao


Android Development - Hierarchy Viewer

The Android SDK comes with a wide variety of tool that are intended to help deve (...)

clicks: 43 43 2010-07-29 2010-07-29 goto url (new window) barebonescoder.... goto myNews programacao


JavaScriptSerializer - Dictionary to JSON Serialization and Deserialization

Few weeks ago I was working with the JavaScriptSerializer for serializing objec (...)

clicks: 43 43 2010-07-29 2010-07-29 goto url (new window) weblogs.asp.net... goto myNews programacao


Memoizing higher-order functions

Memoization incrementally converts functions into data structures. It pays off w (...)

clicks: 36 36 2010-07-29 2010-07-29 goto url (new window) conal.net/blog/... goto myNews programacao


20+ Required Windows Apps: Web Designer's Choice

In April, Jeffrey Way gave us a list of the "20 Mac Apps You'll Use Every Day." (...)

clicks: 56 56 2010-07-28 2010-07-28 goto url (new window) net.tutsplus.co... goto myNews programacao


Introducing ASP.NET MVC 3 (Preview 1)

This morning we posted the "Preview 1" release of ASP.NET MVC 3. You can downlo (...)

clicks: 80 80 2010-07-28 2010-07-28 goto url (new window) weblogs.asp.net... goto myNews programacao


Uploading a File (Or Files) With ASP.NET MVC

I wanted to confirm something about how to upload a file or set of files with AS (...)

clicks: 76 76 2010-07-27 2010-07-27 goto url (new window) haacked.com/arc... goto myNews programacao


Android - Multithreading For Performance

A good practice in creating responsive applications is to make sure your main UI (...)

clicks: 45 45 2010-07-27 2010-07-27 goto url (new window) android-develop... goto myNews programacao