parse with PHP-Simple-DOM-Parser on wikipedia

see the Wikipediapages:

the pages look like very very simple and very similar to each other

should i get it with the curl and parse it with PHP-Simple-DOM-Parser
or with Perl

Liste der Kirchengebäude in München – Wikipedia
Liste der Kirchengebäude in Leipzig – Wikipedia
Liste der Kirchengebäude im Erzgebirgskreis – Wikipedia
Liste der Kirchengebäude in Chemnitz – Wikipedia
Liste der historischen Kirchengebäude im Westerwald – Wikipedia

what would you do - how would you
do the parsing job

dilbertone wrote:
> see the Wikipediapages:
>
> the pages look like very very simple and very similar to each other
>
> should i get it with the curl and parse it with PHP-Simple-DOM-Parser
> or with Perl

> what would you do - how would you
> do the parsing job

Neither. Wikipedia is a wiki, so the original material is the wiki
source (use the ‘View source’ link at the top of each page, not your
browser’s view source option - that’s different).

And be aware of Wikipedia’s methods of data access, which discourage
‘scraping’ pages

http://en.wikipedia.org/wiki/Wikipedia:Database_download

hello dear** djh-novell**](http://forums.opensuse.org/members/djh-novell.html)

many thanks for the answer. Well i do have to investigate - and have a closer look at wiki to understand your answer.
Wiki is not that simple to parse - is this true !?

more answers and ideas later

					http://forums.opensuse.org/images/novell/user-offline.png