split in a dataset - the outcome of a mechanize process

hello dear linux-friends

here the dataset; at the moment it is a dataset that is the outcome of a Perl-job done with Perl Mechanize
note - there is also a encoding iso-8859 issue - and therefore we need a treatment that take care for that issue.

Here the dataset with its specific design and charcakters - and structure;

Name ( of the institution) -

Street and housenumber
Postalcode and town: ( 4 digit-postalcode - obligatore; townname can consist out of several words)
Telefonnumber: (the telephone-number is following after a “:” - the telephone - number can be very long)
Fax Nummer: (the fax-number consists out of a large string…)
E-Mail: note - not every data-line has got the E-Mail-Adressdataset.

Here again the conclusion: Fax and E-Mail are not in every line. See below for more infos - or just
contact mme…

many many greetings

dilbert

**
here an example.
**


Pichlsbier beim Wels Pfarrhoferplatz 1 4632 Pichlbier bei Wels Telefonnummer: 0333337444247/6444457770 0655550076/85557765291 FAX-Nummer: 055567247/5556777-4 E-Mail: pichldasbierchen.wels@linzertorte.at
Pierbach Dorfstra�e 1 4282 Pierbach Telefonnummer: 07267/8205 0676/8776529
Pinsdorf Moargasse 2 4812 Pinsdorf Telefonnummer: 07612/63952 0676/87765293 E-Mail: pichelsteiner@linzertorte.at
Pischelsdorf Pischelsdorf 2 5233 Pischelsdorf am Engelbach Telefonnummer: 07742/7207 0676/87765294 E-Mail: pischelsdorf@linze