I gang

two data scraper for italian websites

I need two data scraper for the following sites:

www dot aziende dot it

login dot cercaziende dot it

The scraper needs to collect the following information

- category (eg plumbers etc)

- Business Name

- description (id="textDescriptor")

- All phone & fax numbers

- Address

- website address

- email address

A business may have more than one phone number and should be broken into the following fields.

- Ph

- Other

- Fax

- Mobile

- AH Contact

I also need the address broken into separate fields

- Street number and name

- Suburb

- State

- postcode

- Country

The script must be able to:

use a proxy server lists in round robin way, rotating them every 20 or 50 requests

use as input a file with the urls list

export the data to a csv.

A simple interface will allow me to start/stop the script and provide basic progress feedback.

automatically extract the data from the continuing pages i.e. 2, 3, 4 onwards to get the full data

I should be able to specify the max number of records to retrieve and the speed (delay) of retrieving

For the first web site the url that contains the links to the information

are like:

http://www dot aziende dot it/abbigliamento/[url removed, login to view]

http://www dot aziende dot it/casa-e-giardino/[url removed, login to view]

and so on.

For the second website the urls are like:

http://login dot cercaziende dot it/category/abbigliamento

http://login dot cercaziende dot it/category/auto-e-moto

and so on

for this site the info is all on the page, you do not have to follow other links beside the paging.

Færdigheder: C programmering, Databehandling, PHP

Se mere: web paging, state auto, script php proxy web, round name, proxy server simple, broken websites, two , street 3, postcode, php websites, ph, mobile number data, mobile data, italian to, into italian, extract phone number, export data, csv data, collect data two website, casa, business data, ah, file email extract, country name start, data contact number

Om arbejdsgiveren:
( 15 bedømmelser ) Syndey, Australia

Projekt-ID: #263685

Tildelt til:

victory07

Please see PMB

$120 USD in 3 dage
(52 bedømmelser)
6.4

15 freelancere byder i gennemsnit $157 for dette job

SigmaVisual

Please check PMB.

$225 USD in 4 dage
(258 bedømmelser)
8.0
creatorul

Professional work.

$250 USD på 1 dag
(144 bedømmelser)
7.5
NishantBamb

Hello, please refer your PMB. Thank you.

$200 USD in 7 dage
(152 bedømmelser)
7.4
MAnkita

Hello,Ready to [url removed, login to view] you.

$150 USD in 3 dage
(121 bedømmelser)
7.0
yousefla

Hello, Will be glad to help. Best Regards, Yousef

$195 USD in 3 dage
(42 bedømmelser)
6.3
fstudio

Dear sir, I am very interested in your project, Please see PMB for more details. Thanks. Best Regards.

$100 USD in 2 dage
(60 bedømmelser)
5.7
SouthIndian

Please refer PMB. Thanks.

$100 USD in 3 dage
(72 bedømmelser)
5.2
rz931

Please check PM. Thanks RC.

$100 USD in 2 dage
(13 bedømmelser)
4.9
andreiandrei

Hi,please check PM.

$200 USD in 2 dage
(7 bedømmelser)
4.9
Framp

I'm interested in your projects. Regards, Federico

$120 USD in 2 dage
(9 bedømmelser)
4.6
dibyendu01

Plz see pmb :))

$249 USD in 5 dage
(2 bedømmelser)
2.4
niksite

Can do that on python. I have experience with scraping USA and BEL yellowpages, so I do not expect any unexpectedness.

$100 USD in 4 dage
(1 bedømmelse)
1.0
anhr

I can do it

$100 USD in 3 dage
(0 bedømmelser)
0.0
rqlguevarra

I'm very much interested consider me as one of your personnel online. thanks

$150 USD in 5 dage
(0 bedømmelser)
0.0