Crawl links and extract data

Completed Posted: Jul 6, 2008 Paid on delivery

I need a flat-file PHP script (if possible) that searches the top ... (max 10) pages in Google for a given keyword, visits those links, and collects all the text on each page (as if you visited the page and used Select All). This way all text is collected, including link text.
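
As a rough illustration of the "select all" step, the sketch below keeps every text node of a page, link text included, and skips the contents of script and style tags. It is written in Python only to show the idea; the requested script is PHP. Fetching the Google results and downloading the pages is left out, and `VisibleTextParser` and `extract_text` are hypothetical names.

```python
from html.parser import HTMLParser


class VisibleTextParser(HTMLParser):
    """Collects text nodes, skipping the contents of <script> and <style>."""

    SKIPPED_TAGS = {"script", "style"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0  # > 0 while inside a skipped tag
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script/style blocks.
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def extract_text(html):
    """Return the page text as one space-separated string."""
    parser = VisibleTextParser()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)
```

The text of all visited pages could then be concatenated into one string before counting.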

It then combines the text from the top ... pages into one file and lists the words by the number of times they are used, as shown below:

One-word list:

jobs (346)

dubai (334)

job (241)

hotel (204)

united (167)

emirates (164)

arab (135)

bahrain (107)

uae (102)

hotels (80)

location (80)

description (79)

manager (78)

hospitality (73)

travel (67)

recruitment (67)

experience (67)

middle (61)

jul (57)

dhabi (56)

etc.
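
A list like the one above could be produced with a simple frequency count. The sketch below is a Python illustration under stated assumptions: words are runs of ASCII letters and digits, matching is case-insensitive, and `count_words` and `format_counts` are hypothetical names.

```python
import re
from collections import Counter


def count_words(text):
    """Return (word, count) pairs, most frequent first."""
    # Assumption: a word is a run of ASCII letters or digits.
    words = re.findall(r"[a-z0-9]+", text.lower())
    return Counter(words).most_common()


def format_counts(counts):
    """Render pairs in the 'word (count)' style used in the list above."""
    return [f"{word} ({n})" for word, n in counts]
```

Words with equal counts keep the order in which they first appear in the text.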

Two-word phrase list:

arab emirates

united arab

job description

abu dhabi

dubai united

middle east

emirates job

location dubai

saudi arabia

our client

3rd jul

real estate

united states

dubai jobs

job vacancies

hospitality travel

job category

travel location

category hospitality

description our

years dubai

manama bahrain

location united

years job

bachelors degree

dubai uae

degree experience

emirates education

etc.
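
Two-word phrases can be counted the same way by sliding a window over the word sequence. This Python sketch is an illustration only; it assumes phrases may cross sentence boundaries and that no words are removed before pairing, and `count_phrases` is a hypothetical name.

```python
import re
from collections import Counter


def count_phrases(text, size=2):
    """Return (phrase, count) pairs for every run of `size` adjacent words."""
    # Assumption: a word is a run of ASCII letters or digits.
    words = re.findall(r"[a-z0-9]+", text.lower())
    phrases = (
        " ".join(words[i:i + size])
        for i in range(len(words) - size + 1)
    )
    return Counter(phrases).most_common()
```

Whether ignored words should be dropped before or after the phrases are built changes the result, so that order would need to be agreed on.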

The displayed keywords should be able to open in a new window without the counts (keywords only), so you can copy them into another program or paste them into a text file, for example.
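
The keywords-only view amounts to dropping the trailing count from each line. A minimal Python sketch, assuming the counts always appear as a number in parentheses at the end of the line; `strip_counts` and `as_plain_text` are hypothetical names.

```python
import re

# Matches a trailing " (123)" style count.
COUNT_SUFFIX = re.compile(r"\s*\(\d+\)\s*$")


def strip_counts(lines):
    """Remove the trailing count from each 'keyword (count)' line."""
    return [COUNT_SUFFIX.sub("", line) for line in lines]


def as_plain_text(lines):
    """Join the bare keywords one per line, ready to copy or save."""
    return "\n".join(strip_counts(lines))
```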

Some other options that should be available:

- Exclude words with fewer than 3... (fill in) letters.

- A list of words to ignore. I want a default list prefilled with words like:

the

of

to

and

a

in

is

it

you

that

he

was

for

on

are

with

as

i

his

they

be
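
The two options above could be applied as one filter before counting. The Python sketch below is an illustration only: `filter_words` and `DEFAULT_IGNORED` are hypothetical names, the default minimum length of 3 is just the example value from the brief, and the ignore list holds the words listed above.

```python
# Default ignore list, taken from the words listed in the brief.
DEFAULT_IGNORED = frozenset({
    "the", "of", "to", "and", "a", "in", "is", "it", "you", "that", "he",
    "was", "for", "on", "are", "with", "as", "i", "his", "they", "be",
})


def filter_words(words, min_length=3, ignored=DEFAULT_IGNORED):
    """Drop words shorter than `min_length` and words in the ignore list."""
    return [
        w for w in words
        if len(w) >= min_length and w.lower() not in ignored
    ]
```

Both settings are parameters, so a form field could supply the minimum length and an edited ignore list.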

Engineering, MySQL, PHP, Software Architecture, Software Testing, Web Hosting, Website Management, Website Testing

Project ID: #3034323

About the project

4 bids Remote project Active Jul 7, 2008

Awarded to:

jalsvw

See private message.

21.25 in ... days
(8 Reviews)
3.0

4 freelancers are bidding an average of $37 for this job

MuktoSoftware

See private message.

$42.5 USD in 7 days
(440 Reviews)
7.5
webhorizonvw

See private message.

$42.5 USD in 7 days
(3 Reviews)
2.3
GOCBD

See private message.

$42.5 USD in 7 days
(0 Reviews)
0.0