Devam Ediyor

Extracting backlinks from URL-s from CommonCrawl

I need a code (in java, .net or php) that will choose random 10.000 URL-s from CommonCrawl dataset ([url removed, login to view]). For each URL you need to extract:

1) page title of that page (from <title> tag)

2) most frequent anchor text used to link to that URL - excluding one word anchor text and excluding URL anchor texts (anchor texts with http:// and www)

The results should be exported in excel or csv file. The file will have these columns:

URL, TITLE, ANCHOR TEXT.

For step 2) you will probably need to use external API like [url removed, login to view], [url removed, login to view] or similar. The 1 monthly cost of these api will be paid by me.

Beceriler: .NET, Hadoop, Java, Map Reduce, PHP

Daha fazlasını görün: backlinks from, extract excel word, text extracting, excel use api, moz api, tag java, extract word text file, external backlinks, backlinks org, ahrefs api php, excel csv java, ahrefs php, ahrefs api, backlinks monthly, url link csv file, java code excel, anchor text backlinks dofollow, extract text csv, php random link anchor text, 000 backlinks, commoncrawl, ahrefs, monthly backlinks, php similar texts, tag url

İşveren Hakkında:
( 15 değerlendirme ) Sv.Lovrec, Croatia

Proje NO: #6551034

Seçilen:

barundebnath

Hi, I am expert crawler maker. So this project wont be any problem for me. I will use [url removed, login to view] for api. And I will use .net for codding. Thanks

3 gün içinde 111$ USD
(78 Değerlendirme)
6.4

10 freelancer bu iş için ortalamada 162$ teklif veriyor

SigmaVisual

Dear Client, I can help in your project. We have already experience of working on similar projects. Please see below to get idea of my similar experience: Amazon/Ebay Bots: [url removed, login to view] Daha fazlası

in 3 gün içinde210$ USD
(270 Değerlendirme)
8.0
goraph

Hi Can be done. You need 10000 random pages from public dataset, and not care which pages? Also, how many anchor texts you need? One most popular or more? As I check, 5 most popular availiable for free on [url removed, login to view] Daha fazlası

in 3 gün içinde172$ USD
(50 Değerlendirme)
6.0
murtaza1981

Hi, Please feel free to discuss the project with me ........................................................................... Thanks, Murtaza

in 3 gün içinde250$ USD
(10 Değerlendirme)
4.5
usaravananbe2004

Hi, I am Saravanan. I have 7 years’ hands on experience on Web /Desktop Application Development, Automation/Scraping and Testing using Java Technologies. I went through your requirements, I am intrested to work on th Daha fazlası

in 3 gün içinde166$ USD
(7 Değerlendirme)
4.3
honghas

Hi! I am interested in your project. I am working in same projects (web spider) so I strongly believe that my abilities fit to your requirements. I look forward to working with you!

in 5 gün içinde155$ USD
(2 Değerlendirme)
3.2
ergouravsingal

A proposal has not yet been provided

in 3 gün içinde155$ USD
(0 Değerlendirme)
0.0
aztechdev

A proposal has not yet been provided

in 2 gün içinde150$ USD
(0 Değerlendirme)
0.0
danielbustos86

I've done similar developments, I would soon have results. I'm used to comply with dates I have long worked as a developer for companies.

in 3 gün içinde155$ USD
(0 Değerlendirme)
0.0
PeterJaxon

Good day! For your specific project I would be perfect. Have a look at my portfolio! I have extensive experience in Java programming (over 6 years) and have worked with databases multiple times - especially in the Daha fazlası

in 3 gün içinde100$ USD
(0 Değerlendirme)
0.0