Kapalı

Big data Project - using sentence embedding(word2vec, doc2vec) and gradient boost models such as catBoost

1. Collect and process pdf data dump from COVID-19 Open Research Dataset Challenge (CORD-19)

[login to view URL]

2. Analyze the data and provide publication statistics such as the number of publications according to time, location but not limited to. Provide (any type of) visualization for the results.

3. Learn sentence embedding from the articles' abstract and main content respectively.

4. Build a tool for question answering: given a user input sentence or query, outputs the top 10 most relevant sentences from the data and the source of the data, i.e., the sentence comes from which article. The tool could be command-line based or a simple Web-based interface.

Note that the dataset is large, so if you have difficulties processing all the articles provided in the dataset, you could work on part of it but no less than 5000 articles. And provide justification of why you choose the number of articles to work o

Beceriler: Python, Büyük Veri, Veri Bilimi, Neural Networks, Web Scraping

Daha fazlasını gör: export excel data project using macro, skype big data project, BIG data project, big data project freelancer, freelancer big data project, big data project bidding, big data project ideas for students, big data project examples, big data processing using hadoop, big data project management, example big data project plan, big data project manager job description, big data project manager roles and responsibilities, big data project manager resume, big data project challenges, big data project steps, big data project process, big data project titles, big data project report pdf, https www kaggle com uciml student alcohol consumption data

İşveren Hakkında:
( 1 değerlendirme ) sydney, Australia

Proje NO: #31906837

Bu iş için 7 freelancer ortalamada $169 teklif veriyor

(41 Değerlendirme)
6.8
(68 Değerlendirme)
7.0
nibeditad007

Hi, Hope you are doing well. I have over 6 years of rich experience in data science and machine learning. I have worked hands on in Python with different datasets for data wrangling, data manipulation, data analysis Daha Fazla

$250 AUD in 5 gün içinde
(14 Değerlendirme)
4.7
duongquocdat7411

Hello, I am a data scientist with strong background in machine learning and statistics and more than 3 years in building complex ML systems in NLP, computer vision, active learning, federared learning, few-shot learnin Daha Fazla

$200 AUD in 7 gün içinde
(11 Değerlendirme)
4.3
freelancerIrvan

Hello, There I am a talented python web scraper and automation specialist. I am familiar with data extracting using requests, scrapy, selenium and bs4, so I have rich experience in scraping of many plat Daha Fazla

$139 AUD in 7 gün içinde
(3 Değerlendirme)
3.9
edwardfree

Dear Client Thank you for your project. I've just checked your job description carefully. I'm senior developer with 9+ years of Python. By using Python, I developed AI engine, BOT, Web Scraping Tools, Web Searching Too Daha Fazla

$140 AUD in 7 gün içinde
(1 Yorum)
1.9
andreizabauski

Hi dear, I have read your job description carefully and I am very interested in your pdf scraping project. I will use Beautifulsoup to perform your large data scraping. Beautifulsoup is very good python scraping libra Daha Fazla

$140 AUD in 7 gün içinde
(0 Değerlendirme)
0.0