
Closed
Posted:
Paid on delivery
I need a robust web-scraping solution that pulls every public record from [login to view URL], including the information hidden behind the “View results” tab, and exports it to a single, well-structured CSV file. The data has to refresh automatically each day so I always hold a current snapshot of the full registry.

Minimum fields I will audit first are the Study Title, Study Results and Study Dates, but the scraper must capture every other table, note and metadata point the site exposes, without exceptions. Pagination, multi-language entries, and PDF attachments that sometimes appear inside the results section all need to be handled gracefully.

Please code the solution so it can run headless on a Linux server; Python with requests, BeautifulSoup, Selenium or Scrapy is fine as long as it is reliable and well-documented.

Deliverables are:
• An executable script (plus [login to view URL]) that performs the full crawl and writes/overwrites a CSV.
• A brief README explaining setup, scheduling for the daily run (cron is OK) and how errors are logged.
• One initial full dataset generated by your script so I can validate completeness.

I will consider the work complete when consecutive runs show identical record counts to the live site, and random spot-checks confirm every data point on the “View results” pages is present in the CSV.
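To make the pagination and "writes/overwrites a CSV" requirements concrete, here is a minimal sketch using only the Python standard library. It is not a finished scraper: the site URL is not shown in this posting, so the page fetcher is left as a hypothetical callable (`fetch_page`) that a real implementation would back with requests, Selenium or Scrapy. The names `crawl_all`, `write_csv_atomic` and `first_columns` are likewise illustrative assumptions, not part of any library.

```python
import csv
import os
import tempfile
from typing import Callable, Dict, Iterator, List, Tuple

Record = Dict[str, str]
# Hypothetical page fetcher: takes a 1-based page number and returns
# (records on that page, whether another page follows).
PageFetcher = Callable[[int], Tuple[List[Record], bool]]


def crawl_all(fetch_page: PageFetcher, max_pages: int = 100_000) -> Iterator[Record]:
    """Walk paginated results until the fetcher reports no further page."""
    for page in range(1, max_pages + 1):
        records, has_next = fetch_page(page)
        yield from records
        if not has_next:
            return
    raise RuntimeError("pagination did not terminate; check the fetcher")


def write_csv_atomic(records: List[Record], path: str, first_columns: List[str]) -> int:
    """Write all records to `path`, replacing any previous file in one step."""
    # Column set is the union of every key seen, so no field is silently dropped.
    extra = sorted({key for rec in records for key in rec} - set(first_columns))
    fieldnames = list(first_columns) + extra
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    try:
        with os.fdopen(fd, "w", newline="", encoding="utf-8") as handle:
            writer = csv.DictWriter(handle, fieldnames=fieldnames, restval="")
            writer.writeheader()
            writer.writerows(records)
        # os.replace swaps the file in atomically on POSIX systems, so a
        # failed run leaves yesterday's snapshot untouched.
        os.replace(tmp_path, path)
    except BaseException:
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
    return len(records)
```

The returned record count can be logged on each run and compared with the total the live site reports, which is the acceptance check described above. For the daily refresh, a crontab entry along the lines of `0 3 * * * /usr/bin/python3 /opt/scraper/crawl.py >> /var/log/scraper.log 2>&1` would be one option; the paths and the 03:00 schedule are placeholders.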
Project ID: 40088005
9 bids
Remote project
Last activity: 1 month ago

Chennai, India
Payment method verified
Member since Apr 22, 2020
₹600-1500 INR