Implementing the procedures for creating and maintaining an index of scientific publications - 2


Institute of Neuroinformatics

University of Zurich & ETH Zurich

Project Objective:

Create an ElasticSearch index of scientific publications (metadata and full text) by aggregating content from various data sources (i.e. scientific publication databases). Keep the index updated as new content is added to the data sources. Implement a flexible workflow so that additional data sources can be integrated in the future and changes to the data sources' APIs can be handled.

Duration: ~1 month

Technologies to use:

Node.js, Python, Elastic Stack, Docker. Open to other suggestions.

Data sources (example list):

1) Crossref ([login to view URL]): contains the metadata of all publications having a digital object identifier (DOI). Content can be downloaded by querying the database through a REST API in a rate-limited fashion.

2) MEDLINE/PubMed ([login to view URL]): contains metadata and abstracts of most publications related to life sciences. Also contains publications without DOIs. Content can be bulk downloaded.

3) CORE ([login to view URL]): An aggregate database of most open access publications, including full text of some. If full-text is not available (e.g. papers from arXiv), a link is provided to the original source, which should be crawled to fetch the full text content. Contains data from large databases such as arXiv, CiteSeerX. Database can be bulk downloaded.

The workflow should be flexible to include additional data sources as they become available.
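A rate-limited source could be harvested along the following lines. This is only a sketch under stated assumptions: the source is assumed to return results page by page with a cursor, and `harvest`, `fetch_page`, `fake_fetch` and the sample DOIs are hypothetical names and values introduced for illustration. The real HTTP call for a given data source would replace `fake_fetch`.

```python
import time
from typing import Callable, Dict, Iterator, List, Optional, Tuple

# A page fetcher takes a cursor (None for the first page) and returns
# (records, next_cursor). next_cursor is None when no pages remain.
PageFetcher = Callable[[Optional[str]], Tuple[List[Dict], Optional[str]]]


def harvest(fetch_page: PageFetcher,
            min_interval: float = 1.0,
            sleep: Callable[[float], None] = time.sleep,
            clock: Callable[[], float] = time.monotonic) -> Iterator[Dict]:
    """Yield every record from a paged source, spacing requests out."""
    cursor: Optional[str] = None
    last_call: Optional[float] = None
    while True:
        # Wait until at least min_interval seconds have passed
        # since the previous request was started.
        if last_call is not None:
            wait = min_interval - (clock() - last_call)
            if wait > 0:
                sleep(wait)
        last_call = clock()
        records, cursor = fetch_page(cursor)
        yield from records
        if cursor is None:
            break


# Stand-in for a real HTTP call: two made-up pages held in memory.
_FAKE_PAGES = {
    None: ([{"DOI": "10.1000/a"}, {"DOI": "10.1000/b"}], "page-2"),
    "page-2": ([{"DOI": "10.1000/c"}], None),
}


def fake_fetch(cursor: Optional[str]) -> Tuple[List[Dict], Optional[str]]:
    return _FAKE_PAGES[cursor]
```

Passing the fetcher in as a function keeps the rate limiting independent of any one data source, so an API change would only touch the fetcher. For example, `list(harvest(fake_fetch, min_interval=0.5))` walks both fake pages with a pause between the two requests.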

Project Parts/Tasks:

Different tasks should be handled by individual Docker microservices

1) Downloading and parsing the entire content of the listed data sources and indexing it in individual ElasticSearch indices. The implementation for parsing the data sources needs to be template based, i.e. the same functions can be reused with a different template for each data source.

2) Extracting the content of PDF files (in an unstructured format) if a data source only provides PDFs (e.g. CORE)

3) Aggregating downloaded content from the data sources in a “meta” ElasticSearch index

4) Keeping the meta index updated as new publications appear

5) Maintaining the meta index: handling duplicates, handling different versions of a publication (e.g. arXiv preprints vs their final publication in a journal), adding new fields to the index, etc.
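Tasks 1, 3 and 5 could be sketched as follows. This is one possible reading of "template based", not a prescribed design: each data source gets a template that maps target index fields to paths in its raw records, and parsed documents are merged into the meta index under a deduplication key. The source names, raw field names and function names are all invented for illustration, and a plain dict stands in for the meta index.

```python
from typing import Any, Dict, List, Optional

# One template per data source: target field -> path into the raw record.
# The raw field names below are invented for illustration.
TEMPLATES: Dict[str, Dict[str, List[str]]] = {
    "source_a": {"title": ["title"], "doi": ["DOI"], "source_id": ["id"]},
    "source_b": {"title": ["article", "name"], "doi": ["ids", "doi"],
                 "source_id": ["pmid"]},
}


def dig(record: Dict[str, Any], path: List[str]) -> Optional[Any]:
    """Follow a list of keys into nested dicts; None if a key is missing."""
    node: Any = record
    for key in path:
        if not isinstance(node, dict) or key not in node:
            return None
        node = node[key]
    return node


def parse(record: Dict[str, Any], source: str) -> Dict[str, Any]:
    """Apply the template of `source` to one raw record."""
    doc = {field: dig(record, path) for field, path in TEMPLATES[source].items()}
    doc["data_source"] = source
    return doc


def dedup_key(doc: Dict[str, Any]) -> str:
    """Prefer the DOI (lower-cased); fall back to a normalized title."""
    if doc.get("doi"):
        return "doi:" + doc["doi"].lower()
    title = doc.get("title") or ""
    return "title:" + " ".join(title.lower().split())


def merge_into_meta(meta: Dict[str, Dict[str, Any]],
                    doc: Dict[str, Any]) -> Dict[str, Dict[str, Any]]:
    """Add doc to the meta index, merging it into an existing duplicate."""
    entry = meta.setdefault(dedup_key(doc), {"sources": []})
    for field, value in doc.items():
        if field in ("data_source", "source_id"):
            continue
        # Keep the first non-empty value seen for each field.
        if value is not None and entry.get(field) is None:
            entry[field] = value
    entry["sources"].append({"data_source": doc["data_source"],
                             "source_id": doc.get("source_id")})
    return meta
```

Adding a data source then means adding a template rather than new parsing code. The key used here would not link a preprint to a journal version that carries a different DOI; that case would need an extra matching step, for example on normalized title and authors.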

We do not ask for the delivery of a database, but the tools to populate it. The source code of your implementation needs to be delivered. Please do not submit code with potential license issues. Third party software/libraries can be used if they are FOSS.

We do not ask for a GUI.

ElasticSearch index fields (not exhaustive):

Title, journal, page, publication date, authors, affiliations, abstract, full-text, references, figures, data source, data source ID, DOI
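The fields listed above could translate into an index mapping roughly like the one below. The field types are assumptions made for illustration, not requirements from the posting: free-text fields are mapped as `text`, identifiers as `keyword`, and the publication date as `date`. The snake_case field names and the typeless mapping layout (as used by Elasticsearch 7 and later) are likewise assumptions.

```python
import json

# Hypothetical mapping for the "meta" index; types are illustrative guesses.
META_INDEX_MAPPING = {
    "mappings": {
        "properties": {
            "title": {"type": "text"},
            # Analyzed for search, plus an exact-match sub-field.
            "journal": {"type": "text",
                        "fields": {"raw": {"type": "keyword"}}},
            "page": {"type": "keyword"},
            "publication_date": {"type": "date"},
            "authors": {"type": "text"},
            "affiliations": {"type": "text"},
            "abstract": {"type": "text"},
            "full_text": {"type": "text"},
            "references": {"type": "text"},
            "figures": {"type": "text"},
            "data_source": {"type": "keyword"},
            "data_source_id": {"type": "keyword"},
            "doi": {"type": "keyword"},
        }
    }
}


def mapping_json() -> str:
    """Serialize the mapping as the JSON body of an index-creation request."""
    return json.dumps(META_INDEX_MAPPING, indent=2)
```

Since the field list is not exhaustive, new fields can be added to `properties` later; changing the type of an existing field generally requires reindexing.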

Skills: Docker, Elasticsearch, Node.js, Python

About the Employer:
(0 reviews) Zurich, Switzerland

Project ID: #18920857

31 freelancers are bidding an average of €1336 for this job


Hello! I am a programming expert! I am interested in your project. I want to work with you. If you hire me, I am going to do my best for your project. I hope you have a dialog with me so that I can understand your requi…

€1250 EUR in 20 days
(4 reviews)

https://www.freelancer.com/projects/software-architecture/Elastic-search-kibana-cuckoo-API https://www.freelancer.com/projects/javascript/Webdevelopment-Project-for-Shadab/ done the similar tasks using Python, Elasti…

€2352 EUR in 20 days
(27 reviews)

Hi, Dear Employer! I am really interested in your project. I have enough experience in Python, C/C++, C#, Java programming. I am 100% sure I can satisfy your requirements perfectly. User-Friendly Interface And Cle…

€1250 EUR in 20 days
(21 reviews)

Greetings! My name is Huapeng Li. I would love to have the opportunity to discuss your project with you. Are you okay with creating the project using the latest Node/Python-Django? I have 6 years of experience with No…

€1400 EUR in 20 days
(8 reviews)

Dear Sir. Your project is very interesting to me and I have good experience in indexing. I can assure you that I will be able to submit your task on time with quality work. I want to discuss about your proje…

€1250 EUR in 20 days
(15 reviews)

Hi, Client. Nice to meet you. I'm very confident about your project because I have a lot of experience in this IT field. I'm a full stack developer who can handle all of mobile app/web application and backend. I'm ver…

€1250 EUR in 20 days
(4 reviews)

Hello sir, I read your proposal and I am interested in your project. I have rich experience in web development. My main skills are PHP, JavaScript, Node.js, Python, Elasticsearch, Docker. I will do my best to satisfy you b…

€900 EUR in 15 days
(11 reviews)

Hello, as a core developer I have the relevant skills and experience you requested in your project description. I can share some demos as well in further chat. Can we discuss more on this to get detail understandin…

€1500 EUR in 20 days
(3 reviews)

Hello! How are you? I have seen the project "Implementing the procedures for creating and maintaining an index of scientific publications - 2." I have been working in these fields (Docker, Elasticsearch, node.js,…

€800 EUR in 3 days
(18 reviews)

Hi there, I have checked the details. I have great experience with Docker, Elasticsearch, node.js, Python. Please start the chat so we can discuss this job in more detail. Thanks

€1250 EUR in 20 days
(3 reviews)

Hello! I am a python developer. I looked at your project and it seems interesting. I have all necessary skills required for this project. Ping me to discuss in detail.

€1125 EUR in 2 days
(23 reviews)

Everlytics is an Enterprise AI company headquartered in Singapore with an offshore development team in Bangalore, India. We as a team have an exhaustive technical skill set and have delivered a number of projects succe…

€1333 EUR in 20 days
(3 reviews)

Hi there, your job post has caught my attention and I am pleased to inform you that I can do this job of yours as I have excellent experience in the mentioned technologies. Thanks

€1250 EUR in 20 days
(8 reviews)

Dear sir, I am a web and mobile developer with 12 years of experience. My profile: [login to view URL] I also have a software team in Vietnam; we have 6 years in the software industry with many running products in mobile…

€1250 EUR in 20 days
(11 reviews)

Hello Manager, Hope things are going well in your life!! We are a team of 50+ individuals highly experienced in creating wonderful Web applications and apps for our clients to increase their business and to he…

€1110 EUR in 20 days
(4 reviews)

I can do this. I'm UK based and a native English speaker. Fully qualified in all blockchain development technologies, web crawler technologies and nodejs, also an early crypto investor. We are the owners and developers o…

€1250 EUR in 20 days
(1 review)

Hello! I am good at web development (Python/Django or PHP/Laravel) and I am very much familiar with the databases you have mentioned. I can deliver the product with full code transfer in 25 days. If you liked my bid,…

€750 EUR in 25 days
(9 reviews)

Hello, I am an expert in Django, Python and JS frameworks like Angular. I have been working with Django for the last 5 years. Please share some details about the project. I am a full stack Python/Django individual developer having mix experienc…

€1250 EUR in 20 days
(1 review)

Hey, there, Please if possible give me the list of features and also reference that would be great for me. Please come over the chat for the further detailed discussion. Thanks

€1250 EUR in 20 days
(1 review)

I would like to offer to work with ArangoDB, with the limited experience that I have with it, as it has an ArangoSearch feature in competition with Elasticsearch, with the gist of it that it supports graphs in case that you wou…

€1250 EUR in 20 days
(0 reviews)