Write a Python script that will parse PubChem to download all chemicals with given properties and run this script

There is a public website listing chemical compounds, called PubChem:

[url removed, login to view]

We need to download information about all molecules with fewer than 11 atoms.

It can be done in the following way:

1. Use advanced search available on the website:

[url removed, login to view]

and search for the following string:

((0:10[HeavyAtomCount]) AND 0:0[TotalFormalCharge]) AND 0:0[IsotopeAtomCount]

It will return the list of all compounds with at most 10 heavy atoms, but some of them are ionic compounds rather than molecules, and some contain more than 10 atoms once hydrogens are counted.

2. We need to sort the results by complexity

3. Then we need to check all the results and use two filters:

Filter A: remove compounds with more than 10 atoms in Molecular Formula

Filter B: remove compounds that contain a dot sign (".") in Canonical SMILES

4. All the compounds that are not removed by those filters should be collected in a CSV text file that contains the following columns:

* PubChem CID

* Molecular Formula

* Canonical SMILES

* Molecular Weight

* Chemical Names

* IUPAC Name

* Whether a 2D structure XML file is available (yes/no)

* Whether a 3D structure XML file is available (yes/no)

5. For each compound that matches our filters, we should also download its 2D and 3D structures as XML files and place them in two folders. File names should be like "[url removed, login to view]" and "[url removed, login to view]", where 101826982 is the PubChem CID of the compound.
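Steps 2 and 3 could be sketched roughly as follows. This is only an illustration under stated assumptions: each compound is assumed to be a dict keyed by the property names used in the brief (MolecularFormula, CanonicalSMILES, Complexity), the helper names are hypothetical, and the formula parser handles only plain formulas such as "C2H6O" (no brackets or hydrates).

```python
import re

# One element symbol followed by an optional count, e.g. "C2", "H", "Cl3".
_ELEMENT = re.compile(r"([A-Z][a-z]?)(\d*)")


def count_atoms(formula):
    """Count all atoms, hydrogens included, in a plain molecular formula."""
    total = 0
    for _symbol, digits in _ELEMENT.findall(formula):
        total += int(digits) if digits else 1
    return total


def passes_filters(compound, max_atoms=10):
    """Apply Filter A (atom count) and Filter B (no '.' in the SMILES)."""
    few_enough_atoms = count_atoms(compound["MolecularFormula"]) <= max_atoms
    single_molecule = "." not in compound["CanonicalSMILES"]
    return few_enough_atoms and single_molecule


def select_compounds(compounds, max_atoms=10):
    """Keep compounds that pass both filters, sorted by complexity."""
    kept = [c for c in compounds if passes_filters(c, max_atoms)]
    return sorted(kept, key=lambda c: float(c["Complexity"]))


if __name__ == "__main__":
    # Illustrative records only; CIDs and complexity values are made up.
    sample = [
        {"CID": 1, "MolecularFormula": "C2H6O", "CanonicalSMILES": "CCO", "Complexity": 2.8},
        {"CID": 2, "MolecularFormula": "H2O", "CanonicalSMILES": "O", "Complexity": 0},
        {"CID": 3, "MolecularFormula": "ClNa", "CanonicalSMILES": "[Na+].[Cl-]", "Complexity": 2},
        {"CID": 4, "MolecularFormula": "C3H8", "CanonicalSMILES": "CCC", "Complexity": 0},
    ]
    for compound in select_compounds(sample):
        print(compound["CID"], compound["MolecularFormula"])
```

Sorting after filtering keeps the sort cheap, and counting atoms from the formula rather than from HeavyAtomCount is what catches compounds whose hydrogens push them past the limit.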

The results:

The results of this project should be

A. A ZIP archive with the XML files containing the 2D and 3D structures of the compounds, and one [url removed, login to view] file.

B. Python script(s) that generate this CSV file and download the XML files
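A minimal sketch of how the CSV deliverable might be produced is shown below. The column headers follow the list in the brief, but their exact wording, the "; " separator for chemical names, and the helper names are assumptions. The property keys (CID, MolecularFormula, and so on) are assumed to match what the property request returns, and the chemical names are assumed to have been fetched separately.

```python
import csv
import io

# Column headers follow the brief; the exact wording is an assumption.
COLUMNS = [
    "PubChem CID",
    "Molecular Formula",
    "Canonical SMILES",
    "Molecular Weight",
    "Chemical Names",
    "IUPAC Name",
    "2D Structure XML",
    "3D Structure XML",
]


def yes_no(flag):
    """Render a boolean as the yes/no value requested in the brief."""
    return "yes" if flag else "no"


def make_row(props, names, has_2d, has_3d):
    """Build one CSV row from a property dict and a list of chemical names."""
    return {
        "PubChem CID": props["CID"],
        "Molecular Formula": props["MolecularFormula"],
        "Canonical SMILES": props["CanonicalSMILES"],
        "Molecular Weight": props["MolecularWeight"],
        "Chemical Names": "; ".join(names),
        "IUPAC Name": props.get("IUPACName", ""),
        "2D Structure XML": yes_no(has_2d),
        "3D Structure XML": yes_no(has_3d),
    }


def write_csv(rows, handle):
    """Write the header and all rows to an open text file handle."""
    writer = csv.DictWriter(handle, fieldnames=COLUMNS)
    writer.writeheader()
    writer.writerows(rows)


if __name__ == "__main__":
    # Illustrative record; the values are placeholders, not downloaded data.
    props = {
        "CID": 1,
        "MolecularFormula": "H2O",
        "CanonicalSMILES": "O",
        "MolecularWeight": "18.015",
        "IUPACName": "oxidane",
    }
    buffer = io.StringIO()
    write_csv([make_row(props, ["water", "oxidane"], True, False)], buffer)
    print(buffer.getvalue())
```

When writing to a real file, opening it with `newline=""` avoids doubled line endings on Windows, as the `csv` module documentation recommends.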

Deadline for this project: August 24th, 2017, 13:00 London time


For your information: PubChem supports API that makes this project much easier:

REST Tutorial:

[url removed, login to view]

REST Documentation:

[url removed, login to view]

Other API documentation:

[url removed, login to view]

List of properties:

[url removed, login to view]

Example how to download needed properties of several substances:

[url removed, login to view],129251212,5460638,5460696/property/MolecularFormula,MolecularWeight,CanonicalSMILES,Complexity,Charge,HeavyAtomCount,IsotopeAtomCount/XML
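A request of this shape could be assembled as in the sketch below. The base URL is elided in the posting, so the `PUG_REST` value here is an assumption taken from the public PUG REST documentation. The batch size of 100 CIDs per request and the helper names are likewise illustrative choices, not requirements from the brief.

```python
# Assumed base URL; the posting elides the real one.
PUG_REST = "https://pubchem.ncbi.nlm.nih.gov/rest/pug"

# Properties named in the example request from the brief.
PROPERTIES = [
    "MolecularFormula",
    "MolecularWeight",
    "CanonicalSMILES",
    "Complexity",
    "Charge",
    "HeavyAtomCount",
    "IsotopeAtomCount",
]


def chunked(items, size):
    """Yield consecutive slices of `items` with at most `size` elements."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def property_url(cids, properties=PROPERTIES, fmt="XML"):
    """Build a property request URL for a batch of CIDs."""
    cid_part = ",".join(str(cid) for cid in cids)
    prop_part = ",".join(properties)
    return f"{PUG_REST}/compound/cid/{cid_part}/property/{prop_part}/{fmt}"


if __name__ == "__main__":
    cids = [129251212, 5460638, 5460696]
    # Batch size of 100 is an arbitrary illustrative choice.
    for batch in chunked(cids, 100):
        print(property_url(batch))
```

Batching keeps each URL short and reduces the number of requests. The actual download (for example with `urllib.request.urlopen`) is left out so the sketch stays free of network calls.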

Python wrapper for PubChem:

[url removed, login to view]

Skills: Python

About the Employer:
(13 reviews) Amersham, United Kingdom

Project ID: #14952675

24 freelancers are bidding an average of $196 for this job


Hi, sir! I have a good skill in python programming. If you award this project to me, I'll complete it in time. Thank you in advance. Stay tuned, I'm still working on this proposal.

Bid: 250
(27 reviews)

First of all thank you for excellent description! I can create Python scraper and collect all data you want (including 2D and 3D files) in less than 3 days. Thanks. Roman Relevant Skills and Experience I Python deve...

Bid: 170
(121 reviews)

Dear Sir, I am interested in your job and I wish to work with you. I think you can check my profile and reviews and check me :) I have rich experience in these fields: Python, Software Architecture My account is ne...

Bid: 100
(5 reviews)

Bid: 252
(6 reviews)

We are experts in software development, worked in companies like Adobe, Dell etc. Java, PHP, Python, HTML, CSS, Javascript, Selenium with Python and Java, Web Development and Web Design, Web Scraping Relevant Skills a...

Bid: 155
(8 reviews)

please let me know if you want to get started. Relevant Skills and Experience python Proposed Milestones $250 USD - code

Bid: 250
(4 reviews)

Hi, I'm a professional software engineer with 4 years of experience in Python, Java, Scala. I can help you with the download of molecular data.

Bid: 110
(5 reviews)

Search PubChem for 10 atom compounds. Filter down the results based on the specified criteria. Convert to CSV. Relevant Skills and Experience Python Web Automation Web Services Chemistry Software Architecture...

Bid: 155
(6 reviews)

I read your project brief. I can do your project by using PubChemPy wrapper of Python to search for chemicals on PubChem according to the criteria you specified and deliver a CSV file with molecular data. Relevant S...

Bid: 180
(6 reviews)

Yes, I am new here, but we have been working on Python, Django, Web Crawling/Data Scraping for last 7 years. Relevant Skills and Experience We have used Flask and iFrame to achieve the desired results on Python 2 & 3...

Bid: 977
(4 reviews)

A proposal has not yet been provided

Bid in USD (amount not shown), in 1 day
(3 reviews)

Hi, I can extract and parse all the data you need about the chemical compounds with the specified properties, and generate the CSV and XML files. Deliver them in a ZIP, and the Python script. Relevant Skills and Expe...

Bid: 150
(2 reviews)

Hi, I have a web scraping history with python. I fully undestood your userstories and I also had a look API for it. I can provide you that you want.

Bid: 150
(1 review)

Hello, I have over 4 years of professional python experience. Let me help you with the implementation of your python tool. Relevant Skills and Experience Over 4 years of professional python programming experience...

Bid: 88
(1 review)

Hello. We were carefully reviewing the requirements of the job description, so our developers can work on your project without delay. We have years of working on projects related on any available CMS, from "scratch"...

Bid: 257
(0 reviews)

Hi, I hope you have not granted this project to someone else :) I have a script ready that does the followings: 1. get list of cids that match your search criteria 2. pull the required properties for all cids 3. store...

Bid: 165
(0 reviews)

I am 3rd year student of Indian Institute of Technology (BHU) Varanasi. I have good knowledge of Python and especially web scraping in Python. GitHub profile [login to view URL] Relevant Skills and Expe...

Bid: 155
(0 reviews)

I have been working with third party APIs to access needed information, such as Yahoo Financial API, Yandex maps API etc. Guess I will be able to perform your job good. If you are interested, I would like to connect an...

Bid: 194
(0 reviews)

Parse compounds from PubChem website, filter and scrape the results to extract desired information, to be delivered in .zip and .csv files, with specific naming scheme. PubChem's APIs are available. Relevant Skills an...

Bid: 222
(0 reviews)

Greetings.. Hi, I am representing the company named CS Infotech Pvt. [login to view URL] are a team of 40+ creative people who cater the market of web & Mobile app design & development along with Digital Marketing. Relevant Skills...

Bid: 155
(0 reviews)