More Federal Agency Rule Reports -- Data Scraping

I need data on United States federal agency Major Rule Reports from 2013, scraped from the US Government Accountability Office (GAO) Congressional Review Act website ([url removed, login to view]) and put into a spreadsheet. This requires searching the 2013 Major Rule Reports for each of 113 agencies and recording information from each report. There are about 80 reports in total. Each report will be one row in the spreadsheet (i.e., about 80 rows). The columns for each entry should be: agency, sub-agency, subject line of the letter to the committees, date of the letter, the committees to which the letter was addressed, the RIN as reported in the enclosure, and the statutory authorization for the rule as reported in the enclosure.
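To make the requested layout concrete, here is a minimal Python sketch of the spreadsheet as a CSV file with one row per report. The column names and the `build_header` and `write_reports` helpers are made up for illustration and are not part of the request; the committee columns go last, and their number grows with the largest number of committees found in any letter.

```python
import csv

# Hypothetical column names for the fields listed in the description.
BASE_COLUMNS = [
    "agency",
    "sub_agency",
    "subject",
    "letter_date",
    "rin",
    "statutory_authorization",
]


def build_header(max_committees):
    """Return the column list, with committee columns placed at the end."""
    committees = [f"committee_{i}" for i in range(1, max_committees + 1)]
    return BASE_COLUMNS + committees


def write_reports(reports, out):
    """Write one row per report to the open text file `out`.

    Each report is a dict with the BASE_COLUMNS keys plus a
    "committees" list. Returns the header that was written.
    """
    # Letters normally name two committees, so never use fewer columns.
    widest = max((len(r["committees"]) for r in reports), default=0)
    header = build_header(max(2, widest))
    writer = csv.DictWriter(out, fieldnames=header, restval="")
    writer.writeheader()
    for report in reports:
        row = {key: report.get(key, "") for key in BASE_COLUMNS}
        for i, name in enumerate(report["committees"], start=1):
            row[f"committee_{i}"] = name
        writer.writerow(row)
    return header
```

A CSV file opens directly in Excel. When writing to a real file, open it with `newline=""` as the `csv` module documentation recommends.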

The scraping is somewhat involved. We will provide a list of agencies, and a search on the GAO site above will have to be conducted for each agency on the list. Each search should use the exact agency name from the list, surrounded by quotation marks (e.g., “Department of Agriculture”); otherwise the search will pick up reports that are not associated with that agency. To run the searches, the scraper will have to go to the website above and click on the “Search Major Rules Reports” link, or go directly to: [url removed, login to view]:2:{s:4:%22site%22;s:12:%22Publications%22;s:7:%22subsite%22;s:25:%22Federal+Agency+Major+Rule%22;}.
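The exact-phrase requirement can be sketched in Python as follows. This only shows how to wrap an agency name in straight quotation marks and URL-encode it; the helper names are hypothetical, and how the encoded term is attached to the GAO search URL is not shown because the address is not available here.

```python
from urllib.parse import quote_plus


def quoted_query(agency_name):
    """Wrap the exact agency name in straight double quotes."""
    return f'"{agency_name.strip()}"'


def encoded_query(agency_name):
    """URL-encode the quoted name for use as a query-string value."""
    return quote_plus(quoted_query(agency_name))


# Assumption: the agency list is a plain list of names, one per agency.
agencies = ["Department of Agriculture"]
search_terms = [encoded_query(name) for name in agencies]
```

`quote_plus` encodes a double quote as `%22` and a space as `+`, which appears to match the encoding visible in the direct link above.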

Each report will have to be clicked on and the scraper will have to go to the “View Decision” tab for each report. This will lead the scraper to a letter from the GAO to two or more committees in the U.S. Congress (in most cases only two). These letters will include all of the information I need for each report.

The agency name is self-explanatory. The sub-agency, if there is one, will be listed after the comma following the agency name, either in the title or in the “Subject” line of the letter.

The subject line comes after the addresses in the letter and begins with the word “Subject.” The date of the letter is at the top of the letter.

The committees are listed as part of the address at the top of the letter. Each begins with the word “Committee” (e.g., Committee on Agriculture, Nutrition, and Forestry). There are normally two committees listed; on rare occasions there may be a third or fourth. There should be one column for the first committee listed, one for the second, one for the third, etc. It probably makes sense to put the committee columns at the end of the spreadsheet.

The RIN is usually included in a couple of places in each letter, often in the first paragraph and later in the part of the letter called “ENCLOSURE”. It is normally in parentheses in the following format: (RIN: 0560-AH92).

The statutory authorization for each rule is listed after a section of the enclosure called “Statutory authorization for the rule.” The language usually says something like “The final rule is authorized by the ADA Amendments Act of 2008, Pub. L. No. 110-325.” We do not need the “The final rule is authorized by the.” If possible, we only need “ADA Amendments Act of 2008, Pub. L. No. 110-325.”
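The field rules above could be sketched in Python with regular expressions roughly as follows. This is only an illustration under several assumptions: the letter is already available as plain text, the date is written like “January 15, 2013”, the sub-agency sits between the comma and the next colon in the subject line, and the lead-in phrase of the authorization sentence has the form “The ... rule is authorized by the”. The `parse_letter` function and the `SAMPLE` letter are made up; the sample is not real GAO data and simply reuses the examples given above.

```python
import re

MONTHS = ("January|February|March|April|May|June|July|August|"
          "September|October|November|December")
DATE_RE = re.compile(rf"\b(?:{MONTHS})\s+\d{{1,2}},\s+\d{{4}}\b")
COMMITTEE_RE = re.compile(r"^[ \t]*(Committee on [^\n]*?)[ \t]*$", re.M)
SUBJECT_RE = re.compile(r"^Subject:?[ \t]*(.+?)(?=\n[ \t]*\n|\Z)", re.M | re.S)
RIN_RE = re.compile(r"\(RIN:\s*(\d{4}-[A-Z0-9]{4})\)")
AUTH_RE = re.compile(
    r"Statutory authorization for the rule\.?\s*(.+?)(?=\n[ \t]*\n|\Z)",
    re.I | re.S)
AUTH_PREFIX_RE = re.compile(
    r"^The (?:\w+ )*?rules? (?:is|are) authorized by (?:the )?", re.I)


def _first(pattern, text, group=0):
    """Return the first match with whitespace collapsed, or ''."""
    match = pattern.search(text)
    return " ".join(match.group(group).split()) if match else ""


def parse_letter(text, agency):
    """Extract the requested fields from one letter's plain text."""
    subject = _first(SUBJECT_RE, text, 1)
    sub_agency = ""
    rest = subject[len(agency):] if subject.startswith(agency) else ""
    if rest.startswith(","):
        # Assumed layout: "<agency>, <sub-agency>: <rule title>"
        sub_agency = rest[1:].split(":", 1)[0].strip()
    authorization = AUTH_PREFIX_RE.sub("", _first(AUTH_RE, text, 1))
    return {
        "agency": agency,
        "sub_agency": sub_agency,
        "subject": subject,
        "letter_date": _first(DATE_RE, text),
        "committees": COMMITTEE_RE.findall(text),
        "rin": _first(RIN_RE, text, 1),
        "statutory_authorization": authorization,
    }


# Made-up letter for illustration only; names and title are placeholders.
SAMPLE = """January 15, 2013

The Honorable Jane Doe
Committee on Agriculture, Nutrition, and Forestry
United States Senate

The Honorable John Roe
Committee on Agriculture
House of Representatives

Subject: Department of Agriculture, Example Service: Sample Rule Title

This is our report on a major rule (RIN: 0560-AH92).

ENCLOSURE

Statutory authorization for the rule

The final rule is authorized by the ADA Amendments Act of 2008, Pub. L. No. 110-325.
"""

record = parse_letter(SAMPLE, "Department of Agriculture")
```

Real letters will vary, so any field that comes back empty should be checked by hand rather than trusted.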

Skills: Data Entry, Excel, Web Scraping


About the Employer:
(9 reviews) Nashville, United States

Project ID: #4407778



Can be done.

Bid: 30
(116 reviews)

3 freelancers are bidding an average of $93 for this job


Let us discuss in private message.

Bid: 150
(27 reviews)

Hello, I'm interested.

Bid: 100
(0 reviews)

I can complete this task for you efficiently. As an economics student, I have experience in both research and data collection/entry. Hope to hear from you!

Bid: 99
(0 reviews)