I need someone to help me put in place a simple data scraping system from public social media pages originating in Denmark. There are tons of methods and software to do this. I think this could be accomplished on Google Data Studio, but I'm open to other options.
The goal is to get an overview of all the public mentions of a specific topic on Danish social media, and to be able to keep track of them. Because of the topic and location restrictions, it should be possible to cover a bigger date range, maybe from 2000 onwards. The topic can be identified with keywords, which will be both in English and Danish.
The first task is the scraping of Pages from Facebook, YouTube, Twitter, Instagram, and LinkedIn originating in Denmark mentioning certain keywords. This is only about public pages and channels. The second task is to find a way to have an overview of the data, so that it is more accessible than just a spreadsheet crammed with text. I want to know how many times certain words have been mentioned, where, and how often each year. I also want to have a way of removing false positives and clean the dataset. Since I will need the data of next year as well, it shouldn't be a static dataset.
I have gained more than enough experience in web scraping in past two years. Github: [login to view URL]