I am looking for a scraper that visits the websites [url removed, login to view] and [url removed, login to view] and downloads articles from the section Wirtschaft. The articles should be stored as plain text.
I welcome bids from people with experience and personal libraries in this field, who can also educate me about what is possible and what not and about the pros and cons of different techniques. For this project, other bids will not be considered.
The result would be a Java API with a couple of options like name of newspaper, date, section, or title keyword.