This is a very simple utility which will automate the function which can be done easily by a human child. It is very easy, but is very repetitive, hence the need for automation.
Input xls file. Column A contains rows of web page URLs. Column B contains URLs of web links which may or may not be found within the web pages in Column A. Column C is blank, to be filled (by the utility) with the output RSS feeds.
The utility will scan the web page URL in Column A and check if the web page contains the link given in Column B. If it does, this is ignored and the words 'NOT FOUND' inserted into Column C. This MUST be a live link which is pointing to the specific URL in Column B (I will leave it to the coder to decide how this is done - whether to look for it in the web page itself, or in the source code, etc.).
However, if the link is found in the web page the RSS feed is obtained by going to the feedage html2rss page, entering the captcha, and copying the RSS feed URL into Column C. (I have an account with both Decaptcher and DeathByCaptcha.)
There will be a time delay of a random number of seconds (between 21 and 51) before starting on the next row in the xls file. Also there will be a secondary time delay of between 11 minutes and 14 minutes 24 seconds every completed 23 rows in an input XLS file.
It may be that Col. B contains no URL. In this case the checking stage is obviously bypassed, and the RSS feed is created in the normal way without any check beforehand, and the RSS URL pasted into Col. C. (So in cases where Col. B is blank, 'NOT FOUND' cannot be in Col. C).
Output will be pasted into column C of the input XLS file. No other output will be necessary.