I need a piece of software to go to one particular website, scrounge around different places for some data and consolidate it all into one simple report.
Here’s exactly what I need: there is this website
That is a list of every racehorse name that’s become available in the last year. It’s a constantly changing list. You’ll note that instead of them being all in one place, they’re broken up into 26 different links (starting with each letter).
What I need is something that will:
go into each letter and pull the entire list of names
the names are in four columns, so you have to deal with that
once you have the complete list from A-Z, you need to pick out the ones that are only one word
once you have the list of the ones that are one word, I want that list
also, I need you to run that list against a dictionary and pull out the one-word names that are also “real” words (since many horse names that look like one word are just nonsensical combinations of multiple words)
And that’s it.. what I want as output (in plain text is fine) are those two lists… one, all the one-word names… and two, a filtered list of actual words as validated against an English dictionary
This can be run in a browser, OSX or Windows… platform doesn’t matter.