Our site was created to give investors and traders a central place to find pertinent information about the financial markets. We collect Twitter and Facebook posts about a specific stock, futures, or forex symbol by selecting every message on a public timeline that has a $ before the ticker symbol.
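As a starting point for discussion, the selection rule above could be sketched in Python as follows. The pattern, the six-letter limit, the optional suffix (as in $ES_F or $BRK.B), and the function name extract_cashtags are all our own assumptions for illustration, not a finished specification.

```python
import re

# Assumed symbol shape: 1-6 letters, optionally followed by "." or "_"
# and a 1-2 letter suffix. The lookarounds reject prices such as $100
# and symbols glued to other text such as US$AAPL.
CASHTAG = re.compile(
    r"(?<![A-Za-z0-9$])\$([A-Za-z]{1,6}(?:[._][A-Za-z]{1,2})?)(?![A-Za-z0-9])"
)

def extract_cashtags(message: str) -> list:
    """Return the ticker symbols found in a message, upper-cased, in order."""
    return [symbol.upper() for symbol in CASHTAG.findall(message)]

print(extract_cashtags("Long $AAPL and $eurusd, paid $100 for lunch"))
# prints ['AAPL', 'EURUSD']
```

A message with an empty result would not be selected at all.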
We need regular expression patterns that match strings of text, such as particular characters, words, or patterns of characters, so that we can provide a better service to our users. Specifically, we need to filter out messages that:
1 - contain obscenities or other bad words;
2 - have no clear meaning according to specific semantic models (in English and in Italian). For example, we want to exclude messages that contain words without financial meaning.
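To make the two rules concrete, here is a rough Python sketch. The word lists are tiny placeholders we made up, and checking for a financial keyword is only a crude stand-in for the semantic models mentioned in point 2, which still have to be defined. The name keep_message is hypothetical.

```python
import re

# Placeholder lists (assumptions): a real deployment needs vetted
# English and Italian lexicons.
OBSCENITY = re.compile(r"\b(?:damn|crap|merda)\b", re.IGNORECASE)
FINANCE_TERMS = re.compile(
    r"\b(?:buy|sell|long|short|earnings|dividend"   # English
    r"|compra|vendi|rialzo|ribasso|dividendo)\b",   # Italian
    re.IGNORECASE,
)

def keep_message(text: str) -> bool:
    """Apply rule 1 (no bad words), then a keyword stand-in for rule 2."""
    if OBSCENITY.search(text):
        return False
    return FINANCE_TERMS.search(text) is not None

print(keep_message("Going long $AAPL before earnings"))  # prints True
print(keep_message("$AAPL lol"))                         # prints False
```

The \b word boundaries keep "long" from matching inside "along" and "short" from matching inside "shortly".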
Regular Expression Objects
We want a regular expression object so we can use it efficiently throughout our application.
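One possible way to get such an object in Python is to build the pattern from a word list once and cache the compiled result, so every part of the application shares the same object. The helper name word_pattern and the sample words are assumptions for illustration only.

```python
import re
from functools import lru_cache
from typing import Tuple

@lru_cache(maxsize=None)
def word_pattern(words: Tuple[str, ...]) -> "re.Pattern":
    """Compile one case-insensitive whole-word pattern for a tuple of words."""
    # Longest words first, so a short alternative never shadows a longer one.
    ordered = sorted(words, key=len, reverse=True)
    # re.escape keeps characters such as "." or "$" literal.
    alternation = "|".join(re.escape(word) for word in ordered)
    return re.compile(rf"(?<!\w)(?:{alternation})(?!\w)", re.IGNORECASE)

terms = word_pattern(("buy", "sell"))
print(terms is word_pattern(("buy", "sell")))       # prints True
print(bool(terms.search("Time to SELL $TSLA")))     # prints True
```

The word list is passed as a tuple because lru_cache requires hashable arguments.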
Regarding point 2, we will need to proceed by trial and error, testing solutions inspired by real cases, so we are looking for a long-term collaboration.