The project consists of creating a web page with the Flask framework for data visualization.
The idea is to let the user upload their dataset (in a format such as CSV, TSV, etc.)
and then customize the import with a few options (choosing the separator, whether a header row exists, whether strings are quoted, etc.).
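To illustrate the import options, here is a minimal sketch using Python's standard csv module. The function name load_table and its parameters are my own assumptions for illustration; the final implementation could just as well use another parser.

```python
import csv
import io


def load_table(text, delimiter=",", has_header=True, quoted=True):
    """Parse uploaded text into (header, rows) using the user's options.

    All names here are hypothetical; they only mirror the options listed above.
    """
    # QUOTE_NONE tells the reader to treat quote characters as ordinary text.
    quoting = csv.QUOTE_MINIMAL if quoted else csv.QUOTE_NONE
    reader = csv.reader(io.StringIO(text), delimiter=delimiter, quoting=quoting)
    rows = [row for row in reader if row]  # skip empty lines
    if not rows:
        return [], []
    if has_header:
        return rows[0], rows[1:]
    # No header row: generate placeholder column names col1, col2, ...
    header = [f"col{i + 1}" for i in range(len(rows[0]))]
    return header, rows
```

For example, a TSV upload with a header would be read with load_table(text, delimiter="\t"), and a semicolon-separated file without a header with load_table(text, delimiter=";", has_header=False).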
Once the dataset is loaded, the user can choose the type of each column (numeric, string, etc.).
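A rough sketch of how a chosen type could be applied to a column is shown here. The type names "numeric" and "string" and the function cast_column are assumptions for illustration only.

```python
def cast_column(values, kind):
    """Convert raw string values to the type the user selected.

    `kind` is a hypothetical label: "numeric" or "string".
    """
    if kind == "numeric":
        # Empty cells become None so they can be treated as missing values.
        return [float(v) if v != "" else None for v in values]
    if kind == "string":
        return [str(v) for v in values]
    raise ValueError(f"unsupported column type: {kind}")
```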
After that, the user can create an interactive Sankey diagram by choosing the desired columns. (Please note that if the user chooses a continuous variable, it should be binned into 10 bins of 10% of the rows each, i.e. deciles.)
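The decile binning and the link counts a Sankey diagram needs could be sketched as follows, using only the standard library. The labels "D1" to "D10", the "column=value" node naming, and both function names are assumptions, and rows are assumed to be dictionaries keyed by column name.

```python
from bisect import bisect_right
from collections import Counter
from statistics import quantiles


def decile_bins(values):
    """Map each numeric value to a decile label from "D1" to "D10"."""
    # quantiles(n=10) returns the 9 cut points separating the 10 bins.
    cuts = quantiles(values, n=10, method="inclusive")
    # bisect_right counts how many cut points are <= the value (0 to 9).
    return [f"D{bisect_right(cuts, v) + 1}" for v in values]


def sankey_links(rows, columns):
    """Count how many rows flow between each pair of adjacent columns."""
    links = Counter()
    for row in rows:
        for left, right in zip(columns, columns[1:]):
            source = f"{left}={row[left]}"
            target = f"{right}={row[right]}"
            links[(source, target)] += 1
    return links
```

A continuous column would first be replaced by its decile labels, and the resulting rows passed to sankey_links with the columns in the order the user selected them. Note that heavily tied values can make some deciles larger than others.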
The Sankey diagram should be interactive: when the user hovers over a node, every full path through that node that actually has data in the dataset should be highlighted.
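One possible way to work out which links to highlight is to keep only the rows that pass through the hovered node and collect every link those rows touch. This is just a sketch of the logic on the Python side; the function name and the (column, value) node representation are assumptions, and the actual highlighting would happen in the browser.

```python
def highlighted_links(rows, columns, hovered_column, hovered_value):
    """Return the set of links lying on a full path through the hovered node.

    Each link is ((left_column, left_value), (right_column, right_value)).
    """
    links = set()
    for row in rows:
        # Only rows that actually contain the hovered node contribute paths.
        if row[hovered_column] != hovered_value:
            continue
        for left, right in zip(columns, columns[1:]):
            links.add(((left, row[left]), (right, row[right])))
    return links
```

Because the links come from real rows, paths that are possible in the diagram but absent from the dataset are never highlighted.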
Please see these two URLs to get an idea:
- [login to view URL] (starting at minute 1:55)
- [login to view URL]
Please also see the attached video to get an idea of the page design and of the type of visualization to create.
Finally, the page should communicate with Python scripts (pushing and pulling data), because I intend to call some machine learning models that I've developed in Python.
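As a rough idea of the push/pull exchange, a Flask route could accept rows as JSON and return the model output as JSON. The route path /api/predict, the payload shape, and the placeholder run_model function are all assumptions; run_model only stands in for the real models.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)


def run_model(rows):
    """Hypothetical stand-in for a real model: returns each row's length."""
    return [len(row) for row in rows]


@app.route("/api/predict", methods=["POST"])
def predict():
    # The page pushes data as JSON, e.g. {"rows": [[1, 2], [3]]}.
    payload = request.get_json(silent=True) or {}
    rows = payload.get("rows", [])
    # The result is sent back as JSON for the page to pull into the chart.
    return jsonify({"predictions": run_model(rows)})
```

The page would call this route with a POST request from JavaScript and update the visualization with the returned values.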
Please contact me for more details.
I've attached two toy datasets to use for small tests.
Thanks in advance for your reply and help.