1) Author’s name and e-mail address
2) The title and purpose of your study.
Discuss why you are examining this topic. Clearly state the problems to be investigated and the objective of the study.
3) A list of variables and clearly state the sources (URL web addresses) for your data.
* Define the target variable, and use at least 4 predictor variables ( x1 , x2 , x3 , x4 ).
* If you are collecting your own data and if it is hard to collect a large number of observations, a sample size at least 30 to 100 or is acceptable.
If the dataset is from the data science website (e.g., like [login to view URL]), the number of observations may be much larger. If the data set is too large, you can select a subset of it (e.g., a thousand or several hundreds) for your study.
4) Analysis.
Training and validation data: Divide the data randomly into training (60%) and validation (40%) partitions. The training data is used in developing a model, and its usefulness is tested in the validation data. (Note: If the sample size is small, you can consider partition the data into 80% training and 20% validation so that there is at least 20+ observations in the training data to build model).
Model building:
Use the training data to develop classification model using Logistic Regression technique.
You can start with the full model with all the X’s. Based on the p-value, remove insignificant predictors. (You can use 20% or 30% as the significance level). Try a few models and select one that you think is the best overall (based on the classification confusion matrix and the various metrics like the false positive, false negative, overall percentage error; and the accuracy, sensitivity, specificity of the model). Comment on the model performance for both the training and validation data.
5) Summary and Conclusion
Write a summary, conclusion and insights about the data you study based on the analysis that you have done.
Hi,
Hope you are doing well.
I have over 6 years of rich experience in data science and machine learning.
I have worked hands on in Python, R, XLMiner, Knime with different datasets for data wrangling, data manipulation, data analysis etc. I have worked on several kinds of ML techniques like regression, classification and clustering.
I understand your problem completely. Since I have worked on such problems in the past, I am sure I would be able to deliver your work.
Looking forward to work with you
$80 USD 3 gün içinde
4,9 (9 değerlendirme)
4,3
4,3
2 freelancer bu proje için ortalama $53 USD teklif veriyor
Hey!
I am a professional data analyst. I have more than 7 years of working experience in the field of data analysis for business, economics, social sciences and science. I have expertise in SPSS, Stata, Excel and python. I can apply advanced statistical analysis like Regression, forecasting etc.
I have read the details of the project "Analytic Solver and XLMINER Data mining" and interested in it.
You can use below link to view my profile.
https://www.freelancer.com/u/sajidjasi
Please contact me to discuss more about this project.
Thanks