Paper, Order, or Assignment Requirements
this is the application
http://www.inside-r.org/download
Please copy and paste below in Microsoft word, and answer Q by doing print screen.
you can find adult table in the internet, just download anyone.
Hands – on- analysis:
Using the churn data set, develop EDA which shows that the remaining numeric variables
In the data set (apart from those covered in the text above) indicate no obvious association
with the target variable.
Use the Adult data set from the book series website for the following exercises. The
target variable is income, and the goal is to classify income based on the other variables. Which variables are categorical and which are continuous?
3, Using software, construct a table of the first 10 records of the data set, in order to get a feel for the data.
- Investigate whether we have any correlated variables.
- For each of the categorical variables, construct a bar chart of the variable, with an of the target variable. Normalize if necessary.
- Discuss the relationship, if any, each of these variables has with the target van’
- Which variables would you expect to make a significant appearance in any data classification model we work with?
- For each pair of categorical variables, construct a crosstabulation. Discuss your results.
- (If your software supports this.) Construct a web graph of the categorical variabi tune the graph so that interesting results emerge. Discuss your findings.
- Report on whether anomalous fields exist in this data set, based on your EDA, which these are, and what we should do about it.
- Report the mean, median, minimum, maximum, and standard deviation for each numerical variables.
- Construct a histogram of each numerical variables, with an overlay of the target income. Normalize if necessary. –
- Discuss the relationship, if any, each of these variables has with the target varial
- Which variables would you expect to make a significant appearance in any data mj classification model we work with? I
- For each pair of numerical variables, construct a scatter plot of the variables. Discussj salient results,
- Based on your EDA so far, identify interesting sub-groups of records within the dab that would be worth further investigation.
- Apply binning to one of the numerical variables. Do it in such a way as to maximize effect of the classes thus created (following the suggestions in the text). Now do it in a way as to minimize the effect of the classes, so that the difference between the class diminished. Comment.
- Refer to the previous exercise. Apply the other two binning methods (equal width, equal number of records) to this same variable. Compare the results and discuss differences. Which method do you prefer?
- Summarize your salient EDA findings from the above exercises, just as if you were wri a report.
Is this question part of your Assignment?
We can help
Our aim is to help you get A+ grades on your Coursework.
We handle assignments in a multiplicity of subject areas including Admission Essays, General Essays, Case Studies, Coursework, Dissertations, Editing, Research Papers, and Research proposals
Header Button Label: Get Started NowGet Started Header Button Label: View writing samplesView writing samples