Exploratory Data Analysis (Part A)
Introduction
Your instructor will provide you with a data file that includes data on five variables:
SALES represents the number of sales made this week.
CALLS represents the number of sales calls made this week.
TIME represents the average time per call this week.
YEARS represents years of experience in the call center.
TYPE represents the type of training the employee received.
Part A: Exploratory Data Analysis
Preparation
Open the files for the course project and the data set.
For each of the five variables, process, organize, present, and summarize the data. Analyze each variable by itself using graphical and numerical techniques of summarization. Use Excel as much as possible, and explain what the results reveal. Some of the following graphs may be helpful: stem-and-leaf diagram, frequency/relative frequency table, histogram, boxplot, dotplot, pie chart, bar graph. Caution: not all of these are appropriate for every variable, nor are they all necessary. More is not necessarily better. In addition, be sure to find the appropriate measures of central tendency, the measures of dispersion, and (for the quantitative variables) the shapes of the distributions. Where appropriate, use the five-number summary (Min, Q1, Median, Q3, Max). Once again, use Excel as appropriate, and explain what the results mean.
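The numerical summaries described above can also be cross-checked outside Excel. The following is a minimal Python sketch, not part of the required Excel workflow. The SALES values are made up for illustration, and the helper name five_number_summary is hypothetical. The "inclusive" quartile method is used because it interpolates the same way as Excel's QUARTILE.INC.

```python
import statistics

# Hypothetical weekly SALES values, made up for illustration only.
sales = [38, 41, 45, 47, 36, 50, 44, 39, 42, 48]

def five_number_summary(values):
    """Return (Min, Q1, Median, Q3, Max) using inclusive quartiles."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return min(values), q1, median, q3, max(values)

summary = five_number_summary(sales)   # five-number summary
mean = statistics.mean(sales)          # measure of central tendency
stdev = statistics.stdev(sales)        # sample standard deviation (dispersion)

print("Min, Q1, Median, Q3, Max:", summary)
print("Mean:", mean, "Sample standard deviation:", round(stdev, 2))
```

A mean close to the median, as in this made-up sample, suggests a roughly symmetric distribution; a mean well above the median would suggest right skew.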
Analyze the connections or relationships between the variables. There are ten possible pairings of two variables. Use graphical as well as numerical summary measures. Explain the results of the analysis. Be sure to consider all 10 pairings. Some variables show clear relationships, while others do not.
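As an optional illustration of the pairing analysis, the sketch below lists the ten pairings, computes a correlation coefficient for one quantitative pairing, and compares group means for one pairing that involves the qualitative variable TYPE. All data values and the training labels are made up for illustration, and the helper names pearson_r and mean_by_group are hypothetical. In Excel, CORREL and a pivot table would serve the same purpose.

```python
from itertools import combinations
from math import sqrt
from statistics import mean

variables = ["SALES", "CALLS", "TIME", "YEARS", "TYPE"]
pairings = list(combinations(variables, 2))  # every unordered pair, 10 in total

def pearson_r(xs, ys):
    """Pearson correlation coefficient for two equal-length numeric lists."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

def mean_by_group(labels, values):
    """Mean of a quantitative variable within each category of a qualitative one."""
    groups = {}
    for label, value in zip(labels, values):
        groups.setdefault(label, []).append(value)
    return {label: mean(vals) for label, vals in groups.items()}

# Hypothetical values and training labels, made up for illustration only.
calls = [120, 135, 150, 160, 145, 170]
sales = [35, 40, 46, 49, 43, 52]
training = ["ONLINE", "NONE", "ONLINE", "GROUP", "NONE", "GROUP"]

r_calls_sales = pearson_r(calls, sales)          # quantitative vs. quantitative
sales_by_type = mean_by_group(training, sales)   # qualitative vs. quantitative

print(len(pairings), "pairings:", pairings)
print("r(CALLS, SALES) =", round(r_calls_sales, 3))
print("Mean SALES by TYPE:", sales_by_type)
```

A scatterplot is the natural graph for the first kind of pairing, and side-by-side boxplots or a bar graph of group means for the second.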
Report Requirements
From the variable analysis above, provide the analysis and interpretation for three individual variables. For each variable, include no more than one graph, one or two measures of central tendency and variability (as appropriate), the shape of the distribution (for quantitative variables), and two or three sentences of interpretation.
For the 10 pairings, identify and report only on three of the pairings, again using graphical and numerical summary (as appropriate), with interpretations. Please note that at least one pairing must include a qualitative variable and at least one pairing must not include a qualitative variable.
Prepare the report in Microsoft Word, integrating graphs and tables with text explanations and interpretations. Be sure to include graphical and numerical back up for the explanations and interpretations. Be selective in what is included in the report to meet the requirements of the report without extraneous information.
All DeVry University policies are in effect, including the plagiarism policy.
Project Part A report is due by the end of Week 2.
Project Part A is worth 100 total points. See grading rubric below.
Submission: The report, including all relevant graphs and numerical analysis along with interpretations
Format for report:
Brief Introduction
Discuss 1st individual variable, using graphical, numerical summary and interpretation
Discuss 2nd individual variable, using graphical, numerical summary and interpretation
Discuss 3rd individual variable, using graphical, numerical summary and interpretation
Discuss 1st pairing of variables, using graphical, numerical summary and interpretation
Discuss 2nd pairing of variables, using graphical, numerical summary and interpretation
Discuss 3rd pairing of variables, using graphical, numerical summary and interpretation
Conclusion
Part A: Grading Rubric
Category | Points | % | Description
Three individual variables (12 points each) | 36 points | 36% | Graphical analysis, numerical analysis (when appropriate), and interpretation
Three relationships (15 points each) | 45 points | 45% | Graphical analysis, numerical analysis (when appropriate), and interpretation
Communication skills | 19 points | 19% | Writing, grammar, clarity, logic, cohesiveness, adherence to the above format
Total | 100 points | 100% | A quality paper will meet or exceed all the above requirements
Part B: Hypothesis Testing and Confidence Intervals
The manager has made the following four speculations. Test each one as a hypothesis, using α = 0.05 for each. The Week 5 spreadsheet can be used in these analyses.
1. Mean sales per week exceed 42.5 per salesperson
2. Proportion receiving online training is less than 55%
3. Mean calls made among those with no training is at least 145
4. Mean time per call is 14.7 minutes
Using the same data set from Part A, perform a hypothesis test for each speculation to see whether there is evidence to support the manager’s belief. Use the Eight Steps of a Test of Hypothesis from Section 9.1 of your textbook as a guide. You can use either the p-value or the critical values to draw conclusions. Be sure to explain your conclusion and relate it to the claim in simple terms.
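To illustrate the mechanics of one such test, here is a minimal Python sketch of a left-tailed one-sample z-test for a proportion, applied to the second speculation (proportion receiving online training is less than 55%). The sample size and count are made up for illustration, and this is not a substitute for the Eight Steps in the textbook or for the Excel work the report requires. The tests on means would follow the same pattern with a different test statistic.

```python
from math import sqrt
from statistics import NormalDist

# Hypothetical sample, made up for illustration only.
n = 100          # employees in the sample
x = 45           # employees who received online training
p0 = 0.55        # hypothesized proportion
alpha = 0.05     # significance level from the assignment

# H0: p >= 0.55   versus   H1: p < 0.55 (the claim), a left-tailed test.
p_hat = x / n
standard_error = sqrt(p0 * (1 - p0) / n)   # uses p0 because H0 is assumed true
z = (p_hat - p0) / standard_error

p_value = NormalDist().cdf(z)              # area in the left tail below z
reject = p_value < alpha

print("z =", round(z, 3), "p-value =", round(p_value, 4))
print("Reject H0" if reject else "Fail to reject H0")
```

With these made-up numbers the p-value falls below 0.05, so the sample would support the claim that fewer than 55% received online training. With the real data the conclusion may differ.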
Compute 99% confidence intervals for the variables used in each hypothesis test, and interpret these intervals.
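A 99% confidence interval for a proportion can be sketched as follows. The counts are again made up for illustration, and the normal approximation is an assumption that is reasonable only when the sample is large enough. Intervals for the means would use the sample standard deviation and, for small samples, a t critical value instead.

```python
from math import sqrt
from statistics import NormalDist

# Hypothetical sample, made up for illustration only.
n = 100            # employees in the sample
x = 45             # employees who received online training
confidence = 0.99

p_hat = x / n
# Two-sided critical value: 0.5% of the area in each tail.
z_crit = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
standard_error = sqrt(p_hat * (1 - p_hat) / n)   # uses p_hat, not a hypothesized value
margin = z_crit * standard_error

lower, upper = p_hat - margin, p_hat + margin
print("99% CI for the proportion:", round(lower, 4), "to", round(upper, 4))
```

In plain terms, an interval like this says that the true proportion is plausibly anywhere between the two endpoints, with 99% confidence in the method that produced it.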
Write a report about the results, distilling down the results in a way that would be understandable to someone who does not know statistics. Clear explanations and interpretations are critical.
All DeVry University policies are in effect, including the plagiarism policy.
Project Part B report is due by the end of Week 6.
Project Part B is worth 100 total points. See grading rubric below.
Format for report:
Summary Report (about one paragraph on each of the four speculations)
Appendix with the calculations of the Eight Elements of a Test of Hypothesis, the p-values, and the confidence intervals. Include the Excel formulas or spreadsheet screenshots used in the calculations.