WPC300
Practice Test (Practical)
Instructions
PRACTICE Practical Exam Instructions
This exam has a total of two sections. You are required to use two sets of data to answer all the questions. Please see the individual instructions for each section. The total time allowed for this exam is 75 minutes. You must complete the exam in one sitting. You will need JMP Pro and Excel software to analyze data and answer questions.
Note: You are expected to work individually to complete this exam before the due date. Getting help from outside resources other than those made available to you via the Canvas course site is considered a violation of the code of academic integrity, for which you will be held accountable.
Data background
The sample includes various demographic and blood test responses for 442 diabetes patients (respondents). The response variable Y is a quantitative measure of disease progression one year after baseline measurements were taken. The ten variables measured at baseline time are age, gender (1 = male, 2 = female), body mass index (BMI), average blood pressure (BP), and six blood serum measurements (Total Cholesterol, LDL, HDL, TCH, LTG, & Glucose). The response Y Binary is constructed from the response Y and defined as high if Y is above 200 or low otherwise.
Section A:
Instructions:
- Use the following data file for this section: SampleDiabetes.xlsx
- Remember the honor code.
- Use Excel to prepare your responses to the questions in this section
- Note that sometimes numbers have been rounded.
Create a new column using a vlookup() function to categorize the age variable into age categories as follows:
Age | Category
70+ | 1
60-69 | 2
50-59 | 3
40-49 | 4
30-39 | 5
19-29 | 6
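The categorization above relies on an approximate-match lookup: the table is sorted by the lower bound of each age range, and each age takes the category of the largest lower bound that does not exceed it. In Excel this is typically written along the lines of `=VLOOKUP(A2, $H$2:$I$7, 2, TRUE)`, where the cell references are assumptions about the sheet layout, not something given in the exam. The same logic can be sketched in Python, using hypothetical ages and the names `AGE_BREAKS`, `CATEGORIES`, and `age_category`, which are introduced here only for illustration:

```python
from bisect import bisect_right

# Lookup table sorted by ascending lower bound, as an approximate-match
# VLOOKUP requires. Bounds and categories come from the table above.
AGE_BREAKS = [19, 30, 40, 50, 60, 70]
CATEGORIES = [6, 5, 4, 3, 2, 1]


def age_category(age):
    """Return the category of the largest lower bound not exceeding age."""
    idx = bisect_right(AGE_BREAKS, age) - 1
    if idx < 0:
        raise ValueError(f"age {age} is below the smallest lower bound")
    return CATEGORIES[idx]


# Hypothetical ages, not taken from SampleDiabetes.xlsx.
sample_ages = [19, 29, 30, 45, 59, 60, 72]
categories = [age_category(a) for a in sample_ages]
print(categories)  # [6, 6, 5, 4, 3, 2, 1]
```

Sorting the breakpoints in ascending order matters: an approximate-match lookup on an unsorted table can silently return the wrong category.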
Question 1
1 / 1 pts
Using a pivot table, determine which of the following statements is incorrect.
Question 2
1 / 1 pts
Using a pivot table, determine which of the following statements is incorrect about the average age of respondents in each age category.
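A pivot table with the age category in the rows and "Count of Age" or "Average of Age" in the values area is a group-by summary. As a rough cross-check of what the pivot table should report, the same summary can be sketched in Python. The rows below are hypothetical and the function name `pivot_age_by_category` is an assumption made for this sketch:

```python
from collections import defaultdict
from statistics import mean

# Hypothetical (age, category) rows; the real values live in SampleDiabetes.xlsx.
rows = [(72, 1), (65, 2), (61, 2), (55, 3), (48, 4), (44, 4), (33, 5), (25, 6)]


def pivot_age_by_category(rows):
    """Group ages by category and return {category: (count, average age)}."""
    groups = defaultdict(list)
    for age, category in rows:
        groups[category].append(age)
    return {cat: (len(ages), mean(ages)) for cat, ages in sorted(groups.items())}


summary = pivot_age_by_category(rows)
for cat, (count, avg) in summary.items():
    print(f"category {cat}: count={count}, average age={avg}")
```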
Question 3
1 / 1 pts
Create a pivot table pie chart for people aged 40 or older, using the same age categories as before, and determine which of the following statements is correct.
Section B:
Instructions:
- Use the following JMP data file for this section [Diabetes.JMP]
- Remember the honor code.
- Use JMP Pro to prepare your responses to the questions in this section
- Note that sometimes numbers have been rounded.
Question 4
1 / 1 pts
Which of the following statements is not correct based on the sample data provided?
Question 5
1 / 1 pts
Looking at the distribution of BMI, you observe that the data centrality is measured as:
Question 6
1 / 1 pts
Looking at the distribution of Glucose, you observe that the distribution spread is measured as:
Question 7
1 / 1 pts
It is generally believed that the average population age is 50. You claim that the population average age is less than 50. Perform a statistical test on the sample to see if the average age for the sample is consistent with your hypothesis (use a significance level of 5%). What is the p-value from the test?
Question 8
1 / 1 pts
It is generally believed that the average population age is 50. You claim that the population average age is more than 50. Perform a statistical test on the sample to see if the average age for the sample is consistent with your hypothesis (use a significance level of 5%). What can you conclude?
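Questions 7 and 8 are the two one-sided versions of the same one-sample test of the mean against 50: the claim "less than 50" uses the lower-tail p-value and the claim "more than 50" uses the upper-tail p-value, and the two add up to 1. The sketch below illustrates this with hypothetical ages. It approximates the t distribution with the standard normal, which is an assumption that is reasonable only for large samples such as the 442 respondents here, so its p-values will not match JMP's exact t-based output. The name `one_sample_t` is introduced for illustration only.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev


def one_sample_t(sample, mu0):
    """Return (t statistic, lower-tail p, upper-tail p) for H0: mean == mu0.

    The p-values use a standard normal approximation to the t distribution
    (an assumption of this sketch, adequate only for large samples).
    """
    n = len(sample)
    t = (mean(sample) - mu0) / (stdev(sample) / sqrt(n))
    p_lower = NormalDist().cdf(t)  # evidence for the claim "mean < mu0"
    p_upper = 1.0 - p_lower        # evidence for the claim "mean > mu0"
    return t, p_lower, p_upper


# Hypothetical ages, far fewer than the real sample.
ages = [48, 52, 45, 50, 47, 49, 51, 46]
t_stat, p_less, p_greater = one_sample_t(ages, 50)
print(f"t = {t_stat:.3f}, p(less) = {p_less:.4f}, p(greater) = {p_greater:.4f}")
```

A claim is supported at the 5% level only when the p-value for its own tail is below 0.05, so at most one of the two one-sided claims can be supported by a given sample.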
Question 9
1 / 1 pts
A pairwise correlation analysis of the variables Y, age, BMI, BP, Total Cholesterol, LDL, HDL, TCH, LTG, & Glucose in the sample suggests that:
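A pairwise correlation analysis computes the Pearson correlation coefficient for every pair of variables. The sketch below shows the calculation on three hypothetical columns. The values are made up, and the helper names `pearson_r` and `correlation_matrix` are assumptions for illustration, not part of JMP.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def correlation_matrix(columns):
    """Return {(name_a, name_b): r} for every ordered pair of columns."""
    names = list(columns)
    return {(a, b): pearson_r(columns[a], columns[b]) for a in names for b in names}


# Hypothetical values, not taken from the Diabetes data file.
columns = {
    "BMI": [22.0, 25.0, 31.0, 28.0, 35.0],
    "BP": [80.0, 85.0, 95.0, 90.0, 101.0],
    "Y": [90.0, 120.0, 210.0, 150.0, 260.0],
}
corr = correlation_matrix(columns)
for (a, b), r in corr.items():
    if a < b:  # print each unordered pair once
        print(f"{a} vs {b}: r = {r:.3f}")
```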
Question 10
1 / 1 pts
If we are interested in determining a possible cause and effect relationship where BMI and Age are causing disease progression (Y), _____ is the independent variable and ____ is the dependent variable?
Question 11
1 / 1 pts
Perform a simple linear regression to predict Y using respondents’ BMI. What is the correct equation for the regression line?
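The regression line has the form Y = intercept + slope × BMI, where the slope and intercept are the least-squares estimates. A minimal sketch of that calculation follows, using hypothetical and deliberately simple data, so the resulting coefficients say nothing about the actual exam answer. The name `fit_line` is introduced here for illustration.

```python
def fit_line(x, y):
    """Least-squares fit of y = intercept + slope * x; returns (intercept, slope)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sxy / sxx
    intercept = my - slope * mx
    return intercept, slope


# Hypothetical BMI and Y values chosen to lie exactly on a line.
bmi = [20.0, 25.0, 30.0, 35.0]
y = [100.0, 150.0, 200.0, 250.0]
intercept, slope = fit_line(bmi, y)
print(f"Y = {intercept} + {slope} * BMI")
```

When reading the equation off software output, the intercept is the constant term and the slope is the coefficient reported next to BMI.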
Question 12
1 / 1 pts
Perform a multiple regression analysis (with a significance level of 5%) that examines all of the variables in the sample (excluding Y Binary) as potential predictors of Y. Which of the following conclusions can be made based on the analysis without removing any of the predictor variables?
Question 13
1 / 1 pts
After performing model building by applying backward deletion to the model described in Q12, which of the following conclusions is valid based on the final model?
Question 14
1 / 1 pts
Based on the final model developed in Q13, which is the strongest predictor in the model?
Question 15
1 / 1 pts
Based on the final model developed in Q13, which is the weakest predictor in the model?
Question 16
1 / 1 pts
How much of the variation in the dependent variable can be explained by the final regression model developed in Q13?
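The share of variation explained is the coefficient of determination, R² = 1 − SSE/SST, where SSE is the sum of squared residuals and SST is the total sum of squares around the mean of Y. The sketch below computes it for hypothetical observed and fitted values. The function name `r_squared` is an assumption for illustration, and the numbers are not from the exam data.

```python
def r_squared(observed, fitted):
    """Coefficient of determination: 1 - SSE / SST."""
    mean_y = sum(observed) / len(observed)
    sse = sum((o - f) ** 2 for o, f in zip(observed, fitted))
    sst = sum((o - mean_y) ** 2 for o in observed)
    return 1.0 - sse / sst


# Hypothetical observed Y values and model predictions.
observed = [100.0, 150.0, 200.0, 250.0]
fitted = [110.0, 140.0, 210.0, 240.0]
r2 = r_squared(observed, fitted)
print(f"R^2 = {r2:.3f}")  # R^2 = 0.968
```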
Question 17
1 / 1 pts
Is there a multicollinearity concern for the final model developed in Q13?
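Multicollinearity is commonly assessed with the variance inflation factor (VIF) of each predictor. For predictor $j$, with $R_j^2$ denoting the R² obtained when that predictor is regressed on all the other predictors in the model:

```latex
\mathrm{VIF}_j = \frac{1}{1 - R_j^2}
```

A VIF of 1 means the predictor is uncorrelated with the others, and the value grows without bound as $R_j^2$ approaches 1. Cutoffs such as 5 or 10 are common rules of thumb; which threshold this course uses is not stated here, so treat any specific cutoff as an assumption.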
Question 18
1 / 1 pts
The Y Binary variable was developed to categorize respondents into high and low development of diabetes over the year since their baseline measurements were taken. What proportion of high development respondents are female?
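This question asks for a conditional proportion: the denominator is the number of high-development respondents, not the number of females or the whole sample. The sketch below shows the calculation on hypothetical rows, using the definitions from the data background (high if Y is above 200; gender 1 = male, 2 = female). The helper names `y_binary` and `share_female_among_high` are assumptions for illustration.

```python
# Hypothetical (Y, gender) rows; gender is coded 1 = male, 2 = female.
rows = [(250, 2), (180, 1), (210, 1), (90, 2), (305, 2), (200, 1)]


def y_binary(y, cutoff=200):
    """Label a response 'high' if it is strictly above the cutoff, else 'low'."""
    return "high" if y > cutoff else "low"


def share_female_among_high(rows):
    """Proportion of high-development respondents whose gender code is 2."""
    high_genders = [gender for y, gender in rows if y_binary(y) == "high"]
    if not high_genders:
        raise ValueError("no high-development respondents in the data")
    return sum(1 for g in high_genders if g == 2) / len(high_genders)


share = share_female_among_high(rows)
print(f"{share:.3f}")  # 0.667
```

Note that Y = 200 itself is classed as low, because the definition requires Y to be above 200.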
Question 19
1 / 1 pts
In an initial logistic regression analysis attempting to establish if all of the variables (excluding Y) in the sample can predict (with a significance level of 5%) the level (high/low) of the disease, it can be concluded that:
Question 20
1 / 1 pts
In the final logistic regression model to predict/classify Y binary, which of the following statements is true: