Stat 1000: Tips for Assignment 4

Published: Fri, 03/16/12


I am still finalizing the date for the Final Exam Seminar for Stat 1000.  Please give me your input.  I am considering either Good Friday, April 6 or Saturday, April 7.
Please click this link for more information about the seminar if you are interested:
Grant's Stat 1000 Exam Prep Seminars 
 
If you ever want to look back over a previous tip I have sent, do note that all my tips can be found in my archive.  Click this link to go straight to my archive:
Grant's Updates Archive
 
Did you miss my Tips on How to Do Well in this Course? Click here
 
Did you miss my Tips for Assignment 3? Click here
 
If you are taking the course by Distance/Online (Sections D01, D02, etc.), click here for my tips for your Assignment 4.
 
If you are taking the course by classroom lecture (Sections A01, A02, etc.), click here for my tips for your Assignment 4.
 
Tips for Assignment 4 (Sections A01, A02, etc.)
 
You will need to study Lesson 5: Introduction to Probability, Lesson 6: The Binomial Distribution and Lesson 7: The Distribution of the Sample Mean in my study book to prepare for this assignment.  (If you have an older edition of my book, Lessons 6 and 7 may be in reverse order, and question 10 in my Binomial Lesson may be found in Lesson 10, question 1, of your older book (the Inference for Proportions lesson) .
 
Question 1:
You should realize that this is a Venn diagram question.  Study that section of Lesson 5.  My question 18 is very similar.
 
Question 2:
Yet another Venn diagram question.  But this one is more like my question 17.  It is not a three-circle Venn because you have not been told the probability of doing all three things.  It is three separate two-circle Venn diagrams.
 
Question 3:
Although you are not told this, we must assume that the result of each game is independent.  Read carefully. Part (a) does not ask for the sample space, just how many outcomes are in the sample space.  You can answer part (b) with a formula (or with a two-way table).  Part (c) is a two-way table problem.
 
Question 4:
Two-way table.  Very similar to my question 4 in Lesson 5.
 
Question 5:
If you are ever asked to decide if a particular situation is binomial or not, remember, to be binomial, four conditions must be satisfied:
(i)  There must be a fixed number of trials, n.
(ii)  Each trial must be independent.
(iii)  Each trial can have only two possible outcomes, success or failure, and the probability of success on each trial must have a constant value, p.
(iv) X, the number of successes, is a discrete random variable where
X = 0, 1, 2, ... n.
Question 6 and 7:
If you are solving a binomial problem, and they ask you to compute a mean and/or standard deviation, read carefully.  Do they want the mean of X? or do they want the mean of p-hat, the sample proportion?  Be sure to study the sections about the Distribution of X and the Distribution of p-hat in my Binomial Distribution lesson (Lesson 6 in my new edition, Lesson 7 in older editions).  Take a look, especially, at question 10 of that lesson as a good run through of these concepts.
 
Questions 8 and 9:
These questions deal with the concepts I teach in Lesson 7.  Especially make sure you have practised questions 3 to 7 before attempting these questions. 
 
Tips for Assignment 4 (Distance/Online Sections D01, D02, etc.)
 
Study Lesson 2: Regression and Correlation in my book, if you have it, to prepare for this assignment. (In older editions of my book this was Lesson 3.)
 
Question 1:
To compute the correlation coefficient by hand, follow my example in Lesson 2, question 1, part (c).  Note, you are not given the means and standard deviations for x and y already, so you are certainly allowed to use the Linear Regression Stat Mode on your calculator to tell you the means and standard deviations of both x and y.  Put your calculator in Linear Regression Stat Mode (see Appendix D of my book).  After you enter all the (x,y) data points, you can ask it for the mean and standard deviation of the x values and the mean and standard deviation of the y values.  For example, Sharps use "RCL 4" to get x-bar and "RCL 7" to get y-bar.  "RCL 5" gives you Sx and "RCL 8" gives you Sy.
 
Even though they tell you to do everything to three decimal places, don't do that.  Record every single decimal place your calculator gives you for each calculation, or else your answers won't be accurate enough.  I suggest you do everything on paper first, then you can type in the results, rounding all of your numbers off to 3 decimal places at that time (even though you actually did the calculations using all the decimal places).  Of course, your calculator actually tells you the value of r, so you can use that as a check.
 
Question 2 is just an algebra question.  They give you three of x, y, a, and b and want you to figure out the missing one.  Sub the givens into the appropriate places of
y = a + bx and solve what is missing.
 
Question 3 is a good run through of the formulas I show you in Lesson 3.
 
Question 4 uses JMP.
Here is how to use JMP for linear regression.  First copy and paste the data into a New Data Table the usual way (see my previous homework tips if you are not sure how to paste the data).  If you have to type the data in manually, simply double-click the space to the right of "Column 1" to create "Column 2".  Enter the X data down column 1 and the Y data down column 2.  Be sure to double-click each column to give it an appropriate name and to ensure the Data Type is Numeric and the Modeling Type is Continuous.
 
Select Analyze, then Fit Y By X.  Highlight the column you have determined should be X, and click the X, Factor button.  Highlight the column you have determined should be Y and click the Y, Response button.  Click OK.
 
You should now see a scatterplot.  Click the red triangle above the scatterplot and select Fit Line and JMP will draw in the least-squares regression line.  Note, it shows you the regression equation directly below the scatterplot.  JMP also shows you the value of r-squared (the coefficient of determination), rather than r, the correlation coefficient.  Remember, the coefficient of determination is the percentage of y's variation explained by the regression equation.  You can always square root this number to get r, the correlation coefficient, but use your scatterplot to help you decide if r is negative or positive because your calculator can't tell you that.
 
If you want to get rid of anything, click the red triangle and deselect anything you don't want to see.  Note, if you click the blue triangle next to something, that will make part of the output disappear as well, if you wish.  Just click the blue triangle again to make it reappear.
http://grantstutoring.com/
http://www.facebook.com/grantstutoring
https://twitter.com/grantstutoring