Stat 2000 Distance: Assignment 3 Tips (Distance/Online Sections)
Published: Sat, 01/26/13
My tips for Assignment 3 are coming below, but first a couple of announcements.
Please note that my first review seminar for
Stat 2000 will be on Feb. 24 (one week before the midterm exam). Unfortunately, that means the seminar is the weekend at the end of the week-long Midterm Break, but it is out of my hands. This seminar will cover the lessons in Volume 1 of my book.
Please note that I am now taking registrations for my midterm exam
prep seminars. Please click this link for more info and to register, if
you are interested:
Make sure you do: Tips on How to Do Well in Stat 2000
Did you read my Tips on what kind of calculator you should get?
Did you miss my Tips for Assignment 2?
Tips for Assignment 3 (Distance/Online Sections D01, D02, D03, etc.)
Don't have my book? You can download a free sample containing Lesson 3 at my website here:
Again, this assignment focuses on Lessons 1 and 2 in my study book. The difference is that now you are using t instead of z
because σ, the population standard deviation, is not given. You will also need to study the Matched Pairs section of Lesson 4 of my book. Study
up to the end of question 4 in that lesson (the rest of the lesson will
be covered in Assignment 4). If you are using an older edition of my
book, you should find Matched Pairs taught as the last two questions in
Lesson 2 of my book.
Question 1
is standard stuff for a student who has studied my lessons. Be sure to
use the Stat mode in your calculator to work out the mean and standard
deviation. See Appendix A in my book if you don't know how. By LCL and UCL, they mean for you to enter the Lower and Upper limits of the confidence interval, respectively.
Question 2 (bone formation):
Personally, I would
not bother stacking this data like they suggest (good luck even
successfully copying it and pasting it at all). I would merely type the
data in manually.
Open a "New Data
Table" in JMP. Double-click "Column 1" and name it something like
"OC". Make sure the "data type" is numeric and the "modeling type" is
continuous, and click OK. Now type the given data into the column on
the spreadsheet and make sure you don't make a mistake.
If you insist on copying and stacking the data, here's how:
Copy and paste the given data into a New Data Table in JMP. In the
toolbar at the top, select "Tables", then select "Stack". Highlight all
of the columns in the "Select Columns" box and click "Stack Columns"
and click OK. You will now see all of the data stacked into one column
(there will be another column showing all the column names which you can
ignore). Name the column something like "OC" and make sure its Data
Type is Numeric and its Modeling Type is Continuous. Click OK.
To get JMP to make confidence intervals for the mean:
Select "Analyze, Distribution" from the toolbar at
top. Highlight the column you are interested in ("OC" in this case) and
click the "Y, Columns" button. Click OK. You are now taken to a
window showing a histogram and stuff. To get a confidence interval,
click the red triangle next to your column variable directly above the
histogram to get a drop-down list and select "Confidence Interval". In
the pop-up window that appears, select "Other" (even if the level of
confidence you desire is in the list) and type in the level of
confidence you want (in decimal form, so 95% is 0.95). Make sure
"Two-sided" is selected. You are not given a value for sigma in
this question, so make sure the "Use known Sigma" checkbox is not
selected. Click OK. A Confidence Intervals table will appear in your
output screen at the bottom.
Of course, JMP will already have made a histogram for you
while you were getting the confidence interval, so I would use that
graph. If you want the stemplot
instead, click the red triangle and select "Stem and Leaf Plot". When
they ask you to comment on the suitability, remember my discussion in
Lesson 1 just before question 1 about the key sample size values of 15
and 40.
To get JMP to test hypotheses for the mean:
To test a hypothesis, click that
same red triangle you used to make a confidence interval and select
"Test Mean". Type in the value the null hypothesis believes the mean to
be (given in part (d)) and type in the known value of sigma, if you have one (otherwise
leave that value blank). Click OK. A Test Mean = Value table appears
in your output where, among other things, JMP gives you the test
statistic and three probability values. Those three probabilities are
the P-value for the three possible alternative hypotheses. JMP will use
a z statistic if you are given a sigma value to enter or a t statistic
if sigma is unknown.
Prob > |t| is the two-tailed P-value.
Prob > t is the upper-tailed P-value.
Prob < t is the lower-tailed P-value.
To get rid of any outputs you don't want to copy and paste, click the red triangle and deselect the unwanted things.
To copy and paste the parts of a JMP printout you do want,
click ALT or the thin blue line at the top to reveal the JMP toolbar. Select the icon on the JMP toolbar that looks like a fat white plus
sign "+" (the Selection tool). You can then click various parts of the
printout to select the sections you want. Copy and paste into Word or
something like that.
Questions 2 (e) to (g):
To make a column with the logarithms: Double-click on
the empty space next to the last column of data you have to make JMP
create a new column for you. Name it something like "log(OC)".
Double-click that new column heading to get the pop-up menu. Click the
"Column Properties" button and select Formula. Now click the Edit
Formula button. In the formula pop-up screen select "Transcendental" in
the Functions(grouped) menu and then select "Log" in the sub-menu. You
will see Log appear in the section below with a set of brackets around a
red box. Highlight the OC column in the "Table Columns" section of
this screen to make OC appear in that red box. Now click OK a few times
to get back to your data table and you should see your Log(OC) column
filled in with numbers. Each of those numbers is the natural log of the
original OC scores. Which is to say, it is identical to computing the
"ln" of each OC score by pressing the "ln" button on your calculator
(which is right next to the "log" button on your calculator). For
example, if your OC score was 49.9, then log(OC) would be ln(49.9) =
3.91002...
You can now make the confidence interval for "Log(OC)" in the
same way you made the confidence interval for "OC" except using the
"Log(OC)" column, of course.
Once you have found the confidence interval limits for your
Log(OC) scores, you can convert those back to OC limits by simply using
the ex button on your calculator. You get ex by
pressing "2nd F" "ln" or "SHIFT" "ln". For example, if your lower
limit for Log(OC) was 5, then you would press "2nd F" "ln" 5 to compute
e5 = 148.413... to get the corresponding lower limit for your OC score.
Do not think your answers in (g) have to match your answers in (a), but they will be kind of similar, or at least in the same ballpark. Of course, the answers in (e) will look totally different to the answers in (a) and (g).
Why are they doing this? This all
boils down to the reliability of our confidence intervals or hypothesis
tests for means. Remember, our methods are only reliable if the sample
mean is normally distributed. If n < 15, we can only trust our
methods if our population is normal. If n ≥ 15, we can generally trust
our methods even if the population is not normal. If the population is
strongly skewed or has outliers, we should use n ≥ 40. That is why they
are having you make graphs. To get an idea of the possible shape of
the population and therefore the reliability of your methods.
Statisticians sometimes transform the data (by doing logarithms or
something) in order to make a new population that is more normally
distributed than the original population, and so to be able to get more
reliable confidence intervals or hypothesis tests.
Which data do you think will make t more reliable
in your problem? The OC data or the log(OC) data? Which confidence
limits do you think are more reliable?
Question 3:
Make sure you are examining this data correctly!
Again, look at the Matched Pairs section in Lesson 4 of my book. Use your calculator
in Stat mode to work out the mean and standard deviation (see Appendix A in my book how to do this on your calculator).
This question is quite similar to my question 3 in Lesson 4 (or my question 20 in Lesson 2 if you are using an old book).