Stat 1000: Tips for Assignment 1

Published: Wed, 09/17/14

Did you read my tips on how to study and learn Stat 1000?  If not, here is a link to those important suggestions:
Did you read my Calculator Tips?  If not, here is a link to those important suggestions:
Tips for Assignment 1
Study Lesson 1 in my study book (if you have it) to learn the concepts involved in Assignment 1.  Remember my advice in the tips above.  Don't start working on the assignment too soon.  Study and learn the lesson first, and use the assignment to test your knowledge.  Of course, always seek out assistance from my book, your course notes, etc. if you ever hit a question you don't understand, but try not to be learning things as you do an assignment.  Learn first, then put your learning to the test.

Exception: Always do any JMP stuff open-book.  Have my tips in front of you, and let me guide you step-by-step through any JMP stuff.  JMP is just "busy" work.  The sooner you get it done and can move on to productive things like understanding the concepts and interpreting the JMP outputs, the better off you will be.
Don't have my book?  You can download a free sample of my book and audio lectures containing Lesson 1:
A Warning about StatsPortal
Make sure that you are using Firefox for your browser.  Do not use Internet Explorer; it has some glitches in the HTML editor boxes.

Do note that every time you exit a question in StatsPortal, the next time you return to it, the data may very well change.  Do not press the "back-up" button on your browser in a question.  That, too, will change the data.  When you are prepared to actually do a question, open the link, keep it open, and do not close it until you have submitted your answers.  Be sure to press "Save Answers" once you have done any calculations and entered any information to ensure the data does not change and force you to start over again.

After you submit the answer to a question, if you have been marked wrong on any parts, be sure that you write down the correct answers before you exit the screen (or grab a screen shot).  To try a second attempt at the question, do not click the link to the question again.  That will change the data and you will have to start all over again.  Also, DO NOT click "try again" or make a "second attempt."  That will also reset the data.

Instead, exit back to the home screen where they show the links for all the different questions on the assignment.  Where it shows the tries for a question on the right side of your screen, you should see the "1" grayed out, showing that you have had 1 attempt.  Click the number "2" to get your second attempt with the same data.  That way you can enter the answers you already know are correct and focus on correcting your mistakes.

You should also have already downloaded the JMP statistical software which was provided with either one of the course options for StatsPortal as mentioned in your course outline.

Make sure you have gone through Assignment 0 completely to learn how to use the interface.  I also suggest you print out a copy of question 8 in Assignment 0 (Long Answer Questions - Part 3) so that you have the steps for saving and uploading files into the HTML editor in front of you.
Question 1: Classifying Variables
This is a standard question about classifying variables, similar to my question 1 in Lesson 1.
Question 2: Frequency Distribution - Batting Averages
Remember, if you find the total of the second column (the frequency or count column) in a frequency table, that will tell you n, the sample size.

This deals with some aspects of quantitative distributions.  Note that part (a) wants a decimal, not a percent.  For example, if you figured out that 20 out of 30 are in the given interval, then 20 divided by 30 is 0.6667, not 66.6667%.  The proportion is 0.6667.  Make sure you round off correctly!  They want four decimal places, so if the fifth decimal place is 5 or more, round up.
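
If it helps to see that arithmetic spelled out, here is a minimal sketch in Python.  The interval labels and counts are made up for illustration (they are not the assignment data); the only ideas taken from the tip above are that n is the total of the count column and that the proportion is reported as a decimal rounded to four places.

```python
# Hypothetical frequency table: (interval label, count). Not the assignment data.
freq_table = [
    ("0.200 to 0.249", 4),
    ("0.250 to 0.299", 20),
    ("0.300 to 0.349", 6),
]

# The total of the count column is the sample size n.
n = sum(count for _, count in freq_table)

# Count in the interval of interest (here, the middle interval).
in_interval = freq_table[1][1]

# Report the proportion as a decimal rounded to four places, not a percent.
proportion = round(in_interval / n, 4)

print(n, proportion)
```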

Part (b):  Remember that a frequency table is a precursor to a histogram.  Visualize the histogram (don't actually make a histogram, just picture it in your mind) to help answer the questions. 

Part (c):  You cannot actually compute the median, mean or quartiles because you do not have the actual data.  You don't need to.  As I discussed in Lesson 1, the shape of the distribution is enough to know if the mean is larger, smaller or the same as the median. 

Part (d):  You do know the sample size, n, (the total count in the Frequency column), so you can use the steps I teach in Lesson 1 to find the location of any quartile.  Then just make a running total of the counts in the intervals.  How much data is in the first interval? (The count or frequency as given in the second column.)  Now add the count in the second interval.  For example, if there are 3 scores in the first interval and 7 scores in the second interval, that means there are 3+7=10 scores in total in the first two intervals.  Those must be the 10 lowest scores in the data set.  Continue adding the frequencies in each interval until you reach or exceed the count you are looking for that marks the location of the first, second or third quartile as desired.
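
The running-total idea can be sketched in a few lines of Python.  Everything here is illustrative: the counts are made up, the function name interval_containing is hypothetical, and the position you search for should come from the quartile-location steps in Lesson 1.  The sketch only uses the standard median location, (n + 1) / 2, as an example position.

```python
from itertools import accumulate


def interval_containing(position, counts):
    """Return the 0-based index of the first interval whose running total
    reaches or exceeds the given position in the ordered data."""
    for index, running_total in enumerate(accumulate(counts)):
        if running_total >= position:
            return index
    raise ValueError("position exceeds the sample size")


# Hypothetical frequencies for five intervals (not the assignment data).
counts = [3, 7, 12, 6, 2]
n = sum(counts)                    # 30 observations in total
median_position = (n + 1) / 2      # 15.5, i.e. between the 15th and 16th values

# Running totals are 3, 10, 22, 28, 30, so position 15.5 is reached in the third interval.
print(interval_containing(median_position, counts))
```

One caution: if a position such as 10.5 falls between the last value of one interval and the first value of the next, this sketch reports the later interval, so check both neighbouring positions by hand in that case.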
Question 3: Boxplots - Speeding Tickets
Make sure you compute the five-number summary by hand, as I demonstrate in Lesson 1, question 4.  Of course, as I also teach in that same question, you will use the 1.5 IQR rule to establish the outliers.
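
If you want to check your hand calculation afterwards, here is a rough sketch in Python.  It assumes one common by-hand convention for quartiles (Q1 is the median of the values below the median's position, Q3 is the median of the values above it); if Lesson 1 uses a different convention, follow the lesson.  The data values and function names are made up for illustration.

```python
from statistics import median


def five_number_summary(data):
    """Min, Q1, median, Q3, max, with quartiles taken as medians of each half
    (the middle value is left out of both halves when n is odd)."""
    values = sorted(data)
    half = len(values) // 2
    lower = values[:half]       # values below the median position
    upper = values[-half:]      # values above the median position
    return values[0], median(lower), median(values), median(upper), values[-1]


def outlier_fences(q1, q3):
    """Lower and upper fences from the 1.5 IQR rule."""
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr


# Hypothetical data (not the assignment data).
tickets = [1, 2, 2, 3, 3, 4, 5, 6, 15]

minimum, q1, med, q3, maximum = five_number_summary(tickets)
low_fence, high_fence = outlier_fences(q1, q3)
outliers = [x for x in tickets if x < low_fence or x > high_fence]

print(minimum, q1, med, q3, maximum)
print(low_fence, high_fence, outliers)
```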

NOTE, in part (b), they want you to make the standard boxplot, then comment on the shape you see.  However, don't even think about whether there are outliers or not!  They want you to assume there were no outliers at all (they don't even ask you to think about outliers until the next part).  In other words, do the whiskers make the distribution appear skewed?

Then, in part (e), they want you to make the outlier boxplot which will almost certainly cause you to change your opinion about skewness.  They are trying to show you how it is important to identify outliers first before you comment on the shape of a distribution.
Question 4: Graphical Displays - Stock Prices
Do not use JMP for the stemplot in part (a).  You can just type the stemplot directly into the text box they provide.  Note that you are told to trim the leaves.  That means that you cut away the last digit (don't round off, just cut it off as though it was never there in the first place).  For example, 21.47 would be trimmed to 21.4, not rounded off to 21.5.
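
To make the difference between trimming and rounding concrete, here is a small sketch in Python using the 21.47 example above.  The helper name trim is hypothetical, and the price is passed as text so that no floating-point surprises creep in.

```python
from decimal import Decimal, ROUND_DOWN, ROUND_HALF_UP


def trim(value, places="0.1"):
    """Cut off extra digits without rounding (truncate toward zero)."""
    return Decimal(value).quantize(Decimal(places), rounding=ROUND_DOWN)


price = "21.47"

trimmed = trim(price)                                                       # digit cut off
rounded = Decimal(price).quantize(Decimal("0.1"), rounding=ROUND_HALF_UP)   # ordinary rounding

# Split the trimmed value into a stem and a one-digit leaf.
stem, leaf = str(trimmed).split(".")

print(trimmed, rounded, f"{stem} | {leaf}")
```

The trimmed value is the one that goes into the stemplot; the rounded value is shown only for contrast.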

I suggest you make the split stemplot on paper first, then transfer it to the box.  Make sure you click "HTML Editor" below the box before you type anything in.

Use the vertical line on your computer keyboard to separate the stem from the leaves ("SHIFT \" will give you " | ").  Be sure to label the first line in your stemplot "Stem | Leaf", then enter all the stems and leaves row-by-row underneath.  It will be pretty difficult to make the columns line up nicely when you type the stemplot into the HTML editor, so just do the best you can.  I don't think you should waste much time trying to make it look pretty.

Be sure to include a note explaining that the leaves are a decimal.

If you prefer, you could make the stemplot in Word by inserting a table with 2 columns to represent the Stem and Leaf columns, then copy the table you made and insert it into the HTML editor box, by selecting Edit on the HTML editor toolbar and clicking Paste from Word.  Personally, I wouldn't bother to do this.

To make the histograms in part (b):
First, enter the data into JMP manually: Click the "New Data Table" icon on the toolbar at top left in the JMP home screen (or select "File" in the toolbar, then New, then Data Table).  You are automatically taken to an empty spreadsheet with one column. Double-click "Column 1" and change its name to "Price", or right-click "Column 1" and select "Column Info" and type in the name "Price" and click OK. 

DO NOT MAKE A COLUMN CALLED Day.  You have no need for that data; you need only enter the data for Price.  But be sure that you enter the price data in the exact same order it was given in order to get the correct time series later.

Make sure the column properties are correct!  When you are naming the column, check what it says for Data Type and Modeling Type.  The Data Type should be numeric and the Modeling Type should be continuous.  If it is not, click the drop-down lists to change them.

Now enter the data you have been given into the column.  Note you can use your arrow buttons or TAB button to move from one cell to the next as you enter your data.

Once you have entered all the data down your column, you are ready to make your histogram.  In the toolbar at the top, select Analyze then select Distribution.  In the "Select Columns" part of the pop-up window, select Price to highlight it, and click the Y, Columns button.  You should see the column name appear in the section to the right of the "Y, Columns" button.  Click OK.

It now opens yet another pop-up window called "Distributions" where your histogram should appear.  Your histogram is sideways, but they want to see it the typical way.

Click the red triangle next to Price above the histogram and select Display Options from the drop-down menu.  Select Horizontal Layout to turn the histogram the way we want.
 

Now, move your cursor to the region just below the numbers on the horizontal axis of the histogram and double-click to get a pop-up menu called X Axis Specification at the top.  As they instruct, type 18.8 in the Minimum box and 22.2 in the Maximum box, 0.5 in the Increments box, and, finally, 0 in the # Minor Ticks box.  Click OK.

If you want to hide all the other parts of the output (but they said you don't have to), click that same red triangle again and deselect "Outlier Box Plot" and anything else that has a check mark next to it.  Click the red triangle again, select "Display Options" and deselect "Quantiles" and "Summary Statistics" to make those parts disappear.  Alternatively, you can make the Quantiles and Summary Statistics disappear if you simply click the gray triangles (to the left of the red triangles) next to their title bars.  Click the gray triangles again to make them reappear.

You are now ready to insert your histograms into the HTML editor box:
  • If you are using Windows:
  • Press "Alt" on your keyboard or click the thin blue line that is near the top of the window to get the toolbar icons to appear.  Select "File" then "Save As" to get a pop-up window.  Type in whatever name you want the file to have in the "File name" section. Click the "Browse Folders" arrow and select which folder you want to save the file in (I suggest you select "Desktop" so that the file will just appear right on your desktop home screen).  Finally, click the drop down arrow in the "Save as type" section and select "JPEG File".  Click "Save".  You should now have your file ready to upload into the assignment.
  • To upload your file into the text box they provide: Click "HTML editor" below the text box (if you have not already done so) to make a toolbar appear in the text box.  Click the toolbar option called "Link" and select "Image."  In the pop-up window that appears, click the button called "Find/Upload File" (it is at the bottom of the pop-up window, you may have to enlarge the box or scroll down to see it).  Click the "Browse" button and find the histogram file you just saved.  Either double-click that file or select it and click "Open" and you should see the path to that file appear in the Browse box.  Click "Upload File" and its name should appear in the "Uploaded Files" pop-up window.  Select the file in the list of "Uploaded Files" to highlight it and click OK and you should see the file appear in the text box. 

  • If you are using Apple/Mac:
  • You will need to take a screen shot of your output in order to upload it.  To take a screen shot, hold down Command+Shift+4 and drag the cross-hairs over the image to capture it.  The image will save as a .png file to your desktop by default.
  • To upload your file into the text box they provide: Click "HTML editor" below the text box (if you have not already done so) to make a toolbar appear in the text box.  Click the toolbar option called "Link" and select "Image."  In the pop-up window that appears, click the button called "Find/Upload File" (it is at the bottom of the pop-up window, you may have to enlarge the box or scroll down to see it).  Click the "Browse" button and find the histogram file you just saved.  Either double-click that file or select it and click "Open" and you should see the path to that file appear in the Browse box.  Click "Upload File" and its name should appear in the "Uploaded Files" pop-up window.  Select the file in the list of "Uploaded Files" to highlight it and click OK and you should see the file appear in the text box.

To make the Time Series in part (c):

DON'T FORGET TO COMMENT ON THE TREND YOU SEE IN THE TIME SERIES IN THE HTML EDITOR BOX AFTER YOU HAVE CONSTRUCTED AND UPLOADED IT!

Return to the window that has your spreadsheet with the Price data entered in.

You are now ready to make the time series.  Select Analyze in the toolbar, then select Modeling in the drop-down list and finally select Time Series.  Select the variable you are tracking, Price, and click "Y, Time Series".  Click OK.  Another pop-up menu will ask about time lags or autocorrelations.  They suggest you enter 2 for the autocorrelation lags, so go ahead, then click OK and move on.  None of that has anything to do with the time series.

You should now be looking at your Time Series with "Row" on the horizontal axis and Price on the vertical axis.  Click the red triangle next to "Time Series Price" and deselect "Autocorrelation" and "Partial Autocorrelation" to remove those parts of the output.  Click the red triangle again, select "Graph" then deselect "Mean Line".  That removes the horizontal line in your time series showing the mean price.

  • If you are using Windows:
  • Press "Alt" on your keyboard or click the thin blue line that is near the top of the window to get the toolbar icons to appear.  Select "File" then "Save As" to get a pop-up window.  Type in whatever name you want the file to have in the "File name" section. Click the "Browse Folders" arrow and select which folder you want to save the file in (I suggest you select "Desktop" so that the file will just appear right on your desktop home screen).  Finally, click the drop down arrow in the "Save as type" section and select "JPEG File".  Click "Save".  You should now have your file ready to upload into the assignment.
  • To upload your file into the text box they provide: Click "HTML editor" below the text box (if you have not already done so) to make a toolbar appear in the text box.  Click the toolbar option called "Link" and select "Image."  In the pop-up window that appears, click the button called "Find/Upload File" (it is at the bottom of the pop-up window, you may have to enlarge the box or scroll down to see it).  Click the "Browse" button and find the time series file you just saved.  Either double-click that file or select it and click "Open" and you should see the path to that file appear in the Browse box.  Click "Upload File" and its name should appear in the "Uploaded Files" pop-up window.  Select the file in the list of "Uploaded Files" to highlight it and click OK and you should see the file appear in the text box. 

  • If you are using Apple/Mac:
  • You will need to take a screen shot of your output in order to upload it.  To take a screen shot, hold down Command+Shift+4 and drag the cross-hairs over the image to capture it.  The image will save as a .png file to your desktop by default.
  • To upload your file into the text box they provide: Click "HTML editor" below the text box (if you have not already done so) to make a toolbar appear in the text box.  Click the toolbar option called "Link" and select "Image."  In the pop-up window that appears, click the button called "Find/Upload File" (it is at the bottom of the pop-up window, you may have to enlarge the box or scroll down to see it).  Click the "Browse" button and find the time series file you just saved.  Either double-click that file or select it and click "Open" and you should see the path to that file appear in the Browse box.  Click "Upload File" and its name should appear in the "Uploaded Files" pop-up window.  Select the file in the list of "Uploaded Files" to highlight it and click OK and you should see the file appear in the text box.

DON'T FORGET TO COMMENT ON THE TREND YOU SEE IN THE TIME SERIES IN THE HTML EDITOR BOX!

Question 5: Comparing Distributions - CFL vs NFL
To make the side-by-side boxplots in part (a):

Open a "New Data Table" in JMP.

You will make two columns, but not the way you might think. DO NOT put CFL in one column and  NFL in another!

Double-click Column 1 (or right-click and select Column Info) and name it Points Scored.  Type all 20 scores from the CFL first then enter the 17 scores from the NFL data, giving you a total of 37 rows in the first column. Double-click the region to the right of Column 1 at the top to create Column 2 and name that column League.  Type CFL in the first 20 rows of that column (better yet, type it once, copy and paste it into the next 19 rows; that way you ensure it is typed exactly the same in all 20 rows as is necessary).  Then type (or copy and paste) NFL in the remaining rows of column 2.
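
If the stacked layout is hard to picture, here is a tiny sketch in Python of what the finished data table looks like.  The scores are made up and there are only a few of them (your real table will have 20 CFL rows followed by 17 NFL rows); the point is simply that every row holds one score plus the league it belongs to.

```python
# Made-up scores for illustration only (not the assignment data).
cfl_scores = [31, 24, 45]
nfl_scores = [17, 20]

# One row per score: all the CFL rows first, then all the NFL rows.
rows = [(score, "CFL") for score in cfl_scores] + [(score, "NFL") for score in nfl_scores]

print("Points Scored  League")
for points, league in rows:
    print(f"{points:>13}  {league}")
```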

To make the side-by-side boxplots:
Select "Analyze" then "Fit Y By X".  Highlight Points Scored and click "Y, Response".  Highlight League and click "X, Factor".  Click OK.  This should open a pop-up window with a bunch of dots arranged vertically in two columns on a graph for CFL and NFL.  If that does not happen, return to the data table and double-click each column (or right-click and select column info).  The Points Scored column better have Data Type as "numeric" and Modeling Type as "continuous."  Change those settings if not.  The League column better have Data Type as "character" and Modeling Type as "nominal."  Change those settings if not.

Now click the red triangle next to "Oneway Analysis ..." and select "Quantiles."  Your side-by-side box plots should appear on the graph as well as a Quantiles output below that shows you the five-number summary among other things.  Click the red triangle again and select "Display Options" (down near the bottom of the menu), then deselect "Grand Mean" to get rid of the horizontal line in the graph showing the mean of all the scores. 

You are now ready to insert your boxplots into the HTML editor box:
  • If you are using Windows:
  • Press "Alt" on your keyboard or click the thin blue line that is near the top of the window to get the toolbar icons to appear.  Select "File" then "Save As" to get a pop-up window.  Type in whatever name you want the file to have in the "File name" section. Click the "Browse Folders" arrow and select which folder you want to save the file in (I suggest you select "Desktop" so that the file will just appear right on your desktop home screen).  Finally, click the drop down arrow in the "Save as type" section and select "JPEG File".  Click "Save".  You should now have your file ready to upload into the assignment.
  • To upload your file into the text box they provide: Click "HTML editor" below the text box (if you have not already done so) to make a toolbar appear in the text box.  Click the toolbar option called "Link" and select "Image."  In the pop-up window that appears, click the button called "Find/Upload File" (it is at the bottom of the pop-up window, you may have to enlarge the box or scroll down to see it).  Click the "Browse" button and find the boxplots file you just saved.  Either double-click that file or select it and click "Open" and you should see the path to that file appear in the Browse box.  Click "Upload File" and its name should appear in the "Uploaded Files" pop-up window.  Select the file in the list of "Uploaded Files" to highlight it and click OK and you should see the file appear in the text box. 

  • If you are using Apple/Mac:
  • You will need to take a screen shot of your output in order to upload it.  To take a screen shot, hold down Command+Shift+4 and drag the cross-hairs over the image to capture it.  The image will save as a .png file to your desktop by default.
  • To upload your file into the text box they provide: Click "HTML editor" below the text box (if you have not already done so) to make a toolbar appear in the text box.  Click the toolbar option called "Link" and select "Image."  In the pop-up window that appears, click the button called "Find/Upload File" (it is at the bottom of the pop-up window, you may have to enlarge the box or scroll down to see it).  Click the "Browse" button and find the boxplots file you just saved.  Either double-click that file or select it and click "Open" and you should see the path to that file appear in the Browse box.  Click "Upload File" and its name should appear in the "Uploaded Files" pop-up window.  Select the file in the list of "Uploaded Files" to highlight it and click OK and you should see the file appear in the text box.

Type your answers to parts (b) and (c) directly into the box.  Don't forget to click HTML Editor before you type anything in the box.  As they say, it is easy to identify the outliers: they are the dots that extend beyond the whiskers of the side-by-side boxplots.

Question 6: Mean & Std. Dev. - Calling Card
This question should be done by hand (i.e. with your calculator, not with JMP).  Use the Stat Mode on your calculator to compute the Mean and Standard Deviation.  Don't you dare waste your time using the formulas to compute the mean and standard deviation.  That is what your Stat Mode on your calculator is for!

Check the Appendix at the back of my book to learn how to use the Stat Mode on your calculator.  Here is a link to a digital copy of that appendix:

Make sure you round the answers off to 2 decimal places before proceeding to answer the other parts of the question.

Part (c) is introducing a key concept about changing the units in data.  Be sure to read the "Effect of Changing Units on Centre and Spread" section of my book in Lesson 1 and see questions 17 and 18 for examples.  As they say, once you know the mean and standard deviation from parts (a) and (b), you can convert them into the mean and standard deviation for part (c) using the conversion information they gave you at the start.  Here, you have to multiply the scores by a certain amount, then add on the incentive bonus to convert the amount spent to the loyalty points.

Rather than convert each piece of data, you can apply this recipe to the mean and standard deviation of the amount spent to get the mean and standard deviation of the loyalty points.  But do it properly!  Remember, addition or subtraction of a constant has no effect on spread; only multiplication or division does.  Measures of centre, on the other hand, are affected by all of these operations.
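
Here is a quick sketch in Python that checks this shortcut against the long way.  The amounts spent, the multiplier a and the bonus b are all made up for illustration; use the conversion given in your own question.  The sketch only shows the rule at work: the new mean is a times the old mean plus b, while the new standard deviation is a times the old standard deviation (the bonus b plays no part in the spread).

```python
from math import isclose
from statistics import mean, stdev

# Hypothetical data and conversion (not the assignment values).
spent = [10.0, 20.0, 30.0, 40.0]   # amount spent
a, b = 2.0, 50.0                   # points per dollar, bonus points

# The long way: convert every value, then summarize.
points = [a * x + b for x in spent]

# The shortcut: convert the summaries directly.
shortcut_mean = a * mean(spent) + b     # multiplying AND adding both change the centre
shortcut_sd = abs(a) * stdev(spent)     # only the multiplier changes the spread

print(mean(points), shortcut_mean)
print(stdev(points), shortcut_sd)
```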

For part (d), consider this:  Let's say you are taking a course, and your average mark so far is 65.  What will happen to your average if you score higher on the next test?  What if you score lower on the next test?  What would you have to get on the next test to keep your average 65?

For part (e), having decided what that new score must be in part (d), how much does that score deviate from the mean? If that is a larger deviation than the standard deviation you computed earlier, you have increased the overall standard deviation; if it is the same amount of deviation as earlier, you have not changed the standard deviation at all; if it has a smaller deviation, you have decreased your overall standard deviation. 

The closer a value is to the mean, the smaller its deviation from the mean.  Small deviations cause low standard deviations; large deviations cause high standard deviations.

Question 7: Weighted Mean - GPA
First, you need to know the scores attached to each letter grade.  An A+ is 4.5, A is 4, B+ is 3.5, B is 3, etc.

To compute your grade point average:
First, make a new column where you multiply each grade score by the number of credit hours.  For example, if you got a B+ in a 3 credit-hour course, you would multiply 3.5 by 3 to get 10.5 in this new column.  Find the total of this new column and find the total number of credit hours.  Divide the total of the new column by the total number of credit hours to get the GPA.  Put another way, if you got a B+ in a 3 credit-hour course, it is as though you scored 3.5 three separate times.  You could put your calculator in Stat Mode, and enter 3.5 three separate times.  If you got an A in a 6 credit-hour course, you got 4.0 six times.  Enter 4.0 six separate times.  After you have entered all the data, your calculator will tell you the mean (your GPA).

An easy way to think of grade points is to consider the number of credit hours as the frequency of that grade.  Getting an A in a 3 credit-hour course is like getting an A 3 separate times.  Getting a C in a 6 credit-hour course is like getting a C 6 separate times.  It is like finding the average of three A's and six C's.
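
Both ways of thinking about the GPA can be sketched in Python.  The two courses below are just the examples mentioned above (a B+ in a 3 credit-hour course and an A in a 6 credit-hour course), not a real transcript, and the grade-point table lists only the letter grades spelled out earlier.

```python
from math import isclose
from statistics import mean

# Grade points for the letter grades listed above.
grade_points = {"A+": 4.5, "A": 4.0, "B+": 3.5, "B": 3.0}

# Example courses: (letter grade, credit hours).
courses = [("B+", 3), ("A", 6)]

# Method 1: total of (grade points x credit hours) divided by total credit hours.
total_points = sum(grade_points[grade] * hours for grade, hours in courses)   # 10.5 + 24.0
total_hours = sum(hours for _, hours in courses)                              # 3 + 6
gpa = total_points / total_hours

# Method 2: treat credit hours as a frequency and enter each grade that many times.
repeated = [grade_points[grade] for grade, hours in courses for _ in range(hours)]
gpa_by_repeats = mean(repeated)

print(round(gpa, 2), round(gpa_by_repeats, 2))
```

Both methods give the same answer, which is why the Stat Mode approach works.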