Follow:
|
Did you read my tips on how to study and learn Stat 1000? If not, here is a link to those important suggestions: Did you read my Calculator Tips? If not, here is a
link to those important suggestions: Study Lesson 1 in my study book (if you have it) to learn the concepts involved in Assignment 1. Remember my advice in the tips above. Don't start working on the assignment too soon. Study and learn the lesson first, and use the assignment to test
your knowledge. Of course, always seek out assistance from my book, your course notes, etc. if you ever hit a question you don't understand, but try not to be learning things as you do an assignment. Learn first, then put your learning to the test. Exception: Always do any JMP stuff open-book. Have my tips in front of you, and let me guide you step-by-step through any JMP stuff. JMP is just "busy" work. The sooner you get it done and can move on to
productive things like understanding the concepts and interpreting the JMP outputs, the better off you will be.
Don't have my book or audio lectures? You can download a free sample of my book and audio lectures containing Lesson 1:
A Warning about StatsPortal
|
Make sure that you are using Firefox for your browser. Don't even use Internet Explorer. It actually also has some glitches in the HTML editor boxes.
Do note that every time you exit a question in StatsPortal, the next time you return to it, the data may very well
change. Do not press the "back-up" button on your browser in a question. That, too, will change the data. When you are prepared to actually do a question, open the link, keep it open, and do not close it until you have submitted your answers. Be sure to press "Save Answers" once you have done any calculations and entered any information to ensure the data does not change and force you to start over again.
After you submit the answer to a question, if you
have been marked wrong on any parts, be sure that you write down the correct answers before you exit the screen (or grab a screen shot). To try a second attempt at the question do not click the link to the question again, that will change the data and you will have to start all over again. Also, DO NOT click "try again" or make a "second attempt." That will also reset the data.
Instead, exit back to the home screen where they show the links for
all the different questions on the assignment. Where it shows the tries for a question on the right side of your screen, you should see the "1" grayed out, showing that you have had 1 attempt. Click the number "2" to get your second attempt with the same data. That way you can enter the answers you already know are correct and focus on correcting your mistakes.
You should also have already downloaded the JMP statistical software which was provided with
either one of the course options for StatsPortal as mentioned in your course outline.
Make sure you have gone through Assignment 0 completely to learn how to use the interface. I also suggest you print out a copy of question 8 in Assignment 0 (Long Answer Questions - Part 3) so that you have the steps for saving and uploading files into the HTML editor in front of you.
Question 1: Classifying Variables -- Horse Racing
|
This is a standard question about classifying variables, similar to my question 1 in Lesson 1.
Question 2: Frequency Distribution -- Storm Duration
|
Remember, if you find the total of the second column (the frequency or count column) in a frequency table, that will tell you n, the sample size.
This deals with some aspects of quantitative distributions.
Note
that part (a) wants a decimal, not a percent. For example, if you figured out that 20 out of 30 are in the given interval, then 20 divide by 30 is 0.6667, not 66.6667%. The proportion is 0.6667. Make sure you round off correctly! They want four decimal places, so if the fifth decimal place is 5 or more, round up.
A relative frequency or proportion is the relevant count divided
by n, the total sample size. Always leave your answer in decimal form, do not change it into a percent.
Part (b): Remember that a frequency table is a precursor to a histogram. Visualize the histogram (don't actually make a histogram, just picture it in your mind) to help answer the question about shape.
Part (c): You cannot actually compute the median, mean or quartiles because you do not have the actual
data. You don't need to. As I discussed in Lesson 1, the shape of the distribution is enough to know if the mean is larger, smaller or the same as the median.
Part (d): You do know the sample size, n, (the total count in the Frequency column), so you can use the steps I teach in Lesson 1 to find the location of any quartile . Then just make a running total of the counts in the intervals. How much data is in the first interval? (The
count or frequency as given in the second column.) Now add the count in the second interval (for example if there are 3 scores in the first interval, and 7 scores in the second interval, that means there are 3+7=10 scores in total in the first two intervals. Those must be the 10 lowest scores in the data set. Continue adding the frequencies in each interval until you reach or exceed the count you are looking for that marks the location of the first, second or third quartile as
desired.
Question 3: Graphical Displays -- Energy Consumption
|
Do not use JMP for the stemplot in part (a). You can just type the stemplot directly into the text box they provide. Note that you are told to trim the leaves. That means that
you cut away the last digit (don't round off, just cut it off as though it was never there in the first place). For example, 2147 would be trimmed to 214, not rounded off to 215.
I suggest you make the split stemplot on paper first, then transfer it to the box. Make sure you click "HTML Editor" below the box before you type anything in.
Use the vertical line
on your computer keyboard to separate the stem from the leaves ("SHIFT \" will give you " | "). Don't worry if your columns don't end up perfectly lined up, just do the best you can. Be sure to label the first line in your stemplot "Stem | Leaf", then enter all the stems and leaves row-by-row underneath. It will be pretty difficult to make the data line up nicely when you type the stemplot into the HTML editor, but I don't think you should waste much time
trying to make it look pretty.
If you prefer, you could make the stemplot in Word by inserting a table with 2 columns to represent the Stem and Leaf columns, then copy the table you made and insert it into the HTML editor box, by selecting Edit on the HTML editor toolbar and clicking Paste from Word. Personally, I wouldn't bother to do this.
Part (b),
although you should have a pretty good idea what the shape is after you have made the split stemplot, wait until you have made your histogram below and use it to confirm your opinion. Both graphs should tell you the same thing about the shape of the distribution (although one may do a better job of revealing it than the other).
To make the histogram in part (c): First, enter the data into JMP
manually: Click the "New Data Table" icon on the toolbar at top left in the JMP home screen (or select "File" in the toolbar, then New, then Data Table). You are automatically taken to an empty spreadsheet with one column. Double-click "Column 1" and change its name to Energy, or right-click "Column 1" and select "Column Info" and type in the name Energy and click OK.
Now enter the data you have been given into the
column. Note you can use your arrow buttons or TAB button to move from one cell to the next as you enter your data.
Once you have entered all the data down your column, you are ready to make your histogram. In the toolbar at the top, select Analyze then select Distribution. In the "Select Columns" part of the pop-up window, select Energy to highlight it, and click the Y, Columns
button. You should see the column name appear in the section to the right of the "Y, Columns" button. Click OK.
It now opens yet another pop-up window called "Distributions" where your histogram should appear. Your histogram is sideways.
If you would like to orient the histogram the traditional way (but they don't insist you do this), click the red triangle next to Energy above the histogram and select Display Options from the
drop-down menu. Select Horizontal Layout to turn the histrogram the way we want.
If you want to hide all the other parts of the output (but they said you don't have to), click that same red triangle again and deselect "Outlier Box Plot" and anything else that has a check mark next to it. Click the red triangle again, select "Display Options" and deselect "Quantiles" and "Summary Statistics" to make those parts
disappear. Alternatively, you can make the Quantiles and Summary Statistics disappear if you simply click the gray triangles (to the left of the red triangles) next to their title bars. Click the gray triangles again to make them reappear.
You are now ready to insert your histogram into the HTML editor box:
- If you are using Windows:
- Press "Alt" on your keyboard or click the thin blue line that
is near the top of the window to get the toolbar icons to appear. Select "File" then "Save As" to get a pop-up window. Type in whatever name you want the file to have in the "File name" section. Click the "Browse Folders" arrow and select which folder you want to save the file in (I suggest you select "Desktop" so that the file will just appear right on your desktop home screen). Finally, click the drop down arrow in the "Save as type"
section and select "JPEG File". Click "Save". You should now have your file ready to upload into the assignment.
- To upload your file into the text box they provide: Click "HTML editor" below the text box (if you have not already done so) to make a toolbar appear in the text box. Click the toolbar option called "Link" and select "Image." In the pop-up window that appears, click the button called "Find/Upload File" (it is at the bottom
of the pop-up window, you may have to enlarge the box or scroll down to see it). Click the "Browse" button and find the histogram file you just saved. Either double-click that file or select it and click "Open" and you should see the path to that file appear in the Browse box. Click "Upload File" and its name should appear in the "Uploaded Files" pop-up window. Select the file in the list of "Uploaded Files" to highlight it and click OK and you should see the file appear
in the text box.
- If you are using Apple/Mac:
- You will need to take a screen shot of your output in order to upload it. To take a screen shot hold down Command+Shift+4 and drag the cross-hairs over the image to capture it. The image will save a .png file to your desktop by default.
- To upload your file into the text box they provide: Click "HTML
editor" below the text box (if you have not already done so) to make a toolbar appear in the text box. Click the toolbar option called "Link" and select "Image." In the pop-up window that appears, click the button called "Find/Upload File" (it is at the bottom of the pop-up window, you may have to enlarge the box or scroll down to see it). Click the "Browse" button and find the histogram file you just saved. Either double-click that file or select it and click "Open" and
you should see the path to that file appear in the Browse box. Click "Upload File" and its name should appear in the "Uploaded Files" pop-up window. Select the file in the list of "Uploaded Files" to highlight it and click OK and you should see the file appear in the text box.
Part (d). As I say in Lesson 1, we should only use mean and standard deviation if a distribution is
symmetric.
To make the Time Series in part (e):
DON'T FORGET TO COMMENT ON THE TREND YOU SEE IN THE TIME SERIES IN THE HTML EDITOR BOX AFTER YOU HAVE CONSTRUCTED AND UPLOADED IT! In other words, don't forget to answer part
(f).
First, enter the data into JMP manually: Click the "New Data Table" icon on the toolbar at top left in the JMP home screen (or select "File" in the toolbar, then New, then Data Table). You are automatically taken to an empty spreadsheet with one column. Double-click "Column 1" and change its name to Monthly Energy, or right-click "Column 1" and select "Column
Info" and type in the name Monthly Energy and click OK.
Enter all of the energy scores down the Monthly Energy column (as given in the second row of the data table they provide). Make sure that you are entering the data in the order you have been given.
DO NOT EVEN TRY TO ENTER THE DATES (like Jan 2011) in your data table!
That entire row of information is irrelevant to what we will do, and is an absolute pain to deal with.
If you have done things correctly, you will have a data table with just one column, Monthly Energy, and that column has all the given energy scores in order from Jan 2011 to Dec 2012 even though there has been absolutely no mention of those dates in your data table.
You are now ready to
make the time series. Select Analyze in the toolbar, then select Modeling in the drop-down list and finally select time series. Select the variable you are tracking, Monthly Energy, and click "Y, Time Series". Click OK. Just ignore that other pop-up menu asking about time lags or autocorrelations or whatever, click OK and move on. None of that has anything to do
with the time series.
You should now be looking at your Time Series with "Row" on the horizontal axis and Monthly Energy on the vertical axis. Click the red triangle next to "Time Series Price" and deselect "Autocorrelation" and "Partial Autocorrelation" to remove those parts of the output. Click the red triangle again, select "Graph" then deselect "Mean Line". That removes the horizontal
line in your time series showing the mean points score.
- If you are using Windows:
- Press "Alt" on your keyboard or click the thin blue line that is near the top of the window to get the toolbar icons to appear. Select "File" then "Save As" to get a pop-up window. Type in whatever name you want the file to have in the "File name" section. Click the "Browse Folders" arrow and
select which folder you want to save the file in (I suggest you select "Desktop" so that the file will just appear right on your desktop home screen). Finally, click the drop down arrow in the "Save as type" section and select "JPEG File". Click "Save". You should now have your file ready to upload into the assignment.
- To upload your file into the text box they provide: Click "HTML editor" below the text box (if you have not already done
so) to make a toolbar appear in the text box. Click the toolbar option called "Link" and select "Image." In the pop-up window that appears, click the button called "Find/Upload File" (it is at the bottom of the pop-up window, you may have to enlarge the box or scroll down to see it). Click the "Browse" button and find the time series file you just saved. Either double-click that file or select it and click "Open" and you should see the path to that file appear in the
Browse box. Click "Upload File" and its name should appear in the "Uploaded Files" pop-up window. Select the file in the list of "Uploaded Files" to highlight it and click OK and you should see the file appear in the text box.
- If you are using Apple/Mac:
- You will need to take a screen shot of your output in order to upload it. To take a screen shot hold down Command+Shift+4 and drag the
cross-hairs over the image to capture it. The image will save a .png file to your desktop by default.
- To upload your file into the text box they provide: Click "HTML editor" below the text box (if you have not already done so) to make a toolbar appear in the text box. Click the toolbar option called "Link" and select "Image." In the pop-up window that appears, click the button called "Find/Upload File" (it is at the bottom of the
pop-up window, you may have to enlarge the box or scroll down to see it). Click the "Browse" button and find the time series file you just saved. Either double-click that file or select it and click "Open" and you should see the path to that file appear in the Browse box. Click "Upload File" and its name should appear in the "Uploaded Files" pop-up window. Select the file in the list of "Uploaded Files" to highlight it and click OK and you should see the file appear in
the text box.
DON'T FORGET TO COMMENT ON THE TREND YOU SEE IN THE TIME SERIES IN THE HTML EDITOR BOX!
Question 4: Graphical Displays -- Stock Returns
|
To make the side-by-side boxplots in part (a):Open a "New Data Table" in JMP.
You will make two columns, but not the way you might think. DO NOT put Consumer Stocks in one column and Technology Stocks in
another!
Double-click Column 1 (or right-click and select Column Info) and name it Return. Type all 13 scores from the Consumer Stocks first then enter the 17 scores from the Technology Stocks data, giving you a total of 30 rows in the first column. Double-click the region to the right of Column 1 at the top to create Column 2 and name that column
Sector. Type Consumer in the first 13 rows of that column (better yet, type it once, then copy and paste it into the next 12 rows; that way you ensure it is typed exactly the same in all 13 rows as is necessary). Then type (or copy and paste) Technology in the remaining rows of column 2.
If you have done this correctly, you should now have two columns. The first column shows the
Return of all 30 stocks you were given. The second column shows the Sector each of those 30 stocks came from (Consumer or Technology).
Make sure the column properties are correct! Right-click at the top of each column and select Column Info to check what it says for Data Type and Modeling Type.
For Return, the Data Type should be Numeric and the Modeling Type should be continuous. If it is not, click the drop-down lists to change them.
For Sector, the Data Type should be Character and the Modeling
Type should be nominal. If it is not, click the drop-down lists to change them.
To make the side-by-side boxplots: Select "Analyze" then "Fit Y By X". Highlight Return and click "Y, Response". Highlight Sector and click "X, Factor". Click OK. This should open a pop-up window with a bunch of dots arranged vertically in two columns on a graph for Consumer
and Technology.
If that does not happen, you do not have the correct Data Type and Modeling Type for your data! Follow my instructions above to fix your column properties.
Now click the red triangle next to Oneway Analysis ... and select Quantiles. Your side-by-side box plots should appear on the graph as well as a Quantiles output below
that shows you the five-number summary among other things. Click the red triangle again and select "Display Options" (down near the bottom of the menu), then deselect Grand Mean to get rid of the horizontal line in the graph showing the mean of all the scores.
You are now ready to insert your boxplots into the HTML editor box:
- If you are using Windows:
- Press
"Alt" on your keyboard or click the thin blue line that is near the top of the window to get the toolbar icons to appear. Select "File" then "Save As" to get a pop-up window. Type in whatever name you want the file to have in the "File name" section. Click the "Browse Folders" arrow and select which folder you want to save the file in (I suggest you select "Desktop" so that the file will just appear right on your desktop home screen).
Finally, click the drop down arrow in the "Save as type" section and select "JPEG File". Click "Save". You should now have your file ready to upload into the assignment.
- To upload your file into the text box they provide: Click "HTML editor" below the text box (if you have not already done so) to make a toolbar appear in the text box. Click the toolbar option called "Link" and select "Image." In the pop-up window that appears, click
the button called "Find/Upload File" (it is at the bottom of the pop-up window, you may have to enlarge the box or scroll down to see it). Click the "Browse" button and find the boxplots file you just saved. Either double-click that file or select it and click "Open" and you should see the path to that file appear in the Browse box. Click "Upload File" and its name should appear in the "Uploaded Files" pop-up window. Select the file in the list of "Uploaded Files" to
highlight it and click OK and you should see the file appear in the text box.
- If you are using Apple/Mac:
- You will need to take a screen shot of your output in order to upload it. To take a screen shot hold down Command+Shift+4 and drag the cross-hairs over the image to capture it. The image will save a .png file to your desktop by default.
- To upload
your file into the text box they provide: Click "HTML editor" below the text box (if you have not already done so) to make a toolbar appear in the text box. Click the toolbar option called "Link" and select "Image." In the pop-up window that appears, click the button called "Find/Upload File" (it is at the bottom of the pop-up window, you may have to enlarge the box or scroll down to see it). Click the "Browse" button and find the boxplots file you just saved.
Either double-click that file or select it and click "Open" and you should see the path to that file appear in the Browse box. Click "Upload File" and its name should appear in the "Uploaded Files" pop-up window. Select the file in the list of "Uploaded Files" to highlight it and click OK and you should see the file appear in the text box.
Type your answers to parts (b), (c) and (d) directly into the
box. Don't forget to type HTML Editor before you type anything in the box.
What are the side-by-side boxplots telling you about the potential returns in your stocks. Which would you prefer to invest in if you are a risk-taker and looking for big profits? Which would you prefer if you are a risk-averse investor and just want to make sure you have enough to live on when you retire?
Question 5: Boxplots -- Vehicle Speeds
|
Make sure you compute the five-number summary by hand, as I demonstrate in Lesson 1, question 4.
NOTE, in part (b), they want you to make the standard boxplot, then comment on the shape you see. However, don't even think
about whether there are outliers or not! They want you to assume there were no outliers at all (they don't even ask you to think about outliers until the next part). In other words, do the whiskers make the distribution appear skewed?
Now, in part (c) they want you to use the 1.5 IQR Rule to identify the cut-offs for outliers. Then, count your outliers to answer part (d).
Then, in
part (e), they want you to make the outlier boxplot which will almost certainly cause you to change your opinion about skewness. They are trying to show you how it is important to identify outliers first before you comment on the shape of a distribution.
Question 6: Mean and Standard Deviation -- Calling Cards
|
This question should be done by hand (i.e. with your calculator, not with JMP). Use the Stat Mode on your calculator to compute the Mean and Standard Deviation. Don't you dare waste your time using the formulas to compute the mean and standard
deviation. That is what your Stat Mode on your calculator is for!Check the Appendix at the back of my book to learn how to use the Stat Mode on your calculator. Here is a link to a digital copy of that appendix: Make sure you round the answers off to 2 decimal places before proceeding to answer the other parts of the question.
Part (c) is introducing a key concept about changing the units in data. Be sure to read the "Effect of Changing Units on Centre and Spread" section of my book in Lesson 1 and see questions 17 and 18 for examples. As they say, once you know the mean and standard deviation from parts
(a) and (b), you can convert them into the mean and standard deviation for part (c) using the conversion information they gave you at the start. Here, you have to multiply the scores by a certain amount, then add on the incentive bonus to convert the amount spent to the loyalty points.
Rather than convert each piece of data, you can apply this recipe to the mean and standard deviation of the amount spent to get the mean and standard deviation of the
charge for the calls. But do it properly! Remember, addition or subtraction of a constant has no effect on spread, only multiplication or division does. While everything affects measures of centre.
Be careful, parts (d) and (e) are referring to the length of calls again, not the charge for those calls.
For part (d), consider this: Let's say you are taking a course, and your
average mark so far is 65. What will happen to your average if you score higher on the next test? What if you score lower on the next test? What would you have to get on the next test to keep your average 65?
For part (e), having decided what that new score must be in part (d), how much does that score deviate from the mean? If that is a larger deviation than the standard deviation you computed earlier, you have increased the overall standard deviation;
if it is the same amount of deviation as earlier, you have not changed the standard deviation at all; if it has a smaller deviation, you have decreased your overall standard deviation.
The closer a value is to the mean, the smaller its deviation from the mean. Small deviations cause low standard deviations; large deviations cause high standard deviations.
Question 7: Weighted Mean -- GPA
|
First, you need to know the scores attached to each letter grade. An A+ is 4.5, A is 4, B+ is 3.5, B is 3, etc.
To compute your grade point average: First, make a new column where you multiply each grade score by the number of credit hours. For example, if
you got a B+ in a 3 credit-hour course, you would multiply 3.5 by 3 to get 10.5 in this new column. Find the total of this new column and find the total number of credit hours. Divide the total of the new column by the total number of credit hours to get the GPA. Put another way, if you got a B+ in 3 credit-hour course, it is as though you scored 3.5 three separate times. You could put your calculator in Stat Mode, and enter 3.5 in three separate times. If you got
an A in a 6 credit-hour course, you got 4.0 six times. Enter 4.0 six separate times. After you have entered all the data, your calculator will tell you the mean (your GPA).
An easy way to think of grade points is to consider the amount of credit hours as the frequency of that grade. Gettting an A in a 3-credit hour course, is like getting an A 3 separate times. Getting a C in a 6-credit hour course is like getting a C 6 separate times. It is like
finding the average of three A's and six C's.
|
|