Author Question: What would the correlation coefficient be if all observations for the two variables were on a curve ... (Read 357 times)

jCorn1234

  • Hero Member
  • *****
  • Posts: 545
What would the correlation coefficient be if all observations for the two variables were on a curve described by Y = X2?
 
  What will be an ideal response?

Question 2

For the exercise, use the first 500 observations only. Using data for average hourly earnings only (ahe), describe the earnings distribution. Use summary statistics, such as the mean, median, variance, and skewness. Produce a frequency distribution (histogram) using reasonable earnings class sizes.
 
  What will be an ideal response?



Jayson

  • Sr. Member
  • ****
  • Posts: 350
Answer to Question 1

Answer: The correlation coefficient would be zero in this case, since the relationship is non-linear.

Answer to Question 2

Answer:
ahe

Mean 19.79
Standard Error 0.51
Median 16.83
Mode 19.23
Standard Deviation 11.49
Sample Variance 131.98
Kurtosis 0.23
Skewness 0.96
Range 58.44
Minimum 2.14
Maximum 60.58
Sum 9897.45
Count 500.0

The mean is 19.79. The median (16.83) is lower than the average, suggesting that the mean is being pulled up by individuals with fairly high average hourly earnings. This is confirmed by the skewness measure, which is positive, and therefore suggests a distribution with a long tail to the right. The variance is 2131.96, while the standard deviation is 11.49.

To generate the frequency distribution in Excel, you first have to settle on the number of class intervals. Once you have decided on these, then the minimum and maximum in the data suggests the class width. In Excel, you then define bins (the upper limits of the class intervals). Sturges's formula can be used to suggest the number of class intervals (1+3.31log(n) ), which would suggest about 9 intervals here. Instead I settled for 8 intervals with a class width of 8  minimum wages in California are currently 8 and approximately the same in other U.S. states.

The table produces the absolute frequencies, and relative frequencies can be calculated in a straightforward way.

bins Frequency rel. freq.
8 50 0.1
16 187 0.374
24 115 0.23
32 68 0.136
40 38 0.076
48 33 0.066
56 8 0.016
66 1 0.002
More 0

Substitution of the relative frequencies into the histogram table then produces the following graph (after eliminating the gaps between the bars).



Related Topics

Need homework help now?

Ask unlimited questions for free

Ask a Question
 

Did you know?

Elderly adults are at greatest risk of stroke and myocardial infarction and have the most to gain from prophylaxis. Patients ages 60 to 80 years with blood pressures above 160/90 mm Hg should benefit from antihypertensive treatment.

Did you know?

Warfarin was developed as a consequence of the study of a strange bleeding disorder that suddenly occurred in cattle on the northern prairies of the United States in the early 1900s.

Did you know?

The lipid bilayer is made of phospholipids. They are arranged in a double layer because one of their ends is attracted to water while the other is repelled by water.

Did you know?

Despite claims by manufacturers, the supplement known as Ginkgo biloba was shown in a study of more than 3,000 participants to be ineffective in reducing development of dementia and Alzheimer’s disease in older people.

Did you know?

Certain rare plants containing cyanide include apricot pits and a type of potato called cassava. Fortunately, only chronic or massive ingestion of any of these plants can lead to serious poisoning.

For a complete list of videos, visit our video library