When N 30 and the population standard deviation is unknown what is the appropriate?

Q: When the sample size is 30 or less and the population standard deviation is unknown we use?

If the population standard deviation is unknown and the sample size n is small then when we substitute the sample standard deviation s for σ the normal approximation is no longer valid. The solution is to use a different distribution, called Students t-distribution with n−1 degrees of freedom.

Hypothesis Tests for One Population Mean when Sigma is Unknown

You may use this project freely under the Creative Commons Attribution-ShareAlike 4.0 International License. Please cite as follow: Hartmann, K., Krois, J., Waske, B. (2018): E-Learning Project SOGA: Statistics and Geospatial Data Analysis. Department of Earth Sciences, Freie Universitaet Berlin.

Inhaltsverzeichnis Show

Hypothesis Tests for One Population Mean when Sigma is Unknown
Population Standard Deviation Known
Student's t Distribution
Degrees of Freedom
Population Standard Deviation Unknown
What Is a Z-Test?
Key Takeaways
Understanding Z-Tests
One-Sample Z-Test Example
What's the Difference Between a T-Test and Z-Test?
When Should You Use a Z-Test?
What Is a Z-Score?
What Is Central Limit Theorem (CLT)?
The Bottom Line
What is the appropriate distribution if'n 30 and the population standard deviation is unknown?
When N 30 and the population standard deviation is known what is the appropriate test to be used?
When the sample size is 30 or less and the population standard deviation is unknown we use?
What do you do if the population standard deviation is unknown?

You are estimating the population mean, mu, not the sample mean, x bar.

Population Standard Deviation Known

If the population standard deviation, sigma is known, then the mean has a normal (Z) distribution.

The maximum error of the estimate is given by the formula for E shown

. The Z here is the z-score obtained from the normal table, or the bottom of the t-table as explained in the introduction to estimation. The z-score is a factor of the level of confidence, so you may get in the habit of writing it next to the level of confidence.

Once you have computed E, I suggest you save it to the memory on your calculator. On the TI-82, a good choice would be the letter E. The reason for this is that the limits for the confidence interval are now found by subtracting and adding the maximum error of the estimate from/to the sample mean.

Student's t Distribution

When the population standard deviation is unknown, the mean has a Student's t distribution. The Student's t distribution was created by William T. Gosset, an Irish brewery worker. The brewery wouldn't allow him to publish his work under his name, so he used the pseudonym "Student".

The Student's t distribution is very similar to the standard normal distribution.

It is symmetric about its mean
It has a mean of zero
It has a standard deviation and variance greater than 1.
There are actually many t distributions, one for each degree of freedom
As the sample size increases, the t distribution approaches the normal distribution.
It is bell shaped.
The t-scores can be negative or positive, but the probabilities are always positive.

Degrees of Freedom

A degree of freedom occurs for every data value which is allowed to vary once a statistic has been fixed. For a single mean, there are n-1 degrees of freedom. This value will change depending on the statistic being used.

Population Standard Deviation Unknown

If the population standard deviation, sigma is unknown, then the mean has a student's t (t) distribution and the sample standard deviation is used instead of the population standard deviation.

The maximum error of the estimate is given by the formula for E shown

. The t here is the t-score obtained from the Student's t table. The t-score is a factor of the level of confidence and the sample size.

Notice the formula is the same as for a population mean when the population standard deviation is known. The only thing that has changed is the formula for the maximum error of the estimate.

Table of Contents

What Is a Z-Test?

A z-test is a statistical test used to determine whether two population means are different when the variances are known and the sample size is large.

The test statistic is assumed to have a normal distribution, and nuisance parameters such as standard deviation should be known in order for an accurate z-test to be performed.

Key Takeaways

A z-test is a statistical test to determine whether two population means are different when the variances are known and the sample size is large.
A z-test is a hypothesis test in which the z-statistic follows a normal distribution.
A z-statistic, or z-score, is a number representing the result from the z-test.
Z-tests are closely related to t-tests, but t-tests are best performed when an experiment has a small sample size.
Z-tests assume the standard deviation is known, while t-tests assume it is unknown.

Understanding Z-Tests

The z-test is also a hypothesis test in which the z-statistic follows a normal distribution. The z-test is best used for greater-than-30 samples because, under the central limit theorem, as the number of samples gets larger, the samples are considered to be approximately normally distributed.

When conducting a z-test, the null and alternative hypotheses, alpha and z-score should be stated. Next, the test statistic should be calculated, and the results and conclusion stated. A z-statistic, or z-score, is a number representing how many standard deviations above or below the mean population a score derived from a z-test is.

Examples of tests that can be conducted as z-tests include a one-sample location test, a two-sample location test, a paired difference test, and a maximum likelihood estimate. Z-tests are closely related to t-tests, but t-tests are best performed when an experiment has a small sample size. Also, t-tests assume the standard deviation is unknown, while z-tests assume it is known. If the standard deviation of the population is unknown, the assumption of the sample variance equaling the population variance is made.

One-Sample Z-Test Example

Assume an investor wishes to test whether the average daily return of a stock is greater than 3%. A simple random sample of 50 returns is calculated and has an average of 2%. Assume the standard deviation of the returns is 2.5%. Therefore, the null hypothesis is when the average, or mean, is equal to 3%.

Conversely, the alternative hypothesis is whether the mean return is greater or less than 3%. Assume an alpha of 0.05% is selected with a two-tailed test. Consequently, there is 0.025% of the samples in each tail, and the alpha has a critical value of 1.96 or -1.96. If the value of z is greater than 1.96 or less than -1.96, the null hypothesis is rejected.

The value for z is calculated by subtracting the value of the average daily return selected for the test, or 1% in this case, from the observed average of the samples. Next, divide the resulting value by the standard deviation divided by the square root of the number of observed values.

Therefore, the test statistic is:

(0.02 - 0.01) ÷ (0.025 ÷ √ 50) = 2.83

The investor rejects the null hypothesis since z is greater than 1.96 and concludes that the average daily return is greater than 1%.

What's the Difference Between a T-Test and Z-Test?

Z-tests are closely related to t-tests, but t-tests are best performed when the data consists of a small sample size, i.e., less than 30. Also, t-tests assume the standard deviation is unknown, while z-tests assume it is known.

When Should You Use a Z-Test?

If the standard deviation of the population is unknown and the sample size is greater than or equal to 30, then the assumption of the sample variance equaling the population variance should be made using the z-test. Regardless of the sample size, if the population standard deviation for a variable remains unknown, a t-test should be used instead.

What Is a Z-Score?

A z-score, or z-statistic, is a number representing how many standard deviations above or below the mean population the score derived from a z-test is. Essentially, it is a numerical measurement that describes a value's relationship to the mean of a group of values. If a z-score is 0, it indicates that the data point's score is identical to the mean score. A z-score of 1.0 would indicate a value that is one standard deviation from the mean. Z-scores may be positive or negative, with a positive value indicating the score is above the mean and a negative score indicating it is below the mean.

What Is Central Limit Theorem (CLT)?

In the study of probability theory, the central limit theorem (CLT) states that the distribution of sample approximates a normal distribution (also known as a “bell curve”) as the sample size becomes larger, assuming that all samples are identical in size, and regardless of the population distribution shape. Sample sizes equal to or greater than 30 are considered sufficient for the CLT to predict the characteristics of a population accurately. The z-test's fidelity relies on the CLT holding.

The Bottom Line

A z-test is used in hypothesis testing to evaluate whether a finding or association is statistically significant or not. In particular, it tests whether two means are the same (the null hypothesis). A z-test can only be used if the population standard deviation is known and the sample size is 30 data points or larger. Otherwise, a t-test should be employed.

What is the appropriate distribution if'n 30 and the population standard deviation is unknown?

You must use the t-distribution table when working problems when the population standard deviation (σ) is not known and the sample size is small (n<30). General Correct Rule: If σ is not known, then using t-distribution is correct.

When N 30 and the population standard deviation is known what is the appropriate test to be used?

A z-test can only be used if the population standard deviation is known and the sample size is 30 data points or larger.

When the sample size is 30 or less and the population standard deviation is unknown we use?

If the population standard deviation is unknown and the sample size n is small then when we substitute the sample standard deviation s for σ the normal approximation is no longer valid. The solution is to use a different distribution, called Student's t-distribution with n−1 degrees of freedom.

What do you do if the population standard deviation is unknown?

Population Standard Deviation Unknown If the population standard deviation, sigma is unknown, then the mean has a student's t (t) distribution and the sample standard deviation is used instead of the population standard deviation. . The t here is the t-score obtained from the Student's t table.