Chapter 4 Statistics Standard Deviation n Sample Standard

  • Slides: 42
Download presentation
Chapter 4 Statistics

Chapter 4 Statistics

Standard Deviation n Sample Standard deviation (for n Population Standard deviation use with small

Standard Deviation n Sample Standard deviation (for n Population Standard deviation use with small samples n< ~25) (for use with samples n > 25) n n m = population mean IN the absence of systematic error, the population mean approaches the true value for the measured quantity.

Example n The following results were obtained in the replicate analysis of a blood

Example n The following results were obtained in the replicate analysis of a blood sample for its lead content: 0. 752, 0. 756, 0. 752, 0. 760 ppm lead. Calculate the mean and standard deviation for the data set.

Standard deviation n 0. 752, 0. 756, 0. 752, 0. 760 ppm lead. Excel®

Standard deviation n 0. 752, 0. 756, 0. 752, 0. 760 ppm lead. Excel® Demo You’d report the amount of lead in this sample of blood as

Distributions of Experimental Data n n We find that the distribution of replicate data

Distributions of Experimental Data n n We find that the distribution of replicate data from most quantitative analytical measurements approaches a Gaussian curve. Example – Consider the calibration of a pipet.

Replicate data on the calibration of a 10 -ml pipet.

Replicate data on the calibration of a 10 -ml pipet.

Frequency distribution

Frequency distribution

Average= 9. 982 Std. Dev = + 0. 0056

Average= 9. 982 Std. Dev = + 0. 0056

4 -2 Confidence Intervals

4 -2 Confidence Intervals

For small data sets m is the true mean and the above equations express

For small data sets m is the true mean and the above equations express that the “true mean” will be in the calculated range at a given confidence.

Example n The following results were obtained in the replicate analysis of a blood

Example n The following results were obtained in the replicate analysis of a blood sample for its lead content: 0. 752, 0. 756, 0. 752, 0. 760 ppm lead. Calculate the mean and standard deviation for the data set. Find (a) the 50% CL and (b) the 90% CL

Confidence Intervals ?

Confidence Intervals ?

Confidence Intervals ?

Confidence Intervals ?

Confidence Intervals ?

Confidence Intervals ?

Confidence Intervals 90 % CI 50 % CI 0. 750 0. 755 0. 760

Confidence Intervals 90 % CI 50 % CI 0. 750 0. 755 0. 760 There is a 50% chance that the true mean, m, lies in the range 0. 755 + 0. 001 ppm (of from 0. 754 to 0. 756 ppm) Likewise, these calculations mean that there is a 90% chance that the true mean, m, lies in the range 0. 755 + 0. 005 ppm (of from 0. 750 to 0. 760 ppm)

Confidence limits and uncertainty n Suppose we measure the volume of a vessel five

Confidence limits and uncertainty n Suppose we measure the volume of a vessel five times and observe values: 6. 372, 6. 375, 6. 374, 6. 377, and 6. 375 m. L. And find average = 6. 3746 m. L And s = 0. 0018 m. L Use a 90% CL to Estimate uncertainty!

Experimental Uncertainty n Well, a 90% CI means that there is a 90% chance

Experimental Uncertainty n Well, a 90% CI means that there is a 90% chance that the true volume is within the range. And find average = 6. 3746 m. L And s = 0. 0018 m. L

Experimental Uncertainty

Experimental Uncertainty

Comparison of Means with Student’s t

Comparison of Means with Student’s t

Comparison of Means with Student’s t n A t test is used to compare

Comparison of Means with Student’s t n A t test is used to compare one set of measurements with another to decide whether or not they are “The Same” n n Compare measured result with a “true” value Comparing two experimental means

Comparing a mean with a true value n n Good for detecting systematic (determinate)

Comparing a mean with a true value n n Good for detecting systematic (determinate) errors Uses student tvalues COMPARE TO ttable If tcalc > ttable – difference is significant If tcalc < ttable - difference is NOT significant

EXAMPLE n A new procedure for the rapid determination of sulfur in kerosene was

EXAMPLE n A new procedure for the rapid determination of sulfur in kerosene was tested on a KNOWN sample (m or xt = 0. 123% S). The results were: % S = 0. 112, 0. 118, 0. 115, and 0. 119. Is there a difference at the 95% confidence level?

tcalculated = ttable = 3. 182 Are they significantly different?

tcalculated = ttable = 3. 182 Are they significantly different?

Strutt’s Story At the turn of the last century it was generally thought that

Strutt’s Story At the turn of the last century it was generally thought that dry air contained about one-fifth oxygen and four-fifths nitrogen. One man wanted to confirm this …. sample – Dry air … Added red-hot copper. Cu would react with oxygen to make solid copper oxide (Cu. O). n 1 st Air without Oxygen. 2 nd sample – Make an equal volume of nitrogen. n Pure Nitrogen can be generated by decomposition of N 2 O (Nitrous oxide) or NO (Nitric Oxide). Pure nitrogen. Reasoned the amounts would be the same

Is there a Difference at the 95% Confidence Level?

Is there a Difference at the 95% Confidence Level?

Comparison of two means

Comparison of two means

Comparison of two means

Comparison of two means

If tcalc > ttable – difference is significant n n Why the difference? In

If tcalc > ttable – difference is significant n n Why the difference? In 1904, Lord Rayleigh was awarded the Novel Prize for discovering Argon

Comparison of Standard deviations between data n The F-test may be used to provide

Comparison of Standard deviations between data n The F-test may be used to provide insights into: n Whethere is a difference in the precision of two methods. n n (may warrant a new calculation to compare means! ) Is method A more precise than method B?

F-test (comparison of std. dev. ) We always put the larger standard deviation in

F-test (comparison of std. dev. ) We always put the larger standard deviation in the numerator, so that F>1. If Fcalculated > Ftable then the difference is significant at the 95% CL.

Example A well developed method for protein concentration determination yields a standard deviation of

Example A well developed method for protein concentration determination yields a standard deviation of 0. 25 M over many hundreds of replicates. A) Dr. Skeels develops a rapid method for the determination of protein concentration that yields a standard deviation of 0. 15 M (for 12 degrees of freedom). B) Dr. Marano’s method yields 0. 11 M (std dev) for the same number of degrees of freedom. Is Dr. Skeels’ method more precise than the standard or is Dr. Marano’s, or neither? n

Throwing out “Bad data”

Throwing out “Bad data”

For an analysis of alcohol content in wine Dr. Skeels finds the following: 12.

For an analysis of alcohol content in wine Dr. Skeels finds the following: 12. 53, 12. 56, 12. 47, 12. 67, and 12. 48% n

Q-test for Bad Data Compare to Qcritical Qcalc > Qcritical can reject

Q-test for Bad Data Compare to Qcritical Qcalc > Qcritical can reject

Range 12. 47 12. 48 12. 53 12. 56 12. 67 Gap

Range 12. 47 12. 48 12. 53 12. 56 12. 67 Gap