Chapter 1 Introduction to Statistical Methods The true

  • Slides: 66
Download presentation
Chapter 1: Introduction to Statistical Methods “The true logic of this world is in

Chapter 1: Introduction to Statistical Methods “The true logic of this world is in the calculus of probabilities”. James Clerk Maxwell

Introductory Remarks • This course: The Physics of systems containing HUGE numbers (~ 1023)

Introductory Remarks • This course: The Physics of systems containing HUGE numbers (~ 1023) of particles: Solids, liquids, gases, E&M radiation (photons), …. • Challenge: Describe a system’s Macroscopic characteristics from a Microscopic theory. Classical: Newton’s Laws need to solve 1023 coupled differential equations of motion (ABSURD!!) Quantum: Schrödinger’s Equation need to solve for 1023 particles (ABSURD!!)

 Use a Statistical description of such a system. Talk about Probabilities & Average

Use a Statistical description of such a system. Talk about Probabilities & Average Properties We are NOT concerned with the detailed behavior of individual particles. • Definitions: Microscopic: ~ Atomic dimensions ~ ≤ a few Å Macroscopic: Large enough to be “visible” in the “ordinary” sense

Definitions • An Isolated System is in Equilibrium when it’s Macroscopic parameters are timeindependent.

Definitions • An Isolated System is in Equilibrium when it’s Macroscopic parameters are timeindependent. This is the usual case in this course! • But, note! Even if it’s Macroscopic parameters are time-independent, a system’s Microscopic parameters can still vary with time!

The Random Walk & The Binomial Distribution Section 1. 1: Elementary Statistical Concepts; Examples

The Random Walk & The Binomial Distribution Section 1. 1: Elementary Statistical Concepts; Examples • Some math preliminaries (& methods) for the next few lectures. • To treat statistical physics problems, we must first know something about the mathematics of Probability & Statistics The following should be a review! (? ) • Keep in mind: Whenever we want to describe a situation using probability & statistics, we must consider an assembly of a large number N (in principle, N ∞) of “similarly prepared systems”.

This assembly is called an ENSEMBLE (“Ensemble” = the French word for assembly). •

This assembly is called an ENSEMBLE (“Ensemble” = the French word for assembly). • The Probability of an occurrence of a particular event is DEFINED with respect to this particular ensemble & is given by the fraction of systems in the ensemble characterized by the occurrence of this event. • Example: In throwing a pair of dice, we can give a statistical description by considering that a very large number N of similar pairs of dice are thrown under similar circumstances. Alternatively, we could imagine the same pair of dice thrown N times under similar circumstances. The probability of obtaining two 1’s is then given by the fraction of these experiments in which two 1’s is the outcome.

 • Note that this probability depends strongly on the nature of the ensemble

• Note that this probability depends strongly on the nature of the ensemble to which we are referring. • Reif’s flower seed example (p. 5). • To quantitatively introduce probability concepts, we use a specific, simple example, which is actually much more general than you first might think. • The example is called The Random Walk Problem

One-Dimensional Random Walk In it’s simplest, idealized form, the random walk problem can be

One-Dimensional Random Walk In it’s simplest, idealized form, the random walk problem can be viewed as in the figure above: A drunk starts out from a lamp post on a street. Each step he takes is of equal length ℓ. The man is SO DRUNK, that the direction of each step (right or left) is completely independent of the preceding step. The probability of stepping to the right is p & of stepping to the left is q = 1 – p. In general, q ≠ p. The x axis is along the sidewalk, the lamp post is at x = 0. Each step is of length ℓ, so his location on the x axis must be x = mℓ where m = a positive or a negative integer.

 • Question: After N steps, what is the probability that the man is

• Question: After N steps, what is the probability that the man is at a specific location x = mℓ (m specified)? To answer, we first consider an ensemble of a large number N of drunk men starting from similar lamp posts. Or repeat this with the same drunk man walking on the sidewalk N times. • This can be easily generalized to 2 dimensions. See figure to the right:

The 2 dimensional random walk corresponds to a PHYSICS problem of adding N, 2

The 2 dimensional random walk corresponds to a PHYSICS problem of adding N, 2 dimensional vectors of equal length (see figure) & random directions & asking: What is the probability that the resultant has a certain magnitude & direction?

Physical Examples to which the Random Walk Problem applies 1. Magnetism N atoms, each

Physical Examples to which the Random Walk Problem applies 1. Magnetism N atoms, each with magnetic moment μ. Each has spin ½. By Quantum Mechanics, each magnetic moment can point either “up” or “down”. If these are equally likely, what is the Net magnetic moment of the N atoms? 2. Diffusion of a Molecule of |Gas A molecule travels in 3 dimensions with a mean distance ℓ between collisions. How far is it likely to have traveled after N collisions? Answer using Classical Mechanics.

The Random Walk Problem • The Random Walk Problem illustrates some fundamental results of

The Random Walk Problem • The Random Walk Problem illustrates some fundamental results of Probability Theory. • The techniques used are Powerful & General. They are used repeatedly throughout Statistical Mechanics. So, it’s very important to spend some time on this problem & understand it!

Section 1. 2: 1 Dimensional Random Walk • Forget the drunk, let’s get back

Section 1. 2: 1 Dimensional Random Walk • Forget the drunk, let’s get back to Physics! Think of a particle moving in 1 dimension in steps of length ℓ, with probability p of stepping to the right & q = 1 – p of stepping to the left. After N steps, the particle is at position: x = mℓ (- N ≤ m ≤ N). Let n 1 ≡ # of steps to the right (of N), n 2 ≡ # of steps to the left. Clearly, N = n 1 + n 2 Clearly also, x ≡ mℓ = (n 1 - n 2)ℓ or, m = n 1 - n 2 Combining (1) & (2) gives m = 2 n 1 – N Thus, if N is odd, so is m and if N is even, so is m. (1) (2) (3)

A Fundamental Assumption is that successive steps are statistically independent • Let p ≡

A Fundamental Assumption is that successive steps are statistically independent • Let p ≡ the probability of stepping to the right and q = 1 – p ≡ the probability of stepping to the left. • Since each step is statistically independent, the probability of a given sequence of n 1 steps to the right followed by n 2 steps to the left is given by multiplying the respective probabilities for each step: p·p·p· · · · p·p· ···· · q·q·q·q·q· · · q·q ≡ pn 1 qn 2 n 1 factors n 2 factors • But, also, clearly, there are MANY different possible ways of taking N steps so n 1 are to right & n 2 are to left!

 • The # of distinct possibilities is the SAME as counting the #

• The # of distinct possibilities is the SAME as counting the # of distinct ways we can place N objects, n 1 of one type & n 2 of another in N = n 1 + n 2 places: 1 st place: Can be occupied any one of N ways 2 nd place: Can be occupied any one of N - 1 ways 3 rd place: Can be occupied any one of N - 2 ways · · (N – 1)th place: Can be occupied only 2 ways Nth place: Can be occupied only 1 way All available places can be occupied in: N(N-1)(N-2)(N-3)(N-4)·····(3)(2)(1) ≡ N! Ways N! ≡ “N-Factorial”

 • Note However! This analysis doesn’t take into account the fact that there

• Note However! This analysis doesn’t take into account the fact that there are only 2 distinguishable kinds of objects: n 1 of the 1 st type & n 2 of the 2 nd type. All n 1! possible permutations of the 1 st type of object lead to exactly the same N! possible arrangements of the objects. Similarly, all n 2! possible permutations of the 2 nd type of object also lead to exactly the same N! arrangements. So, we need to divide the result by n 1!n 2! So, the # of distinct ways in which N objects can be arranged with n 1 of the 1 st type & n 2 of the 2 nd type is ≡ N!/(n 1!n 2!) This is the same as the # of distinct ways of taking N steps, with n 1 to the right & n 2 to the left.

In Summary: The probability WN(n 1) of taking N steps, with n 1 to

In Summary: The probability WN(n 1) of taking N steps, with n 1 to the right & n 2 = N - n 1 to the left is WN(n 1) = [N!/(n 1!n 2!)]pn 1 qn 2 or WN(n 1) = [N!/{n 1!(N – n 1)!]}pn 1(1 -p)n 2 • Often, this is written as WN(n 1) = N n 1 pn 1 qn 2 Remember that q = 1 - p • This probability distribution is called the Binomial Distribution. • This is because the Binomial Expansion has the form (p + q)N = ∑(n 1 = 0 N)[N!/[n!(N–n 1)!]pn 1 qn 2

 • We really want the probability PN(m) that x = mℓ after N

• We really want the probability PN(m) that x = mℓ after N steps. This really the same as WN(n 1) if we change notation: PN(m) = WN(n 1). But m = 2 n 1 – N, so n 1 = (½)(N + m) & n 2 = N - n 1 = (½)(N - m). So the probability PN(m) that x = mℓ after N steps is: PN(m) = {N!/([0. 5(N + m)]![0. 5(N – m)!]}p 0. 5(N+m)(1 -p)0. 5(N-m) For the common case of p = q = ½, this is: PN(m) = {N!/([0. 5(N + m)]![0. 5(N – m)!]}(½)N This is the usual form of the Binomial Distribution which is probably the most elementary (discrete) probability distribution.

 • As a trivial example, suppose that p = q = ½, N

• As a trivial example, suppose that p = q = ½, N = 3 steps: P 3(m) = {3!/[0. 5(3+m)!][0. 5(3 -m)!](½)3 So P 3(3) = P 3(-3) = (3!/[3!0!](⅛) = ⅛ P 3(1) = P 3(-1) = (3!/[2!1!](⅛) = ⅜ Possible Step Sequences n 1 3 2 1 0 n 2 0 1 2 3 m = n 1 – n 2 3 1 -1 -3

 • As another example, suppose that: p = q = ½, N =

• As another example, suppose that: p = q = ½, N = 20. P 20(m) = {20!/[0. 5(20 + m)!][0. 5(20 - m)!](½)3 • Doing this gives the histogram Note: The envelope of the histogram is a bell-shaped curve. The significance results in the figure of this is that, after N random steps, the probability of a particle being a distance of N steps away from the start is very small & the probability of it being at or near the origin is relatively large: P 20(20) = [20!/(20!0!)](½)20 P 20(20) 9. 5 10 -7 P 20(0) = [20!/(10!)2](½)20 P 20(0) 1. 8 10 -1

Sect. 1. 3: General Discussion of Mean Values

Sect. 1. 3: General Discussion of Mean Values

 • The Binomial Distribution is only one example of a probability distribution. Now,

• The Binomial Distribution is only one example of a probability distribution. Now, we’ll begin a discussion of a General Distribution. • Most of the following results are valid for ANY probability distribution, • Let u = a variable which can take on any of M discrete values: u 1, u 2, u 3, …, u. M-1, u. M with probabilities P(u 1), P(u 2), P(u 3), …. . , P(u. M-1), P(u. M) • The Mean (average) value of u is defined as: ū ≡ <u> ≡ (S 2/S 1) where S 1 ≡ P(u 1) + P(u 2) + P(u 3) +…. . + P(u. M-1) + P(u. M) ≡ ∑i. P(ui) S 2 ≡ u 1 P(u 1) + u 2 P(u 2) + u 3 P(u 3) +…. . + u. M-1 P(u. M-1) + u. M-1 P(u. M) ≡ ∑iui. P(ui) For a properly normalized distribution, S 1 = ∑i. P(ui) = 1. We assume this from now on.

 • Sometimes, ū is called the 1 st moment of P(u). • If

• Sometimes, ū is called the 1 st moment of P(u). • If O(u) is any function of u, the mean value of O(u) is: Ō ≡ <O> ≡ ∑i. O(ui)P(ui) • Some simple mean values that are useful for describing the probability distribution P(u): 1. The mean value, ū This is a measure of the central value of u about which the various values of ui are distributed. • Consider the quantity Δu ≡ u - ū (deviation from the mean). It’s mean is: <Δu> = <u - ū> = ū – ū = 0 The mean value of the deviation from the mean is always zero!

 • Now, lets look at (Δu)2 = (u - <u>)2 (square of the

• Now, lets look at (Δu)2 = (u - <u>)2 (square of the deviation from the mean). It’s mean value is: <(Δu)2> = <(u - <u>)2> = <u 2 -2 uū – (ū)2> = <u 2> - 2<u><u> – (<u>)2 = <u 2> - (<u>)2 • This is called the “Mean Square Deviation” (from the mean). It is also called several different (equivalent!) other names: the Dispersion or the Variance or the 2 nd Moment of P(u) about the mean. <(Δu)2> is a measure of the spread of the u values about the mean ū. NOTE that <(Δu)2> = 0 if & only if ui = ū for all i. • It can easily be shown that, <(Δu)2> ≥ 0, or <u 2> ≥ (<u>)2

 • We could also define the nth moment of P(u) about the mean:

• We could also define the nth moment of P(u) about the mean: <(Δu)n> ≡ <(u - <u>)n> • This is rarely used beyond n = 2. Almost never beyond n = 3 or 4. • NOTE: A knowledge of the probability distribution function P(u) gives complete information about the distribution of the values of u. But, a knowledge of only a few moments, like knowing just ū & <(Δu)2> implies only partial, though useful knowledge of the distribution. A knowledge of only some moments is not enough to uniquely determine P(u). Math Theorem • In order to uniquely determine a distribution P(u), we need to know ALL moments of it. That is we need all moments for n = 0, 1, 2, 3…. .

Section 1. 4: Calculation of Mean Values for the Random Walk Problem • Also

Section 1. 4: Calculation of Mean Values for the Random Walk Problem • Also we’ll discuss a few math “tricks” for doing discrete sums! • We’ve found: The probability in N steps of making n 1 to the right & n 2 = N - n 1 to the left is the Binomial Distribution: WN(n 1) = [N!/(n 1!n 2!)]pn 1 qn 2 p = the probability of a step to the right, q = 1 – p = the probability of a step to the left. • First, lets verify normalization: ∑(n 1 = 0 N) WN(n 1) = 1? • Recall the binomial expansion: (p + q)N = ∑(n 1 = 0 N) [N!/(n 1!n 2!)]pn 1 qn 2 = ∑(n 1 = 0 N) WN(n 1) • But, (p + q) = 1, so (p + q)N = 1 & ∑(n 1 = 0 N) WN(n 1) = 1.

 • Question 1: What is the mean number of steps to the right?

• Question 1: What is the mean number of steps to the right? <n 1> ≡ ∑(n 1 = 0 N) n 1 WN(n 1) = ∑(n 1 = 0 N) n 1[N!/(n 1!(N-n 1)!]pn 1 q. N-n 1 (1) • We can do this sum by looking it up in a table OR we can use a “trick” as follows. The following is a general procedure which usually works, even if it doesn’t always have mathematical “rigor”. • Temporarily, lets treat p & q as “arbitrary”, continuous variables, ignoring the fact that p + q =1. • NOTE that, if p is a continuous variable, then we clearly have: n 1 pn 1 ≡ p[ (pn 1)/ p] • Now, use this in (1) (interchanging the sum & the derivative): <n 1> = ∑(n 1 = 0 N) [N!/(n 1!(N-n 1)!]n 1 pn 1 qn 2 = ∑(n 1 = 0 N)[N!/(n 1!(N-n 1)!]p[ (pn 1)/ p]qn 2 = p[ / p]∑(n 1 = 0 N) [N!/(n 1!(N-n 1)!]pn 1 q. N-n 1 = p[ / p](p + q)N = p. N(p + q)N-1 But, for our special case (p + q) = 1, (p + q)N-1 = 1, so <n 1> = Np

 • Summary: The mean number of steps to the right is: <n 1>

• Summary: The mean number of steps to the right is: <n 1> = Np • We might have guessed this! Similarly, we can also easily show that The mean number of steps to the left is: <n 2> = Nq • Of course, <n 1> + <n 2> = N(p + q) = N as it should! • Question 2: What is the mean displacement, <x> = <m>ℓ? Clearly, m = n 1 – n 2, so <m> = <n 1> - <n 2> = N( p – q) So, if p = q = ½, <m> = 0 so, <x> = <m>ℓ = = 0

 • Question 3: What is the dispersion (or variance) <(Δn 1)2> = <(n

• Question 3: What is the dispersion (or variance) <(Δn 1)2> = <(n 1 - <n 1>)2> in the number of steps to the right? That is, what is the spread in n 1 values about <n 1>? • Our general discussion has shown that: <(Δn 1)2> = <(n 1)2> - (<n 1>)2 Also we’ve just seen that <n 1> = Np So, we first need to calculate the quantity <(n 1)2> = ∑(n 1 = 0 N) (n 1)2 WN(n 1) = ∑(n 1 = 0 N)(n 1)2[N!/(n 1!(N-n 1)!]pn 1 q. N-n 1 • Use a similar “trick” as we did before & note that: (n 1)2 pn 1 ≡ [p( / p)]2 pn 1 (2)

 • After algebra (in the book) & using p + q = 1,

• After algebra (in the book) & using p + q = 1, we find: <(n 1)2> = (Np)2 + Npq = (<n 1>)2 + Npq • So, finally, using <(Δn 1)2> = <(n 1)2> - (<n 1>)2 <(Δn 1)2> = Npq • This is the dispersion or variance of the binomial distribution. • The root mean square (rms) deviation from the mean is defined as: (Δ*n 1) [<(Δn 1)2>]½ (in general). For the binomial distribution this is (Δ*n 1) = [Npq]½ The distribution width • Again note that: <n 1> = Np. So, the relative width of the distribution is: (Δ*n 1)/<n 1> = [Npq]½ (Np) = (q½) (p. N)½ If p = q, this is: (Δ*n 1)/<n 1> = 1 (N)½ = (N)-½ As N increases, the mean value increases N but the relative width decreases (N)-½

 • Question 4: What is the dispersion <(Δm)2> = <(m - <m>)2> in

• Question 4: What is the dispersion <(Δm)2> = <(m - <m>)2> in the net displacement? x = mℓ. What is the spread in m values about <m>)? • We had, m = n 1 – n 2 = 2 n 1 – N. So, <m> = 2<n 1> – N. Δm = m - <m> = (2 n 1 – N) – (2<n 1> - N) = 2(n 1 – <n 1>) = 2(Δn 1) (Δm)2 = 4(Δn 1)2. So, <(Δm)2> = 4<(Δn 1)2> • Using <(Δn 1)2> = Npq, this becomes: <(Δm)2> = 4 Npq • If p = q = ½, <(Δm)2> = N

Summary: 1 Dimensional Random Walk Problem • Probability Distribution is Binomial: WN(n 1) =

Summary: 1 Dimensional Random Walk Problem • Probability Distribution is Binomial: WN(n 1) = [N!/(n 1!n 2!)]pn 1 qn 2 • Mean number of steps to the right: <n 1> = Np • Dispersion in n 1: <(Δn 1)2> = Npq Relative width: (Δ*n 1)/<n 1> = (q½) (p. N)½ for N increasing, the mean value increases N, the relative width decreases (N)-½

Some General Comments about the Binomial Distribution

Some General Comments about the Binomial Distribution

The Binomial Distribution applies to cases where there are only two possible outcomes: head

The Binomial Distribution applies to cases where there are only two possible outcomes: head or tail, success or failure, defective item or good item, etc. Requirements justifying the use of the Binomial Distribution 1. The experiment must consist of n identical trials. 2. Each trial must result in only one of two possible outcomes. 3. The outcomes of the trials must be statistically independent. 4. All trials must have the same probability for a particular outcome.

Common Notation for the Binomial Distribution r items of one type and (n –

Common Notation for the Binomial Distribution r items of one type and (n – r) of a second type can be arranged in n. Cr ways. Here: ≡ n. C r is called the binomial coefficient In this notation, the probability distribution can be written: Wn(r) = n. Cr pr(1 -p)n-r ≡ probability of finding r items of one type & n – r items of the other type. p = probability of a given item being of one type

Binomial Distribution Example Problem: A sample of n = 11 electric bulbs is drawn

Binomial Distribution Example Problem: A sample of n = 11 electric bulbs is drawn every day from those manufactured at a plant. The probabilities of getting defective bulbs are random and independent of previous results. The probability that a bulb is defective is p = 0. 04. 1. What is the probability of finding exactly three defective bulbs in a sample? (Probability that r = 3? ) 2. What is the probability of finding three or more defective bulbs in a sample? (Probability that r ≥ 3? )

Binomial Distribution, n = 11 No. of defective bulbs, r 0 Probability r(1 -p)n-r

Binomial Distribution, n = 11 No. of defective bulbs, r 0 Probability r(1 -p)n-r C p 11 r 11 C 0 (0. 04) 0(0. 96)11 = 0. 6382 1 1(0. 96)10 C (0. 04) 11 1 = 0. 2925 2 11 C 2 (0. 04) 2(0. 96)9 = 0. 0609 3 3(0. 96)8 C (0. 04) 11 3 = 0. 0076

Question 1: Probability of finding exactly three defective bulbs in a sample? P(r =

Question 1: Probability of finding exactly three defective bulbs in a sample? P(r = 3 defective bulbs) = W 11(r = 3) = 0. 0076 Question 2: Probability of finding three or more defective bulbs in a sample? P(r ≥ 3 defective bulbs ) = 1 - W 11(r = 0) – W 11(r = 1) – W 11(r = 2) = 1 – 0. 6382 - 0. 2925 – 0. 0609 = 0. 0084

Binomial Distribution, Same Problem, Larger r No. of defective bulbs Probability r n-r 11

Binomial Distribution, Same Problem, Larger r No. of defective bulbs Probability r n-r 11 Cr p (1 -p) 0 0(0. 96)11 = 0. 638239 C (0. 04) 11 0 1 1(0. 96)10 = 0. 292526 C (0. 04) 11 1 2 11 C 2 (0. 04) 2(0. 96)9 = 0. 060943 3 3(0. 96)8 C (0. 04) 11 3 = 0. 007618 4 4(0. 96)7 C (0. 04) 11 4 = 0. 000635 5 5(0. 96)6 C (0. 04) 11 5 = 0. 000037

Pr(Defective Items) Binomial Distribution Defective Items

Pr(Defective Items) Binomial Distribution Defective Items

Pr(Good Items) Binomial Distribution Good Items

Pr(Good Items) Binomial Distribution Good Items

Binomial distribution with n=10, p=0. 5

Binomial distribution with n=10, p=0. 5

“Wandering Photon” Animation found on the Internet!

“Wandering Photon” Animation found on the Internet!

The “Wandering Photon” Photon Walks straight for a random length Stops with probability g

The “Wandering Photon” Photon Walks straight for a random length Stops with probability g Turns in a random direction with probability (1 -g)

One Dimension

One Dimension

x After a random length x with probability g stop with probability (1 -g

x After a random length x with probability g stop with probability (1 -g )/2 continue in each direction

x

x

x

x

x

x

x

x

x

x

x

x

P(photon absorbed at x)? x pdf of the length of the first step 1/h

P(photon absorbed at x)? x pdf of the length of the first step 1/h is the average step length γ is the absorption probability

P(photon absorbed at x) = f (|x|, g, h) x pdf of the length

P(photon absorbed at x) = f (|x|, g, h) x pdf of the length of the first step 1/h is the average step length γ is the absorption probability

The sleepy drunk in higher dimensions

The sleepy drunk in higher dimensions

After a random length, with probability g stop with probability (1 -g ) pick

After a random length, with probability g stop with probability (1 -g ) pick a random direction

The sleepy drunk in higher dimensions r P(absorbed at r) = f (r, g,

The sleepy drunk in higher dimensions r P(absorbed at r) = f (r, g, h)