Introduction to Educational Statistics Joseph Stevens Ph D

  • Slides: 44
Download presentation
Introduction to Educational Statistics Joseph Stevens, Ph. D. , University of Oregon (541) 346

Introduction to Educational Statistics Joseph Stevens, Ph. D. , University of Oregon (541) 346 -2445, stevensj@uoregon. edu

WHAT IS STATISTICS? Statistics is a group of methods used to collect, analyze, present,

WHAT IS STATISTICS? Statistics is a group of methods used to collect, analyze, present, and interpret data and to make decisions.

POPULATION VERSUS SAMPLE A population consists of all elements – individuals, items, or objects

POPULATION VERSUS SAMPLE A population consists of all elements – individuals, items, or objects – whose characteristics are being studied. The population that is being studied is also called the target population.

POPULATION VERSUS SAMPLE cont. The portion of the population selected for study is referred

POPULATION VERSUS SAMPLE cont. The portion of the population selected for study is referred to as a sample.

POPULATION VERSUS SAMPLE cont. A study that includes every member of the population is

POPULATION VERSUS SAMPLE cont. A study that includes every member of the population is called a census. The technique of collecting information from a portion of the population is called sampling.

POPULATION VERSUS SAMPLE cont. A sample drawn in such a way that each element

POPULATION VERSUS SAMPLE cont. A sample drawn in such a way that each element of the population has an equal chance of being selected is called a simple random sample.

TYPES OF STATISTICS Descriptive Statistics consists of methods for organizing, displaying, and describing data

TYPES OF STATISTICS Descriptive Statistics consists of methods for organizing, displaying, and describing data by using tables, graphs, and summary measures.

TYPES OF STATISTICS Inferential Statistics consists of methods that use information from samples to

TYPES OF STATISTICS Inferential Statistics consists of methods that use information from samples to make predictions, decisions or inferences about a population.

Basic Definitions A variable is a characteristic under study that assumes different values for

Basic Definitions A variable is a characteristic under study that assumes different values for different elements. A variable on which everyone has the same exact value is a constant.

Basic Definitions The value of a variable for an element is called an observation

Basic Definitions The value of a variable for an element is called an observation or measurement.

Basic Definitions A data set is a collection of observations on one or more

Basic Definitions A data set is a collection of observations on one or more variables. A distribution is a collection of observations or measurements on a particular variable.

TYPES OF VARIABLES n Quantitative Variables Discrete Variables q Continuous Variables q n Qualitative

TYPES OF VARIABLES n Quantitative Variables Discrete Variables q Continuous Variables q n Qualitative or Categorical Variables

Quantitative Variables cont. A variable whose values are countable is called a discrete variable.

Quantitative Variables cont. A variable whose values are countable is called a discrete variable. In other words, a discrete variable can assume only a limited number of values with no intermediate values.

Quantitative Variables cont. A variable that can assume any numerical value over a certain

Quantitative Variables cont. A variable that can assume any numerical value over a certain interval or intervals is called a continuous variable.

Categorical Variables A variable that cannot assume a numerical value but can be classified

Categorical Variables A variable that cannot assume a numerical value but can be classified into two or more categories is called a categorical variable.

Scales of Measurement n n n How much information is contained in the numbers?

Scales of Measurement n n n How much information is contained in the numbers? Operational Definitions and measurement procedures Types of Scales q q Nominal Ordinal Interval Ratio

Descriptive Statistics Variables can be summarized and displayed using: Tables q Graphs and figures

Descriptive Statistics Variables can be summarized and displayed using: Tables q Graphs and figures q Statistical summaries: q n n n Measures of Central Tendency Measures of Dispersion Measures of Skew and Kurtosis

Measures of Central Tendency Mode – The most frequent score in a distribution n

Measures of Central Tendency Mode – The most frequent score in a distribution n Median – The score that divides the distribution into two groups of equal size n Mean – The center of gravity or balance point of the distribution n

Median The calculation of the median consists of the following two steps: n Rank

Median The calculation of the median consists of the following two steps: n Rank the data set in increasing order n Find the middle number in the data set such that half of the scores are above and half below. The value of this middle number is the median.

Arithmetic Mean The mean is obtained by dividing the sum of all values by

Arithmetic Mean The mean is obtained by dividing the sum of all values by the number of values in the data set. Mean for sample data:

Example: Calculation of the mean Four scores: 82, 95, 67, 92

Example: Calculation of the mean Four scores: 82, 95, 67, 92

The Mean is the Center of Gravity 82 67 92 95

The Mean is the Center of Gravity 82 67 92 95

The Mean is the Center of Gravity X 82 95 67 92 (X –

The Mean is the Center of Gravity X 82 95 67 92 (X – X) 82 – 84 = -2 95 – 84 = +11 67 – 84 = -17 92 – 84 = +8 ∑(X – X) = 0

Comparison of Measures of Central Tendency

Comparison of Measures of Central Tendency

Measures of Dispersion Range n Variance n Standard Deviation n

Measures of Dispersion Range n Variance n Standard Deviation n

Range Highest value in the distribution minus the lowest value in the distribution +

Range Highest value in the distribution minus the lowest value in the distribution + 1

Variance n Measure of how different scores are on average in squared units: ∑(X

Variance n Measure of how different scores are on average in squared units: ∑(X – X)2 / N

Standard Deviation n Returns variance to original scale units Square root of variance =

Standard Deviation n Returns variance to original scale units Square root of variance = sd

Other Descriptors of Distributions n Skew – how symmetrical is the distribution n Kurtosis

Other Descriptors of Distributions n Skew – how symmetrical is the distribution n Kurtosis – how flat or peaked is the distribution

Kinds of Distributions Uniform n Skewed n Bell-shaped or Normal n Ogive or S-shaped

Kinds of Distributions Uniform n Skewed n Bell-shaped or Normal n Ogive or S-shaped n

Normal distribution with mean μ and standard deviation σ Standard deviation = Mean =

Normal distribution with mean μ and standard deviation σ Standard deviation = Mean = μ σ x

Total area under a normal curve. The shaded area is 1. 0 or 100%

Total area under a normal curve. The shaded area is 1. 0 or 100% μ x

A normal curve is symmetric about the mean Each of the two shaded areas

A normal curve is symmetric about the mean Each of the two shaded areas is. 5 or 50% . 5 μ x

Areas of the normal curve beyond μ ± 3σ. Each of the two shaded

Areas of the normal curve beyond μ ± 3σ. Each of the two shaded areas is very close to zero μ – 3σ μ μ + 3σ x

Three normal distribution curves with the same mean but different standard deviations σ=5 σ

Three normal distribution curves with the same mean but different standard deviations σ=5 σ = 10 σ = 16 μ = 50 x

Three normal distributions with different means but the same standard deviation σ=5 µ =

Three normal distributions with different means but the same standard deviation σ=5 µ = 20 σ=5 µ = 30 µ = 40 x

Areas under a normal curve For a normal distribution approximately 1. 68% of the

Areas under a normal curve For a normal distribution approximately 1. 68% of the observations lie within one standard deviation of the mean 2. 95% of the observations lie within two standard deviations of the mean 3. 99. 7% of the observations lie within three standard deviations of the mean

Score Scales Raw Scores n Percentile Ranks n Grade Equivalents (GE) n Standard Scores

Score Scales Raw Scores n Percentile Ranks n Grade Equivalents (GE) n Standard Scores n Normal Curve Equivalents (NCE) q Z-scores q T-scores q College Board Scores q

 Converting an X Value to a z Value For a normal random variable

Converting an X Value to a z Value For a normal random variable X, a particular value of x can be converted to its corresponding z value by using the formula where μ and σ are the mean and standard deviation of the normal distribution of x, respectively.

The Logic of Inferential Statistics Population: the entire universe of individuals we are interested

The Logic of Inferential Statistics Population: the entire universe of individuals we are interested in studying n Sample: the selected subgroup that is actually observed and measured (with sample size N) n Sampling Distribution of a Statistic: a distribution of samples like ours n

The Three Distributions Used in Inferential Statistics I. Population III. Sampling Distribution of the

The Three Distributions Used in Inferential Statistics I. Population III. Sampling Distribution of the Statistic II. Sample