Discrimination and Classification Discrimination Classification Discrimination Goal is













- Slides: 13

Discrimination and Classification

Discrimination & Classification • Discrimination: Goal is to separate individual cases into known population groups by their measurements on several variables. § Makes use of graphical and algebraic methods. § Goal is to separate groups as much as possible based on numeric values § Referred to as “Separation” • Classification: Observing new cases along with numeric values and assigning them to groups based on their numeric values § Makes use of an algorithm generated on known cases and applies it to new cases whose population is unknown. § Referred to as “Allocation”

Notation and Concepts • Notation: Populations ≡ p 1, p 2 Measured Variables: X • Conceptual Settings of Unknown Population: § Incomplete Knowledge of Outcome: The outcome is in future and cannot be observed when X is measured § Destruction Necessary to Observe Outcome: A product must be destroyed to observe quality status. § Unavailable or Expensive Assessments of Outcome: Authorship unknown or assessment by expensive gold standard may be needed

Setting up a Discriminant Function • Prior Probabilities for the 2 Populations – Assumes knowledge of relative population sizes. Will tend to classify individual cases into the “larger” population unless strong evidence in favor of “smaller” population. • Misclassification Cost – Is cost of misclassification same for objects from each of the populations? • Probability Density Functions – The distributions of the numeric variables for the elements of the 2 populations. Population 1: f 1(x) Population 2: f 2(x) • Classification Regions – Given an observations’ x values, it will be assigned to Population 1 or 2, R 1 ≡ {x} s. t. an observation is classified to Population 1, R 2 ≡ W – R 1 is the set of x where it is classified to Population 2

Mathematical Notation

Regions that Minimize Expected Cost of Misclassification

Allocation of New Observation x 0 to Population

Normal Populations with Equal S

Sample Based Discrimination

Fisher’s Method for 2 Populations

Classification of Multivariate Normal Populations when S 1 ≠ S 2

Evaluation of Classification Functions

Jacknife Cross-Validation (Lauchenbruch’s Holdout Method) • For Population 1, remove each observation 1 -at-time and fit the classifier based on all (n 1 -1)+n 2 remaining cases. Classify the hold-out case. • Repeat for all n 1 cases from Population 1. n 1 m(H) ≡ # misclassified as p 2 • Repeat for all n 2 cases from Population 2. n 2 m(H) ≡ # misclassified as p 1
Discrimination vs classification
What is the goal of evolutionary classification
Usually negative, _____ is an attitude or prejudgment.
Power of discrimination definition
Chapter 16 section 3 segregation and discrimination
Chapter 8 section 3 segregation and discrimination
Gordon allport's scale of prejudice and discrimination
In what regions did the literacy test exist
Government chapter 21 diversity and discrimination
Diversity and discrimination
Merton's typology of prejudice and discrimination
Jobs for stutterers
Brightness adaptation and discrimination
Prejudice