Experiments: Method and Methodology
Mícheál Ó Foghlú, Executive Director Research, TSSG, WIT
mofoghlu@tssg.org
March 2009

Revised Schedule
• Dates: Mon 12th Jan, Wed 14th Jan, Wed 21st Jan, Wed 28th Jan, Wed 4th Feb, Wed 11th Feb, Wed 25th Feb, Wed 4th Mar, Wed 11th Mar, Wed 18th Mar, Wed 25th Mar, Wed 1st Apr, Wed 22nd Apr
• Sessions 01-05 to be delivered by Mícheál Ó Foghlú
• Entries, in slide order: Thomas Magedanz - Guest Lecture on IMS [DONE]; Presentations [DONE]; IPv6 Summit (Dublin Castle) [DONE]; EMPTY; Session 01 [DONE]; EMPTY; Session 02 [DONE]; Session 03 [DONE]; EMPTY; Session 04 [Today]; Session 05
Copyright © Mícheál Ó Foghlú 2009

Schedule Detail
• 01 What is research?
  – Philosophy, Epistemology, Methodology and Method
• 02 How to write academically?
  – Some simple language rules
  – Some simple structure rules
• 03 What’s the big deal with plagiarism?
  – Bibliographies, references and citations, …
  – Doing it in Word
  – Doing it with other tools like LaTeX/BibTeX
• 04 Results - how to do experiments
  – Support tools: simulation, data analysis, …
• 05 Discussion

Structure
• Experimental Design (basics)
• Statistical Analysis (basics)

Experimental Design
How to conduct a valid experiment.
http://www.slideshare.net/mrmularella/experimental-design

A Good Experiment
• Tests one variable at a time. If more than one thing is tested at a time, it won’t be clear which variable caused the end result.
• Must be fair and unbiased. This means that the experimenter must not allow his or her opinions to influence the experiment.
• Does not allow any outside factors to affect the outcome of the experiment.

A Good Experiment
• Is valid. The experimental procedure must test your hypothesis to see if it is correct. If the procedure does not test your hypothesis, the experiment is not valid and the data will make no sense!
• Has repeated trials. Repeating the trials in the experiment will reduce the effect of experimental errors and give a more accurate conclusion.

Variables
• A variable is anything in an experiment that can change or vary.
• It is any factor that can have an effect on the outcome of the experiment.
• There are three main types of variables.

3 Kinds of Variables
Independent Variable (IV) – something that is intentionally changed by the scientist
  – What is tested
  – What is manipulated
  – Also called a “Manipulated Variable”
You can only change ONE variable in an experiment!

3 Kinds of Variables
Independent Variable (IV)
To determine the independent variable, ask yourself: “What is being changed?”
Finish this sentence: “I will change the _______.”

Independent Variable
Levels of the IV
• These are the different ways you will change the independent variable.
Example: Assume you are testing five brands of popcorn to see which has the most unpopped kernels.
• The IV would be the brand of popcorn.
• The five different brands would be the levels of the IV.

3 Kinds of Variables
Dependent Variable (DV) – something that might be affected by the change in the independent variable
  – What is observed and measured
  – The data collected during the investigation
  – Also called a “Responding Variable”

3 Kinds of Variables
Dependent Variable (DV)
To determine the dependent variable, ask yourself: “What will I measure and observe?”
Finish this sentence: “I will measure and observe ________.”

Dependent Variable
Operational Definition:
• Define exactly how the dependent variable will be measured.
Example: Assume your DV in an experiment is “plant growth.” How will you measure this? It could be:
• Height (cm), mass (g), number of leaves, etc.
• Be specific and include all necessary units!

3 Kinds of Variables
Controlled Variable (CV) – a variable that is not changed and is kept the same
  – Also called constants
  – Allows for a “fair test”
  – NOT the same as a “control”!
  – Any given experiment will have many controlled variables

3 Kinds of Variables
Controlled Variable (CV)
To determine the controlled variables, ask yourself: “What should not be allowed to change?”
Finish this sentence: “I will not allow the _______ to change.”

Control
• A group or individual in the experiment that does not receive the treatment being tested, but is used for comparison as a reference for what “normal” would be like.
• Not all experiments have a control (though all experiments have controlled variables).
Example: If you tested different pollutants to see their effect on plant growth, the control would receive only water.

Example
• Students of different ages were given the same jigsaw puzzle to put together.
• They were timed to see how long it took to finish the puzzle.

Identify the variables in this investigation!

What was the independent variable?
Ages of the students
  – Different ages were tested by the scientist

What was the dependent variable?
The time it took to put the puzzle together
  – The time was observed and measured by the scientist

What was a controlled variable?
Same puzzle
  – All of the participants were tested with the same puzzle.
  – It would not have been a fair test if some had an easy 30-piece puzzle and some had a harder 500-piece puzzle.

Another Example:
• An investigation was done with an electromagnetic system made from a battery and wire wrapped around a nail.
• Different sizes of nails were used.
• The number of paper clips the electromagnet could pick up was measured.

What are the variables in this investigation?

Independent variable: Sizes of nails
  – These were changed by the scientist.
  – They used different sizes of nails in their experiment to see what effect that would have.

Dependent variable: Number of paper clips picked up
  – The number of paper clips was observed and counted (measured).

Controlled variables: Battery, wire, type of nail
  – None of these items were changed.
  – They used the same battery, the same wire, and the same type of nail.
  – Changing any of these things would have made it an unfair test.

Here’s another:

• The temperature of water was measured at different depths of a pond.

• Independent variable – depth of the water
• Dependent variable – temperature
• Controlled variables – same pond; same thermometer

Last one:

• Students modified paper airplanes by cutting pieces off, adding tape, or adding paper clips to increase the distance thrown.

• Independent variable – weight of plane, center of gravity, or air resistance (depending on student choice, but only one was tested)
• Dependent variable – distance thrown
• Controlled variables – same plane design; same paper; same throwing technique

Now let’s take what we know about these variables and use them in an experiment!

We are going to test how many drops of water will fit on different sized coins. Let’s think about how we could test this.
  – Identify the variables
  – What exactly will be changed? How will it be changed?
  – What exactly will be measured? How will it be measured?

What are my variables?
• Independent variable – size of the coin (penny, nickel, dime, quarter)
• Dependent variable – amount of water held on coin (# of drops)
• Controlled variables
  – Same eye dropper
  – Same water
  – Same side of coin (pick heads or tails)
  – Same technique (height/angle of dropper)
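
The coin experiment can also be written down as a small record, which makes the three kinds of variables explicit and leaves room for repeated trials. The Python sketch below is only an illustration: the class name `ExperimentDesign`, its fields, and the drop counts are hypothetical placeholders, not measurements from the talk.

```python
from dataclasses import dataclass, field
from statistics import mean


@dataclass
class ExperimentDesign:
    """Hypothetical record of one experiment: a single IV, a DV, and the CVs."""
    independent_variable: str
    levels: list[str]                 # the levels of the IV
    dependent_variable: str           # include the unit of measurement
    controlled_variables: list[str]
    trials: dict[str, list[float]] = field(default_factory=dict)

    def record(self, level: str, value: float) -> None:
        """Store one repeated trial for a level of the independent variable."""
        if level not in self.levels:
            raise ValueError(f"unknown level: {level!r}")
        self.trials.setdefault(level, []).append(value)

    def mean_by_level(self) -> dict[str, float]:
        """Average the repeated trials recorded for each level."""
        return {level: mean(values) for level, values in self.trials.items()}


coin_test = ExperimentDesign(
    independent_variable="size of the coin",
    levels=["penny", "nickel", "dime", "quarter"],
    dependent_variable="water held on coin (# of drops)",
    controlled_variables=["eye dropper", "water", "side of coin", "technique"],
)

# Placeholder drop counts, invented purely to show the mechanics.
for drops in (20, 22, 24):
    coin_test.record("penny", drops)

print(coin_test.mean_by_level())
```

Keeping a single `independent_variable` field mirrors the rule that only one variable is changed, and rejecting unknown levels guards against recording a trial for a coin that is not part of the design.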

Statistical Analysis
http://www.slideshare.net/sababutt/statistical-analysis-of-datafinal-presentation

SIGNIFICANCE OF STATISTICS FOR ANALYSIS AND RESEARCH

STATISTICS IS NECESSARY FOR ALL FIELDS OF LIFE REQUIRING RESEARCH AND DATA ANALYSIS
In all fields of life we have to analyze facts and interpret them to draw conclusions. This analysis needs statistics to compare qualities and quantities and so reach a conclusion, which leads to decision making in business, government, industry, etc., and to the development of theories in science.

BIOSTATISTICS IS A DISCIPLINE THAT IS CONCERNED WITH:
• designing experiments and other data collection,
• summarizing information to aid understanding,
• drawing conclusions from data, and
• estimating the present or predicting the future.
In making predictions, Statistics uses the companion subject of Probability, which models chance mathematically and enables calculations of chance in complicated cases.

SOME IMPORTANT DEFINITIONS

POPULATION AND SAMPLE
POPULATION: A population consists of an entire set of objects, observations, or scores that have something in common. For example, a population might be defined as all males between the ages of 15 and 18.
SAMPLE: A sample is a subset of a population. Since it is usually impractical to test every member of a population, a sample from the population is typically the best approach available.

PARAMETER AND STATISTIC
PARAMETER: A parameter is a numerical quantity measuring some aspect of a population of scores. For example, the mean is a measure of central tendency in a population.
STATISTIC: A "statistic" is defined as a numerical quantity (such as the mean) calculated in a sample.

MEASURES OF CENTRAL TENDENCY
• Mean (Arithmetic Mean): average value of a sample or population
• Median: middle value of a sample or population
• Mode: the value repeated most

The Arithmetic Mean or Mean is what is commonly called the average. When the word "mean" is used without a modifier, it can be assumed that it refers to the arithmetic mean. The mean is the sum of all the scores divided by the number of scores.
The formula for the population mean is μ = ΣX/N, where μ = population mean, ΣX = sum of all the scores, and N = number of scores.
If the scores are from a sample, the symbol X̄ refers to the mean and n refers to the sample size, and the formula is written as X̄ = ΣX/n.
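
The formula ΣX/N translates directly into code. This is a minimal sketch; the function name `arithmetic_mean` and the example scores are assumptions made for illustration and do not come from the slides.

```python
def arithmetic_mean(scores):
    """Return the sum of all the scores divided by the number of scores."""
    scores = list(scores)
    if not scores:
        raise ValueError("the mean of an empty set of scores is undefined")
    return sum(scores) / len(scores)


# Hypothetical scores, chosen only to illustrate the formula.
print(arithmetic_mean([4, 8, 6]))
```

The same arithmetic serves for μ and for X̄; the only difference is whether the scores passed in are the whole population or a sample.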

Median: The median is the middle of a distribution: half the scores are above the median and half are below the median. The median is less sensitive to extreme scores than the mean, and this makes it a better measure than the mean for highly skewed distributions.
Example scores: 5, 3, 4, 2.5, 6 (median = 4 once the scores are sorted)
Mode: The mode is the most frequently occurring score in a distribution and is used as a measure of central tendency. The advantage of the mode as a measure of central tendency is that its meaning is obvious.
Example scores: 5, 3, 4, 5, 6 (mode = 5)
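
Both measures are available in Python's standard `statistics` module. The sketch below assumes that the example scores on the slide read 5, 3, 4, 2.5, 6 for the median and 5, 3, 4, 5, 6 for the mode; the variable names are illustrative.

```python
from statistics import median, mode

scores_for_median = [5, 3, 4, 2.5, 6]
scores_for_mode = [5, 3, 4, 5, 6]

# median() sorts the scores first: 2.5, 3, 4, 5, 6 -> the middle value is 4.
print(median(scores_for_median))

# mode() returns the most frequently occurring score; 5 appears twice.
print(mode(scores_for_mode))
```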

MEASURES OF DISPERSION
After measuring the central value, i.e., the mean, the next step is to find out to what extent this central value represents all the values, that is, to measure the scattering or dispersion of the data. Several measures quantify dispersion. The most important and widely used of these in research are:
• Variance
• Standard Deviation
• Standard Error of Mean

HYPOTHESIS TESTING
• T test
• F test
• ANOVA
• Correlation
• Regression

EXAMPLE OF DATA ANALYSIS
Comparison of the weight to height ratio, expressed by the Body Mass Index (BMI), of a population. BMI is calculated as weight in kg / (height in m)².
• General surveys in the USA and Europe showed that the young population is overweight, which increases the risk of disease.
• We surveyed the young female population of Punjab University for BMI, measuring the BMI of 400 randomly selected students.
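
The BMI calculation itself is a one-line formula. The sketch below is illustrative only: the function name `body_mass_index` and the example subject are assumptions and are not taken from the survey data.

```python
def body_mass_index(weight_kg, height_m):
    """BMI = weight in kilograms divided by the square of height in metres."""
    if weight_kg <= 0 or height_m <= 0:
        raise ValueError("weight and height must be positive")
    return weight_kg / height_m ** 2


# Hypothetical subject (not from the survey): 70 kg and 1.75 m tall.
print(round(body_mass_index(70, 1.75), 1))
```

Note that the height must be in metres, not centimetres; a height given in centimetres would need dividing by 100 first.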

[Table: Subject No., BMI]

ARITHMETIC MEAN
• We have two tables of data: one giving the BMI of girls, the other the BMI of boys. These are long data tables.
• Now we have to analyze the data to conclude something from it. What do we need?
• We need a measure of central tendency to indicate the average BMI, so that we can compare with other populations, between boys and girls, and with the normal range.
• The most common and useful measure for this purpose is the Arithmetic Mean, which is calculated by taking the sum of all values and dividing it by the number of observations.

SAMPLING ERROR
We now have an average value, but is this average really representative of all the values? Some values may be very large and some very small, in which case the mean does not represent the whole data set well. This is a source of sampling error: if the sample happens to include students with, say, a strong genetic tendency to be overweight, their values differ from those of the rest of the population and pull the sample mean away from the population mean. Our result is then misleading, i.e., our mean does not represent all the data.

EXAMPLE
We have four values: 2, 3, 4, 10
Mean = Sum of values / No. of observations = (2 + 3 + 4 + 10) / 4 = 4.75
This is far from three of the four values in the data, because of the one large value in the data, i.e., 10.
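
The effect of the single large value can be checked by comparing the mean with the median for the same four values. This is a small sketch using Python's standard `statistics` module; the variable name `values` is illustrative.

```python
from statistics import mean, median

values = [2, 3, 4, 10]

print(mean(values))    # 4.75, pulled upward by the single large value 10
print(median(values))  # 3.5, the midpoint of the two middle values 3 and 4
```

The median stays close to the bulk of the data, which is why it is preferred for skewed data.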

STANDARD DEVIATION
• Now we need a statistical measure that tells us how widely the values are spread, so that we can judge how far the mean can be trusted.
• This is the standard deviation: a measure of how the individual values vary from the average value, i.e., the mean.

Standard Deviation of that Data
SD = s = √( Σ(x − x̄)² / (n − 1) )

Descriptive Statistics from MINITAB
Variable   N   Mean   Median   StDev   SE Mean
C1         4   4.75   3.50     3.59    1.80
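
The figures in the MINITAB output can be reproduced from the four values 2, 3, 4, 10. The sketch below writes the sample standard deviation out from the formula (with n − 1 in the denominator) and derives the standard error of the mean as s/√n; the variable names are illustrative.

```python
from math import sqrt
from statistics import mean, stdev

values = [2, 3, 4, 10]
n = len(values)
x_bar = mean(values)

# Sample standard deviation written out from the formula, with n - 1 in the denominator.
s = sqrt(sum((x - x_bar) ** 2 for x in values) / (n - 1))

# Standard error of the mean: s divided by the square root of the sample size.
se_mean = s / sqrt(n)

print(round(s, 2), round(se_mean, 2))
print(round(stdev(values), 2))  # the library routine should agree with s
```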

T Test
Two Sample T-Test and Confidence Interval
Two sample T for BMI-F vs BMI-M

         N    Mean    StDev   SE Mean
BMI-F    30   31.35   6.26    1.1
BMI-M    21   26.96   4.11    0.90

95% CI for mu BMI-F - mu BMI-M: (1.5, 7.31)
T-Test mu BMI-F = mu BMI-M (vs not =): T = 3.02  P = 0.0040  DF = 48
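
The T and DF values can be recomputed from the summary statistics alone. The reported T = 3.02 and DF = 48 are consistent with an unpooled (Welch) two-sample test; that is an inference from the numbers, not something the slide states. The function name `welch_t` is an assumption, and the sketch does not compute the P value, which would need the t distribution.

```python
from math import sqrt


def welch_t(mean1, sd1, n1, mean2, sd2, n2):
    """Return (t, df) for a two-sample t-test that does not pool the variances."""
    v1 = sd1 ** 2 / n1   # squared standard error of the first mean
    v2 = sd2 ** 2 / n2   # squared standard error of the second mean
    t = (mean1 - mean2) / sqrt(v1 + v2)
    # Welch-Satterthwaite approximation to the degrees of freedom.
    df = (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))
    return t, df


# Summary statistics for BMI-F and BMI-M as given in the output.
t, df = welch_t(31.35, 6.26, 30, 26.96, 4.11, 21)
print(round(t, 2), int(df))
```

The approximate degrees of freedom come out as a non-integer just under 49, so truncating to a whole number gives the 48 shown in the output.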

Other Issues
• Covered
  – Basics of experimental design
  – Basics of statistical analysis
• Not covered - experimental design
  – Block structured design (e.g. Latin Squares)
  – Understanding experimental errors
• Not covered - statistical analysis
  – Understanding the T Test and the large battery of other tests (e.g. ANOVA)
  – Assumptions of tests (e.g. that observations are normally distributed) and when it is invalid to use a test
  – Discussion of significance
• So this talk just scratched the surface!