Industrial Automation Industrielle Automation 9 6 Safety analysis

  • Slides: 29
Download presentation
Industrial Automation Industrielle Automation 9. 6 Safety analysis and standards Analyse de sécurité et

Industrial Automation Industrielle Automation 9. 6 Safety analysis and standards Analyse de sécurité et normes Sicherheitsanalyse und Normen Dr. B. Eschermann ABB Research Center, Baden, Switzerland 2004 June BE

Overview Dependability Analysis 9. 6. 1 Qualitative Evaluation – Failure Mode and Effects Analysis

Overview Dependability Analysis 9. 6. 1 Qualitative Evaluation – Failure Mode and Effects Analysis (FMEA) – Fault Tree Analysis (FTA) – Example: Differential pressure transmitter 9. 6. 2 Quantitative Evaluation – Combinational Evaluation – Markov Chains – Example: Bus-bar Protection 9. 6. 3 Dependability Standards and Certification – Standardization Agencies – Standards EPFL - Industrial Automation 2004 June BE 2 9. 6 Dependability Analysis

Failure Mode and Effects Analysis (FMEA) Analysis method to identify component failures which have

Failure Mode and Effects Analysis (FMEA) Analysis method to identify component failures which have significant consequences affecting the system operation in the application considered. ® identify faults (component failures) that lead to system failures. effect on system ? component 1 failure mode 1 • • • component n failure mode 1 failure mode k • • • failure mode k FMEA is inductive (bottom-up). EPFL - Industrial Automation 2004 June BE 3 9. 6 Dependability Analysis

FMEA: Coffee machine example component failure mode water tank empty too full no coffee

FMEA: Coffee machine example component failure mode water tank empty too full no coffee produced electronics damaged coffee bean container too full effect on system empty no coffee produced coffee mill gets stuck coffee grounds container too full coffee grounds spilled ……… EPFL - Industrial Automation 2004 June BE 4 9. 6 Dependability Analysis

FMEA: Purpose (overall) There are different reasons why an FMEA can be performed: –

FMEA: Purpose (overall) There are different reasons why an FMEA can be performed: – Evaluation of effects and sequences of events caused by each identified item failure mode (® get to know the system better) – Determination of the significance or criticality of each failure mode as to the system’s correct function or performance and the impact on the availability and/or safety of the related process (® identify weak spots) – Classification of identified failure modes according to their detectability, diagnosability, testability, item replaceability and operating provisions (tests, repair, maintenance, logistics etc. ) (® take the necessary precautions) – Estimation of measures of the significance and probability of failure (® demonstrate level of availability/safety to user or certification agency) EPFL - Industrial Automation 2004 June BE 5 9. 6 Dependability Analysis

FMEA: Critical decisions Depending on the exact purpose of the analysis, several decisions have

FMEA: Critical decisions Depending on the exact purpose of the analysis, several decisions have to be made: – For what purpose is it performed (find weak spots « demonstrate safety to certification agency, demonstrate safety « compute availability) – When is the analysis performed (e. g. before « after detailed design)? – What is the system (highest level considered), where are the boundaries to the external world (that is assumed fault-free)? – Which components are analyzed (lowest level considered)? – Which failure modes are considered (electrical, mechanical, hydraulic, design faults, human/operation errors)? – Are secondary and higher-order effects considered (i. e. one fault causing a second fault which then causes a system failure etc. )? – By whom is the analysis performed (designer, who knows system best « third party, which is unbiased and brings in an independent view)? EPFL - Industrial Automation 2004 June BE 6 9. 6 Dependability Analysis

FMEA and FMECA FMEA only provides qualitative analysis (cause effect chain). FMECA (failure mode,

FMEA and FMECA FMEA only provides qualitative analysis (cause effect chain). FMECA (failure mode, effects and criticality analysis) also provides (limited) quantitative information. – each basic failure mode is assigned a failure probability and a failure criticality – if based on the result of the FMECA the system is to be improved (to make it more dependable) the failure modes with the highest probability leading to failures with the highest criticality are considered first. Coffee machine example: – If the coffee machine is damaged, this is more critical than if the coffee machine is OK and no coffee can be produced temporarily – If the water has to be refilled every 20 cups and the coffee has to be refilled every 2 cups, the failure mode “coffee bean container too full” is more probable than “water tank too full”. EPFL - Industrial Automation 2004 June BE 7 9. 6 Dependability Analysis

Criticality Grid Criticality levels I II IV very low EPFL - Industrial Automation medium

Criticality Grid Criticality levels I II IV very low EPFL - Industrial Automation medium low 2004 June BE 8 high Probability of failure 9. 6 Dependability Analysis

Failure Criticalities IV: Any event which could potentially cause the loss of primary system

Failure Criticalities IV: Any event which could potentially cause the loss of primary system function(s) resulting in significant damage to the system or its environment and causes the loss of life III: Any event which could potentially cause the loss of primary system function(s) resulting in significant damage to the system or its environment and negligible hazards to life II: Any event which degrades system performance function(s) without appreciable damage to either system, environment or lives I: Any event which could cause degradation of system performance function(s) resulting in negligible damage to either system or environment and no damage to life EPFL - Industrial Automation 2004 June BE 9 9. 6 Dependability Analysis

FMEA/FMECA: Result Depending on the result of the FMEA/FMECA, it may be necessary to:

FMEA/FMECA: Result Depending on the result of the FMEA/FMECA, it may be necessary to: – change design, introduce redundancy, reconfiguration, recovery etc. – introduce tests, diagnoses, preventive maintenance – focus quality assurance, inspections etc. on key areas – select alternative materials, components – change operating conditions (e. g. duty cycles to anticipate/avoid wear-out failures) – adapt operating procedures (allowed temperature range etc. ) – perform design reviews – monitor problem areas during testing, check-out and use – exclude liability for identified problem areas EPFL - Industrial Automation 2004 June BE 10 9. 6 Dependability Analysis

FMEA: Steps (1) 1) Break down the system into components. 2) Identify the functional

FMEA: Steps (1) 1) Break down the system into components. 2) Identify the functional structure of the system and how the components contribute to functions. f 1 EPFL - Industrial Automation f 2 2004 June BE f 3 f 5 f 4 11 f 6 f 7 9. 6 Dependability Analysis

FMEA: Steps (2) 3) Define failure modes of each component – new components: refer

FMEA: Steps (2) 3) Define failure modes of each component – new components: refer to similar already used components – commonly used components: base on experience and measurements – complex components: break down in subcomponents and derive failure mode of component by FMEA on known subcomponents – other: use common sense, deduce possible failures from functions and physical parameters typical of the component operation 4) Perform analysis for each failure mode of each component and record results in table: component function name/ID EPFL - Industrial Automation failure mode 2004 June BE failure cause failure effect local global 12 failure other remark detection provision 9. 6 Dependability Analysis

Example (Generic) Failure Modes - fails to remain (in position) - false actuation -

Example (Generic) Failure Modes - fails to remain (in position) - false actuation - fails to open - fails to stop - fails to close - fails to start - fails if open - fails to switch - fails if closed - erroneous input (increased) - restricted flow - erroneous input (decreased) - fails out of tolerance (high) - erroneous output (increased) - fails out of tolerance (low) - erroneous output (decreased) - inadvertent operation - loss of input - intermittent operation - loss of output - premature operation - erroneous indication - delayed operation - leakage EPFL - Industrial Automation 2004 June BE 13 9. 6 Dependability Analysis

Other FMEA Table Entries Failure cause: Why is it that the component fails in

Other FMEA Table Entries Failure cause: Why is it that the component fails in this specific way? To identify failure causes is important to - estimate probability of occurrence - uncover secondary effects - devise corrective actions Local failure effect: Effect on the system element under consideration (e. g. on the output of the analyzed component). In certain instances there may not be a local effect beyond the failure mode itself. Global failure effect: Effect on the highest considered system level. The end effect might be the result of multiple failures occurring as a consequence of each other. Failure detection: Methods to detect the component failure that should be used. Other provisions: Design features might be introduced that prevent or reduce the effect of the failure mode (e. g. redundancy, alarm devices, operating restrictions). EPFL - Industrial Automation 2004 June BE 14 9. 6 Dependability Analysis

Common Mode Failures (CMF) In FMEA all failures are analyzed independent of each other.

Common Mode Failures (CMF) In FMEA all failures are analyzed independent of each other. Common mode failures are related failures that can occur due to a single source such as design error, wrong operation conditions, human error etc. no problem failure mode x & common source failure mode y serious consequence no problem Example: Failure of power supply common to redundant units causes both redundant units to fail at the same time. EPFL - Industrial Automation 2004 June BE 15 9. 6 Dependability Analysis

Example: Differential Pressure Transmitter (1) Functionality: Measure difference in pressures p 1 – p

Example: Differential Pressure Transmitter (1) Functionality: Measure difference in pressures p 1 – p 2. diaphragm pressure p 2 pressure p 1 coil with inductivity L 2 iron core coil with inductivity L 1 i 2(t) i 1(t) u 2(t) u 1(t) p 1 – p 2 = f 1 (inductivity L 1, temperature T, static pressure p) p 1 – p 2 = f 2 (inductivity L 2, temperature T, static pressure p) EPFL - Industrial Automation 2004 June BE 16 9. 6 Dependability Analysis

Example: Differential Pressure Transmitter (2) acquisition of sensor inputs p 1 ® L 1

Example: Differential Pressure Transmitter (2) acquisition of sensor inputs p 1 ® L 1 sensor data preparation different failure effects p 2 ® L 2 sensor data processing output data generation proces- sing 1 watch- dog proces- sing 2 = pstatic checking (limits, consis- tency) Tempsens Tempelec safe output (e. g. upscale) = A/D conversion power supply EPFL - Industrial Automation controlled current generator 2004 June BE output current generator 4. . 20 m. A 17 9. 6 Dependability Analysis

FMEA for Pressure Transmitter ID- Nr Fun c t i o n Fai l

FMEA for Pressure Transmitter ID- Nr Fun c t i o n Fai l ure Mo de Lo c al Ef f e c t De t e c t i o n Me c h an i s m Fai l ure Han dl i n g Gl o b al Ef f e c t Co mme n t s 1. 1. 1 out of failsafe accuracy range pressure input via L 1 wrong limit check and consistency check (comparison with p 2) in software of sensor data processing go to safe state output driven to up/downscale diaphragm failure (both p 1 and p 2 wrong) detected by comparison with pstatic, requires that separate sensor is used for pstatic wrong but pressure input via within fail- L 1 slightly wrong safe accuracy range consistency check (comp. with p 2), detection of small failures not guaranteed (allowed difference p 1 p 2) not applicable (n/a) output value slightly wrong, but within failsafe accuracy range out of failsafe accuracy range limit check and consistency check (comparison with p 1) in software of sensor data processing go to safe state output driven to up/downscale consistency check (comp. with p 1), detection of small failures not guaranteed (allowed difference p 1 p 2) n/a output value slightly wrong, but within failsafe accuracy range p 1 measurement 1. 1. 2. 1 1. 2. 2 p 2 measurement pressure input via L 2 wrong but pressure input via within fail- L 2 slightly wrong safe accuracy range continue on your own. . . EPFL - Industrial Automation 2004 June BE 18 9. 6 Dependability Analysis

Fault Tree Analysis (FTA) In contrast to FMEA (which is inductive, bottom-up), FTA is

Fault Tree Analysis (FTA) In contrast to FMEA (which is inductive, bottom-up), FTA is deductive (top-down). FTA FMEA system state to avoid failures of system possible causes of the state failure modes of components The main problem with both FMEA and FTA is to not forget anything important. Doing both FMEA and FTA may help to become more complete (2 different views). EPFL - Industrial Automation 2004 June BE 19 9. 6 Dependability Analysis

Example Fault Tree Analysis coffee machine doesn’t work ³ 1 water tank empty &

Example Fault Tree Analysis coffee machine doesn’t work ³ 1 water tank empty & EPFL - Industrial Automation no coffee beans undeveloped event: analyzed elsewhere 2004 June BE 20 power switch off basic event: not further developed 9. 6 Dependability Analysis

Example: Protection System tripping algorithm 1 inputs & trip signal underfunctions increased Putot =

Example: Protection System tripping algorithm 1 inputs & trip signal underfunctions increased Putot = 2 Pu - Pu 2 tripping algorithm 1 inputs comparison trip signal & 2004 June BE dynamic modeling necessary repair tripping algorithm 2 EPFL - Industrial Automation overfunctions reduced Potot = Po 2 21 9. 6 Dependability Analysis

FTA: IEC Standard defines basic principles of FTA provides required steps for analysis identifies

FTA: IEC Standard defines basic principles of FTA provides required steps for analysis identifies appropriate assumptions, events and failure modes provides identification rules and symbols EPFL - Industrial Automation 2004 June BE 22 9. 6 Dependability Analysis

Markov Model l 2(1 -c) (l 1+l 2)(1 -c) latent underfunction 2 chains, n.

Markov Model l 2(1 -c) (l 1+l 2)(1 -c) latent underfunction 2 chains, n. detectable latent overfunction 1 chain, n. detectable (l 1+l 2)c+l 3 s 1+l 1(1 -c) (l 1+l 2+l 3)c OK m detectable error 1 chain, repair l 1(1 -c) s 2 l 1+l 2+l 3 c l 3(1 -c) s 2 latent underfunction not detectable l 1=0. 01, l 2=l 3=0. 025, s 1=5, s 2=1, m=365, EPFL - Industrial Automation 2004 June BE overfunction 23 underfunction c=0. 9 [1/ Y] 9. 6 Dependability Analysis

Analysis Results mean time to underfunction [Y] 400 permanent comparison (SW) assumption: SW error-free

Analysis Results mean time to underfunction [Y] 400 permanent comparison (SW) assumption: SW error-free weekly test 300 permanent comparison (red. HW) 200 2 -yearly test 50 EPFL - Industrial Automation 5000 500 2004 June BE 24 mean time to overfunction [Y] 9. 6 Dependability Analysis

Example: IEC 61508 Generic standard for safety-related systems. Specifies 4 safety integrity levels, or

Example: IEC 61508 Generic standard for safety-related systems. Specifies 4 safety integrity levels, or SILs (with specified max. failure rates): safety integrity level control systems [per hour] protection systems [per operation] 4 ³ 10 -9 to < 10 -8 ³ 10 -5 to < 10 -4 3 ³ 10 -8 to < 10 -7 ³ 10 -4 to < 10 -3 2 ³ 10 -7 to < 10 -6 ³ 10 -3 to < 10 -2 1 ³ 10 -6 to < 10 -5 ³ 10 -2 to < 10 -1 For each of the safety integrity levels it specifies requirements (see copy out of standard). EPFL - Industrial Automation 2004 June BE 25 9. 6 Dependability Analysis

Cradle-to-grave reliability (IEC 61508) 1 concept 2 overall scope definition 3 hazard and risk

Cradle-to-grave reliability (IEC 61508) 1 concept 2 overall scope definition 3 hazard and risk analysis 4 overall safety requirements 5 safety requirements allocation overall planning 6 overall operation and maintenance planning 7 overall safety validation planning 8 EPFL - Industrial Automation overall installation and commissioning planning 2004 June BE 9 safety-related systems: E/E/PES realisation 12 overall installation and commissioning 13 overall safety validation 14 overall operation, maintenance and repair 16 decommissioning and disposal 26 10 safety-related systems: other technology realisation 15 11 external risk reduction facilities realisation overall modifications and retrofit 9. 6 Dependability Analysis

IEC 61580 EPFL - Industrial Automation 2004 June BE 27 9. 6 Dependability Analysis

IEC 61580 EPFL - Industrial Automation 2004 June BE 27 9. 6 Dependability Analysis

Software safety integrity and the development lifecycle (V-model) EPFL - Industrial Automation 2004 June

Software safety integrity and the development lifecycle (V-model) EPFL - Industrial Automation 2004 June BE 28 9. 6 Dependability Analysis

EPFL - Industrial Automation 2004 June BE 29 9. 6 Dependability Analysis

EPFL - Industrial Automation 2004 June BE 29 9. 6 Dependability Analysis