Safety Integrity and Random Failures

  • Chapter
  • First Online:
Systems, Functions and Safety
  • 527 Accesses

Abstract

Safety integrity is the property of a system that provides safety-relevant operations, describing their resilience to dangerous failures. In functional safety, each safety function that is defined needs to comply with the safety integrity requirements, which are based on the safety integrity level allocated to the safety function following the hazard analysis and risk assessment. This chapter lays out typical safety integrity requirements, concerning two big groups: safety integrity against random failures and safety integrity against systematic failures. Safety integrity against random failures is then thoroughly introduced, including the definition of all relevant metrics for quantification, such as failure probability, reliability, failure rate, mean time to failure (MTTF), etc.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now
Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Author information

Authors and Affiliations

Authors

Appendices

Exercise 7

One of the safety functions prescribed for your system needs a relay switch as an actuator to cut off the power and bring your system into the safe state. Your company evaluated the relay switch component for reliability, performing the accelerated life testing (ALT) on the sample of 10,000 switches over the course of 3 weeks. It is expected that the switch is activated once per hour during the system runtime, being that the function of the system for which the safety function is prescribed executes in the on-demand mode, once per hour. During ALT, the test is performed by activating the relay switch once every 10 seconds, and logging the total number of failed switches every 10,000 cycles, as shown in the ALT log.

Cycles tested

Switches failed

10,000

2156

20,000

1134

30,000

1108

40,000

543

50,000

22

60,000

11

70,000

33

80,000

14

90,000

20

100,000

31

110,000

41

120,000

39

130,000

352

140,000

1450

150,000

2113

<160,000

933

Based on the ALT results, define a burn-in testing interval using a bathtub curve principle, and estimate the constant failure rate of the relay switch for the release, as well as its MTTF, approximating it with an e-system. Evaluate whether the switch is suitable for the application requiring automotive safety integrity level (ASIL) A of 1000 FIT, and also for ASIL C of 100 FIT. Calculate the reliability and failure probability of the relay switch (after release, approximated as e-system) after the runtime of 100,000 h in the previously described use case.

Your Tasks for the Exercise

  • Perform all calculations using the Excel sheet provided as a supplement.

  • Calculate the failure rate upon each testing interval; make sure to convert the units properly according to the safety function on-demand frequency.

  • Plot the failure rate curve and notice the DFR/CFR and IFR zones as in the bathtub curve.

  • Estimate the constant failure rate by averaging failure rates from the CFR zone.

  • Calculate MTTF.

  • Compare the obtained failure rate with the values required for ASIL A and ASIL C and discuss.

  • Calculate the failure probability and reliability of the relay switch after the runtime of 100,000 h.

To-Do List

  • Perform the exercise individually or with your peers. One can share the screen and keep notes, all contribute.

  • Compare the results you obtained with your peers or your instructor.

  • Discuss your solution and share it with others.

Note: Digital files for this exercise are available at sfs7.ex.nit-institute.com

Exercise 7 Solution

The failure rate per each inspection interval of 10,000 cycles can be calculated by dividing the number of failed units from column 3 by the number of surviving units from column 2. To calculate the failure rate in proper units (per hour), considering that one cycle happens each hour, we need to divide the obtained failure rate from column 4 by 10,000 to get the results shown in column 5.

A table has 5 columns and 17 rows. The columns labels are number of cycles, surviving, failed, failure rate, and failure rate per hour.

If we now plot the obtained failure rates, we get a graph similar to the bathtub curve:

A line graph of failure rate versus time. The resultant bathtub curve starts with a decreasing failure rate, becomes constant with no failure between 5 to 12 hours, then rises steeply and ends at a failure rate of 0.0001 at 16 hours.

Finally, we can approximate the constant failure rate by averaging the middle part of the bathtub curve. Comparing that with prescribed failure rate targets for ASIL levels, we conclude the potential compliance with ASIL A. From the failure rate, by using the aforementioned formulae, we can easily get the MTTF, as well as reliability and failure probability after 100,000 h of operation using the approximated constant failure rate.

A table of 3 columns and 9 rows. It contains the data for constant failure rate, ASIL A and C, M T T F in years, blank row, reliability and failure probability at 100, 000 hours, blank row, and replacement after approximately 10 years.

This solution is also available as a digital spreadsheet at sfs7.ex.nit-institute.com.

Key Recap Questions

Considering the project you finalized within Chapter 6, now think about the safety functions defined and:

  • Recall their SIL requirements.

  • On top of which unit they are executed?

  • What kind of safety integrity requirements need to be addressed?

  • Try to quantify random failures of the hardware.

  • How the probability of failure is expressed?

  • What is the MTTF of the safety-related system?

  • How the obtained value is plugged back into the SIL requirement?

Self-assessment

Now take the time to self-assess your knowledge by taking the quiz below. Each listed statement is either correct or incorrect. Please mark your answer and then check in the key at the end of the book.

  1. 1.

    Safety integrity level is always defined for a complete system.

  2. 2.

    To verify the compliance of a safety function with the prescribed safety integrity level, it is enough to assess random dangerous failures of that function and compare them with the prescribed values.

  3. 3.

    The level of resistance against systematic failures can be quantified by using reliability theory (e.g., failure rate, MTTF).

  4. 4.

    Failure probability of a system is a probability of a system failing at exactly the time t.

  5. 5.

    If the reliability of the system after 1 year in operation is 0.9, we can expect that out of ten deployed systems, nine are still working.

  6. 6.

    The failure rate for a system is always a constant value.

  7. 7.

    System A is always more reliable than System B in the case the failure rate of System A at MTTF, ha(MTTFa), is lower than the failure rate of System B at MTTF, hb(MTTFb).

  8. 8.

    E-systems are always defined by a constant failure rate, which can be used to assess the safety integrity of the system against the level prescribed by the appropriate safety standard.

  9. 9.

    Reliability at MTTF is always the same for any e-system.

  10. 10.

    The manufacturer provides a resistor documented by its constant failure rate λ = 5 FIT. The failure probability of this resistor after the system runtime does not depend on the use case or system configuration and is always the same.

Self-assessment Key

  1. 1.

    False

  2. 2.

    False

  3. 3.

    False

  4. 4.

    False

  5. 5.

    True

  6. 6.

    False

  7. 7.

    False

  8. 8.

    True

  9. 9.

    True

  10. 10.

    False

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Bjelica, M.Z. (2023). Safety Integrity and Random Failures. In: Systems, Functions and Safety. Springer, Cham. https://doi.org/10.1007/978-3-031-15823-0_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-15823-0_7

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-15822-3

  • Online ISBN: 978-3-031-15823-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Navigation