JSEA  Vol.3 No.5 , May 2010
Experiences Analyzing Faults in a Hybrid Distributed System with Access Only to Sanitized Data
Abstract: In this paper we report on a work in progress assessing the faults observed and reported in a distributed, safety-critical, largely embedded system with both electrical and mechanical components. We illustrate why standard software testing techniques are not sufficient and indicate some of the technical and non-technical problems encountered in examining the faults and the initial results obtained. While the application domain is elevator operation, the techniques described here are general enough to apply to many other domains. Much of the data analyzed here would be considered imprecise in the software industry if it were used in software testing or to help increase fault tolerance. The paper includes a discussion of the use of multiple views of data, assessment of missing data, and analysis of informal information to produce its conclusions about fault avoidance and fault tolerance.
Cite this paper: nullR. Leach, "Experiences Analyzing Faults in a Hybrid Distributed System with Access Only to Sanitized Data," Journal of Software Engineering and Applications, Vol. 3 No. 5, 2010, pp. 446-454. doi: 10.4236/jsea.2010.35050.

[1]   Unnamed elevator company, Unnamed Service Database, 2008.

[2]   A. Avizienis and J. P. Kelly, “Fault Tolerance by Design Diversity: Concepts and Experiments,” IEEE Computer, Vol. 17, No. 8, August 1984, pp. 67-80.

[3]   B. Randell, “System Structure for Software Fault Tole- rance,” IEEE Transactions on Software Engineering, Vol. 11, No. 2, June 1975, pp. 220-232.

[4]   R. Amuthakkannan, S. M. Kannan, K. Vijayalakshmi and N. Ramaraj, “Reliability Analysis of Programmable Mechatronics System Using Bayesian Approach,” Intern- ational Journal of Industrial and Systems Engineering, Vol. 4, No. 3, 2009, pp. 303-325.

[5]   V. Dhudsia, “Guidelines for Equipment Reliability,” Technical Publication, Sematech, Inc, 1997. http://www.

[6]   G. K. Fourlas, “An Approach towards Fault Tolerant Hybrid Control Systems,” Control & Automation Mediter- ranean Conference on MED, Corsica, 27-29 June 2007, pp. 1-6.

[7]   J. D. Musa, A. Iannino and K. Okumoto, “Software Relia- bility: Measurement, Prediction, Application,” Mc-Graw- Hill, Inc. New York, 1987.

[8]   R. Isermann, “Mechatronic Systems Fundamentals,” Springer, London. 2003.

[9]   K. Lee, K. C. Kang, E. Koh, W. Chae, B. Kim and B. W. Choi, “Domain-Oriented Engineering of Elevator Control Software: A Product Line Practice,” Proceedings of the First Software Product Line Conference, Denver, August 2000, pp. 3-22.

[10]   “Facility System Safety Guidebook,” NASA-STD-8719.7, National Aeronautics and Space Administration, 1996.

[11]   “The use of Computers in Safety Critical Operations,” Final Report of the Study Group on the Safety of Ope- rational Computer Operations, Health and Safety Commission, UK. ters.pdf

[12]   N. Leveson, “Software Safety: Why, What, and How,” ACM Computing Surveys, Vol. 18, No. 2, June 1986, pp. 125-163.

[13]   D. E. Knuth, “Fundamental Algorithms,” The Art of Com- puter Programming, 3rd Edition, Addison-Wesley, Read- ing, Massachusetts, Vol. 1, 1973.