THE PROCEDURE FOR ENSURING FAULT TOLERANCE OF A COMPUTER SYSTEM BASED ON THE USE OF MODULAR ARITHMETIC

Authors

  • A. Yanko
  • V. Krasnobayev
  • R. Liubchenko
  • P. Sabelnikova

DOI:

https://doi.org/10.26906/SUNZ.2023.4.125

Keywords:

active fault tolerance, types of redundancy, computer system, modular arithmetic, computing tract

Abstract

The article proposes for consideration the process of functioning of a computer system (CS) in modular arithmetic (MA) based on the use of the principle of fault tolerance. In MA, the fault tolerance of the CS is ensured by taking into account the main properties of this modular numbering system, namely: independence, equality and low-bitness, the totality of which forms a modular code structure. In order to facilitate and visualize the study of the functioning of the computer system in the MA, the article introduces an n-bit binary code that reflects the set of signs of the operational state of the computing tracts of the CS. With the help of this code, an analysis of various modes of fault tolerance of CS in MA was carried out. The fault-tolerance property provides the possibility of performing specified computing functions after the occurrence of failures, both due to a decrease in the permissible limits of any indicators of the quality of functioning, and without deterioration of the indicators of the quality of the functioning of the CS. The article considers the mode of fault tolerance of CS in MA, which is implemented by the method of replacement. The essence of which is as follows, if the reserve is exhausted (reserve computing path), then the failure-tolerance mode of the CS in the MA is implemented by the method of gradual degradation. CS in MA can be considered as a multi-microprocessor computing system with dynamic redundancy and subsequent gradual degradation of functionality in the event of failures. The structure of the functioning of the fault-tolerant CS in MA has been developed. Examples of CS fault tolerance for a specific set of MA modules are given. The comparative analysis of the research results carried out in the article showed the following: with fault-tolerant functioning in the modes of replacement or replacement with subsequent gradual degradation, the CS in MA is more effective than the similar CS in the binary positional numbering system. Using MA as a counting system allows you to create a CS structure similar to the structure of a reserved computing system in a binary positional numbering system. This circumstance, along with other positive aspects of the use of modular coding, caused by the influence of the main features of MA on the structure and principles of operation of the CS, allows to consider MA as a powerful tool for increasing fault tolerance.

Downloads

Download data is not yet available.

References

Avresky D. R., Geoghegan S. J., Varoglu Y. Evaluation of Software-Implemented Fault-Tolerance (SIFT) Approach in Gracefully Degradable Multi-Computer Systems. IEEE Transactions on Reliability, Volume 55, Issue: 3, September 2006, pp. 451-457. doi: 10.1109/TR.2006.879663.

Wang T., Liu H., Sun M., Liu Z., Zhou M. Fault tolerance on improved distributed spanning tree structure. 2nd International Conference on Advanced Computer Control, Shenyang, 2010, pp. 296-300. doi: 10.1109/ICACC.2010.5487249.

Krasnobayev V., Kuznetsov A., Yanko A., Koshman S., Zamula A. and Kuznetsova T. Data processing in the system of residual classes. Monograph. ASC Academic Publishing, 2019, 208 p. ISBN: 978-0-9989826-6-3, ISBN: 978-0-9989826-7-0 (Ebook).

Raphaël M., Raison B. A probabilistic fault localization method for distribution networks using fault indicators. 2013 IEEE Grenoble Conference, Grenoble, 2013, pp. 1-6. doi: 10.1109/PTC.2013.6652237.

Krasnobaev V., Kuznetsov A., Kiian A., Kuznetsova K. Fault Tolerance Computer System Structures Functioning in Residue Classes. Proceedings of 2021 11th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS), Cracow, 2021, pp. 471-474. doi: 10.1109/IDAACS53288.2021.9660919.

Geng Z., Han M., Bao W., Xie W. New Submodule Topology Based Fault Tolerance for Modular Multilevel Converters. IEEE Transactions on Power Delivery, 2023. pp 1-12. doi: 10.1109/TPWRD.2023.3237891.

Published

2023-12-12

Most read articles by the same author(s)

1 2 > >>