SSFT: selective software fault tolerance

buir.advisorÖztürk, Özcan
dc.contributor.authorTurhan, Tuncer
dc.date.accessioned2016-07-01T11:10:23Z
dc.date.available2016-07-01T11:10:23Z
dc.date.issued2014
dc.descriptionCataloged from PDF version of article.en_US
dc.description.abstractAs technology advances, processors shrink in size and are manufactured with higher transistor densities, which makes them cheaper, more power efficient, and more powerful. While this progress benefits end users, it also makes processors more vulnerable to external radiation, which causes soft errors that occur mostly as single bit flips in data. To protect against soft errors, hardware techniques such as ECC (Error Correcting Code) and RAM parity memory have been proposed; they provide error detection and, in some cases, error correction. While hardware techniques are effective, software-only techniques may offer cheaper and more flexible alternatives where additional hardware is not available or cannot be introduced into existing architectures. Software fault detection techniques, while powerful, rely mostly on redundancy, which causes significant performance overhead and increases the number of bits susceptible to soft errors. In most cases where reliability is a concern, the availability and performance of the system are an even bigger concern, which calls for a multi-objective optimization approach. In applications where a certain margin of error is acceptable and availability is important, existing software fault tolerance techniques may not be directly applicable because of the unacceptable performance overheads they introduce. Our technique, Selective Software Fault Tolerance (SSFT), aims to provide availability and reliability simultaneously by applying only the required amount of protection while preserving the quality of the program output. SSFT uses software profiling information to understand an application's vulnerabilities to transient faults. Transient faults are more likely to occur in instructions that have higher execution counts. Additionally, instructions that cause greater damage to the program output when hit by transient faults should be considered application weaknesses in terms of reliability. SSFT combines these two kinds of information to exclude from fault tolerance the instructions that are less likely to be hit by transient errors or to cause errors in the program output. This approach reduces power consumption and redundancy (and therefore leaves fewer data bits susceptible to soft errors), while improving performance and providing acceptable reliability. The technique can easily be adapted to existing software fault tolerance techniques to achieve a form of protection that better satisfies the different concerns of the application. Similarly, hybrid and hardware-only approaches may also take advantage of the optimizations provided by our technique.en_US
dc.description.provenanceMade available in DSpace on 2016-07-01T11:10:23Z (GMT). No. of bitstreams: 1 0006636.pdf: 2077218 bytes, checksum: aaedd954ac204220e0fcd81f81e90a73 (MD5) Previous issue date: 2014en
dc.description.statementofresponsibilityTurhan, Tunceren_US
dc.format.extentxiii, 82 leaves, graphics, tablesen_US
dc.identifier.itemidB139622
dc.identifier.urihttp://hdl.handle.net/11693/30002
dc.language.isoEnglishen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectSoftware Fault Toleranceen_US
dc.subjectMulti objective optimization: Reliability and Availabilityen_US
dc.subjectReliabilityen_US
dc.subjectSoftware Profiling for Reliabilityen_US
dc.subjectSoftware Fault Injectionen_US
dc.subject.lccQA76.9.F38 T87 2014en_US
dc.subject.lcshFault-tolerant computing.en_US
dc.titleSSFT: selective software fault toleranceen_US
dc.typeThesisen_US
thesis.degree.disciplineComputer Engineering
thesis.degree.grantorBilkent University
thesis.degree.levelMaster's
thesis.degree.nameMS (Master of Science)
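
The selection idea described in the abstract can be illustrated with a minimal sketch. It is not the thesis's actual algorithm: the risk formula (execution share multiplied by a damage score), the `coverage` threshold, and every name in the code (`InstrProfile`, `select_protected`, the instruction labels and numbers) are assumptions made up for illustration. The sketch ranks instructions by how likely they are to be hit and how much damage a hit causes, then protects only the highest-risk ones until a chosen fraction of the total risk is covered.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class InstrProfile:
    name: str        # hypothetical instruction label
    exec_count: int  # dynamic execution count, e.g. from a profiling run
    damage: float    # assumed mean output damage in [0, 1], e.g. from fault injection


def select_protected(profiles, coverage=0.9):
    """Return names of instructions to protect, highest risk first.

    Risk is modelled (as an assumption) as execution share times damage.
    Instructions are added until `coverage` of the total risk is covered;
    the remaining instructions are left unprotected.
    """
    total_execs = sum(p.exec_count for p in profiles)
    if total_execs == 0:
        return []
    risk = {p.name: (p.exec_count / total_execs) * p.damage for p in profiles}
    total_risk = sum(risk.values())
    if total_risk == 0:
        return []
    protected, covered = [], 0.0
    for name in sorted(risk, key=risk.get, reverse=True):
        if covered >= coverage * total_risk:
            break
        protected.append(name)
        covered += risk[name]
    return protected


# Illustrative, made-up profile data.
profiles = [
    InstrProfile("load_a", 9000, 0.80),
    InstrProfile("add_b", 900, 0.10),
    InstrProfile("store_c", 100, 0.90),
]
print(select_protected(profiles, coverage=0.9))
```

In this toy profile a single frequently executed, high-damage instruction accounts for most of the modelled risk, so a 90% coverage target leaves the other two instructions unprotected. Raising `coverage` trades performance for reliability, which is the kind of trade-off the abstract describes.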

Files

Original bundle

Name: 0006636.pdf
Size: 1.98 MB
Format: Adobe Portable Document Format
Description: Full printable version