- Subscribe to this page:
- RSS
- Email Alert
fault-tolerant servers
(400 results)-
White Papers
Fault Tolerance for HPC With OpenVZ Virtualization by Lite Migration Toolkit
Dec 2010
The reliability of large-scale parallel jobs within a cluster or even across multi-clusters under the Grid or distributed computing environment is a long term issue due to its difficulties...
Provided by National Center for High-Performance Computing (NCHC)
-
White Papers
Using Virtualization to Validate Fault-Tolerant Distributed Systems
Nov 2010
Asynchronous events and complex system state distributed across independent nodes make exposure and diagnosis of flaws in distributed systems a challenge. The difficulties are exacerbated when the...
Provided by University of California, Los Angeles (Anderson)
-
White Papers
Adaptive Virtual Network Provisioning
Sep 2010
In the future, virtual networks will be allocated, maintained and managed much like clouds offering flexibility, extensibility and elasticity with resources acquired for a limited time and even on...
Provided by Association for Computing Machinery
-
White Papers
Improving Scalability and Fault Tolerance in an Application Management Infrastructure
Jun 2008
This paper explores the challenges associated with distributed application management in large-scale computing environments. In particular, the authors investigate several techniques for extending...
Provided by University of California, San Diego
-
White Papers
A Light-Weight Cache-Based Fault Detection and Checkpointing Scheme for MPSoCs Enabling Relaxed Execution Synchronization
Oct 2008
While technology advances have made MPSoCs a standard architecture for embedded systems, their applicability is increasingly being challenged by dramatic increases in the amount of device failures...
Provided by Association for Computing Machinery
-
White Papers
Reducing Overhead for Soft Error Coverage in High Availability Systems
Dec 2007
High reliability/availability systems typically use redundant computation and components to achieve detection, isolation and recovery from faults. Chip multiprocessors (CMPs) incorporate multiple...
Provided by University of Wisconsin
-
White Papers
Robust and Flexible Power-Proportional Storage
Jun 2010
Power-proportional cluster-based storage is an important component of an overall cloud computing infrastructure. With it, substantial subsets of nodes in the storage cluster can be turned off to...
Provided by Association for Computing Machinery
-
White Papers
Achieving Power-Efficiency in Clusters Without Distributed File System Complexity
May 2010
Power-efficient operation is a desirable property, particularly for large clusters housed in datacenters. Recent work has advocated turning off entire nodes to achieve power-proportionality, but...
Provided by Georgia Tech
-
White Papers
Multiple-Objective Metric for Placing Multiple Base Stations in Wireless Sensor Networks
Aug 2007
The placement of base stations in wireless sensor networks affects the coverage of sensor nodes, the tolerance against faults or attacks, the energy consumption and the congestion from...
Provided by Korea University
-
White Papers
Online Failure Forecast for Fault-Tolerant Data Stream Processing
Dec 2007
In this paper, the authors present a new online failure forecast system to achieve predictive failure management for fault-tolerant data stream processing. Different from previous reactive or...
Provided by University of Illinois
-
White Papers
Tolerating File-System Mistakes With EnvyFS
May 2009
The authors introduce EnvyFS, an N-version local file system designed to improve reliability in the face of file-system bugs. EnvyFS, implemented as a thin VFS-like layer near the top of the...
Provided by University of Wisconsin
-
White Papers
Membrane: Operating System Support for Restartable File Systems
Sep 2010
The authors introduce Membrane, a set of changes to the operating system to support restartable file systems. Membrane allows an operating system to tolerate a broad class of file system failures,...
Provided by Association for Computing Machinery
-
White Papers
A Comparison of Overlay Routing and Multihoming Route Control
Jan 2011
The limitations of BGP routing in the Internet are often blamed for poor end-to-end performance and prolonged connectivity interruptions. Recent work advocates using overlays to effectively bypass...
Provided by Carnegie Mellon University
-
White Papers
Proactive Service Migration for Long-Running Byzantine Fault Tolerant Systems
Mar 2009
This paper describes a proactive recovery scheme based on service migration for long-running Byzantine fault tolerant systems. Proactive recovery is an essential method for ensuring long term...
Provided by Cleveland State University
-
White Papers
Design and Implementation of a Byzantine Fault Tolerance Framework for Web Services
Dec 2008
Many Web services are expected to run with high degree of security and dependability. To achieve this goal, it is essential to use a Web-services compatible framework that tolerates not only crash...
Provided by Cleveland State University
-
White Papers
A Game Theoretical View of Byzantine Fault Tolerance Design
Oct 2007
This paper investigates the optimal Byzantine Fault Tolerance (BFT) design strategies from a game theoretical point of view. The problem of BFT is formulated as a constant-sum game played by the...
Provided by RAMS Consultants
-
White Papers
Fault Tolerance Middleware for Cloud Computing
Mar 2010
The Low Latency Fault Tolerance (LLFT) middleware provides fault tolerance for distributed applications deployed within a cloud computing or data center environment, using the leader/follower...
Provided by Institute of Electrical and Electronics Engineers
-
White Papers
Integrity-Preserving Replica Coordination for Byzantine Fault Tolerant Systems
Jun 2008
The use of good random numbers is essential to the integrity of many mission-critical systems. However, when such systems are replicated for Byzantine fault tolerance, a serious issue arises,...
Provided by Cleveland State University
-
White Papers
Byzantine Fault Tolerant Coordination for Web Services Business Activities
Apr 2008
This paper presents a comprehensive study on the threats towards the coordination services for Web services business activities and explores the most optimal solution to mitigate such threats. A...
Provided by Cleveland State University
-
White Papers
Byzantine Fault Tolerance for Electric Power Grid Monitoring and Control
Apr 2008
The stability of the electric power grid is crucial to every nation's security and well-being. As revealed by a number of large-scale blackout incidents in North America, the data communication...
Provided by Cleveland State University
-
White Papers
A Lightweight Fault Tolerance Framework for Web Services
Aug 2007
This paper presents the design and implementation of a lightweight fault tolerance framework for Web services. With the framework, a Web service can be rendered fault tolerant by replicating it...
Provided by Cleveland State University
-
White Papers
Byzantine Fault Tolerant Coordination for Web Services Atomic Transactions
Jul 2007
This paper presents the mechanisms needed for Byzan-tine fault tolerant coordination of Web services atomic transactions. The mechanisms have been incorporated into an open-source framework...
Provided by Cleveland State University
-
White Papers
A Byzantine Fault Tolerant Distributed Commit Protocol
Jul 2007
This paper presents a Byzantine fault tolerant distributed commit protocol for transactions running over untrusted networks. The traditional two-phase commit protocol is enhanced by replicating...
Provided by Cleveland State University
-
White Papers
Byzantine Fault Tolerance for Non Deterministic Applications
Jun 2007
All practical applications contain some degree of non-determinism. When such applications are replicated to achieve Byzantine Fault Tolerance (BFT), their nondeterministic operations must be...
Provided by Cleveland State University
-
White Papers
Independent Faults in the Cloud
Jul 2010
Byzantine Fault Tolerant (BFT) protocols are replication-based solutions to the problem of tolerating the arbitrary failures of software and hardware components. The essential assumption for...
Provided by EPFL
-
White Papers
Proactive Process-Level Live Migration in HPC Environments
Jul 2008
As the number of nodes in high-performance computing environments keeps increasing, faults are becoming common place. Reactive Fault Tolerance (FT) often does not scale due to massive I/O...
Provided by North Carolina State University
-
White Papers
Hybrid Full/Incremental Checkpoint/Restart for MPI Jobs in HPC Environments
Jun 2009
As the number of cores in high-performance computing environments keeps increasing, faults are becoming common place. Checkpointing addresses such faults but captures full process images even...
Provided by North Carolina State University
-
White Papers
A Fault Tolerance Approach for Enterprise Applications
Apr 2008
Service Oriented Architectures (SOAs) have emerged as a preferred solution to tackle the complexity of large-scale, complex, distributed, and heterogeneous systems. Key to successful operation of...
Provided by University of California, San Diego
-
White Papers
Three Approximation Algorithms for Energy-Efficient Query Dissemination in Sensor Database System
Aug 2009
Sensor database is a type of database management system which offers sensor data and stored data in its data model and query languages. In this system, when a user poses a query to this sensor...
Provided by Springer Science+Business Media
-
White Papers
CIFTS: A Coordinated Infrastructure for Fault-Tolerant Systems
Jun 2009
Considerable work has been done on providing fault tolerance capabilities for different software components on large scale high-end computing systems. Thus far, however, these fault tolerant...
Provided by Indiana University
-
White Papers
An Efficient Hardware-Software Approach to Network Fault Tolerance With InfiniBand
Jul 2009
In the last decade or so, clusters have observed a tremendous rise in popularity due to excellent price to performance ratio. A variety of Interconnects have been proposed during this period, with...
Provided by Ohio State University
-
White Papers
A Software Based Approach for Providing Network Fault Tolerance in Clusters With UDAPL Interface: MPI Level Design and Performance Evaluation
Jan 2011
In the arena of cluster computing, MPI has emerged as the de facto standard for writing parallel applications. At the same time, introduction of high speed RDMA-enabled interconnects like...
Provided by Ohio State University
-
Whitepapers
A Probabilistic Bundle Relay Strategy in Two-Hop Vehicular Delay Tolerant Networks
Apr 2011
A persisting major challenge in Vehicular Delay Tolerant Networks (VDTNs) is the delay minimization of data delivery when communicating nodes are stationary, arbitrarily deployed along roadsides...
Provided by Institute of Electrical & Electronic Engineers
-
White Papers
Fault-Tolerant Architecture With Dynamic Wavelength and Bandwidth Allocation Scheme in WDM-EPON
Mar 2008
This study proposes a novel fault-tolerant architecture in WDM-EPON, Cost-based Fault-tolerant WDM-EPON (CFT-WDM-EPON), to provide overall protection. The CFT-WDM-EPON only equips a backup feeder...
Provided by Yuan Ze University
-
White Papers
A Fault Tolerant, Peer-to-Peer Replication Network
Jan 2011
A peer-to-peer, commonly abbreviated to P2P, is any distributed network architecture composed of participants that make a portion of their resources (such as processing power, disk storage or...
Provided by Institute of Electrical and Electronics Engineers
-
White Papers
On the Possibility of Consensus in Asynchronous Systems
Jan 2011
The authors demonstrate that the leader election and consensus problems are solvable in a timed asynchronous distributed system provided a majority of processes are always eventually able to...
Provided by King Fahd University of Petroleum & Minerals
-
White Papers
Algorithms for Fault-Tolerant Topology in Heterogeneous Wireless Sensor Networks
Apr 2008
This paper addresses fault-tolerant topology control in a heterogeneous wireless sensor network consisting of several resource-rich supernodes, used for data relaying, and a large number of...
Provided by Institute of Electrical and Electronics Engineers
-
White Papers
Designing Efficient Algorithms for the Eventually Perfect Failure Detector Class
Oct 2007
The concept of an unreliable failure detector was introduced by Chandra and Toueg as a mechanism that provides (possibly incorrect) information about process failures. This mechanism has been used...
Provided by Academy Publisher
-
White Papers
Prime: Byzantine Replication Under Attack
Jun 2010
Existing Byzantine-resilient replication protocols satisfy two standard correctness criteria, safety and liveness, even in the presence of Byzantine faults. The runtime performance of these...
Provided by Johns Hopkins University
-
White Papers
Intrusion-Tolerant Group Management for Mobile Ad-Hoc Networks
Mar 2009
This paper presents PICO, a generic infrastructure for secure group communication in Mobile Ad-hoc NETworks (MANETs). PICO provides an intrusion-tolerant group management service, allowing clients...
Provided by Johns Hopkins University
-
Whitepapers
PSO Technique in Fault Diagnosis of Multilevel Inverter Drive System
Sep 2011
A fault diagnosis system in a multilevel-inverter using a compact PSO and neural network is proposed in this paper. It is difficult to diagnosis a MultiLevel-Inverter Drive (MLID) system using a...
Provided by EuroJournals
-
Whitepapers
A Distributed Intrusion Detection System Based on Mobile Agents with Fault Tolerance
Oct 2011
Now-a-days networks are becoming more open and subject to set of security threats. Recently there has been much focus on building secure distributed systems. Thus, a key challenge is to provide...
Provided by EuroJournals
-
Whitepapers
Efficient Fault Tolerant Adaptive Routing for Spidergon NoC Architecture
Nov 2011
The wide range of on chip network applications' performance demands adopting different architectures suited for an application. The performance of a topology can be improved by employing different...
Provided by EuroJournals
-
Whitepapers
An Analysis on the Effect of Mobility and Load on Fault Tolerant Service Selection Framework for Pervasive Environments
Dec 2011
Service selection in pervasive environments is a challenging research domain as it enables the user to identify the best service provider based on its requirements. Fault Tolerant Service...
Provided by EuroJournals
-
Whitepapers
Youla Parameterisation Based Fault Tolerant Control
Feb 2012
This paper proposes a switching Fault-Tolerant Control (FTC) approach for linear systems subject to time-varying actuator and sensor faults. The faults under consideration include effectiveness...
Provided by EuroJournals
-
Whitepapers
An Adaptive Fault Tolerant Multipath Routing (AFTMR) Protocol for Wireless Ad Hoc Networks
Jul 2012
The increasing popularity in wireless communication devices and the advancements in wireless technology make the communication in an effective and efficient manner. Mobile Ad-hoc NETwork (MANET)...
Provided by EuroJournals
-
Whitepapers
A Fault Tolerant Multipath Routing Protocol for Mobile Ad Hoc Networks
Sep 2012
Mobile Ad-hoc NETworks (MANETs) are self-configuring networks that designing an efficient routing is one of the most important challenging issues of them due to nodes mobility and wireless...
Provided by EuroJournals
-
Whitepapers
Fault Tolerant Most Fitting Resource Scheduling Algorithm (FMFRS) for Computational Grid
Sep 2012
In computational Grid, fault tolerance is an imperative issue to be considered during job scheduling. Due to the widespread use of resources, systems are highly prone to errors and failures. Hence...
Provided by EuroJournals
-
Whitepapers
Fault Tolerant Advance Reservation-Based Scheduling in Computational Grid
Oct 2012
Grid computing provides the ability to access, utilize and control a variety of underutilized heterogeneous resources distributed across multiple administrative domains. Advance reservation...
Provided by EuroJournals
-
Whitepapers
Algorithm-Based Fault Tolerance for Dense Matrix Factorizations
Feb 2012
Dense matrix factorizations, such as LU, Cholesky and QR, are widely used for scientific applications that require solving systems of linear equations, eigenvalues and linear least squares...
Provided by Association for Computing Machinery
-
Whitepapers
Enabling Application Resilience With and Without the MPI Standard
Jan 2012
As recent research has demonstrated, it is becoming a necessity for large scale applications to have the ability to tolerate process failure during an execution. As the number of processes...
Provided by University of Tehran
-
Whitepapers
Cloud Platform Datastore Support
Sep 2012
There are many datastore systems to choose from that differ in many ways including public versus private cloud support, data management interfaces, programming languages, supported feature sets,...
Provided by University of Calgary
-
Whitepapers
A Checkpoint-on-Failure Protocol for Algorithm-Based Recovery in Standard MPI
May 2012
Most predictions of Exa-scale machines picture billion way parallelism, encompassing not only millions of cores, but also tens of thousands of nodes. Even considering extremely optimistic advances...
Provided by University of Tehran
-
Whitepapers
Weak Estimation-Based Algorithm Fault-Tolerant Routing Wireless Sensor Networks
Nov 2011
Wireless Sensor Networks are characterized by the cooperative engagement of mobile nodes that constitute networks possessing continuously-changing infrastructures, the bereavement of centralized...
Provided by Islamic Azad University
-
Whitepapers
Secured QoS Routing Path Discovery in Wireless Ad hoc Networks Using Cache Mechanism
Oct 2012
Wireless ad-hoc networks consist of a set of nodes which construct a random network topology by means of several communication media. Wireless Ad-hoc network present a diversification in...
Provided by International Journal of Computer Networks and Wireless Communications (IJCNWC)
-
Whitepapers
Energy Efficient Architecture to Cognitive Radio Wireless Sensor Networks
Dec 2012
Cognitive radio has been considered as a key technology for future wireless communications and mobile computing. In this cognitive radio technology spectrum play an important role. Effective...
Provided by International Journal of Computer Networks and Wireless Communications (IJCNWC)
-
Whitepapers
EFP: New Energy - Efficient Fault-Tolerant Protocol for Wireless Sensor Network
Nov 2012
Saving energy and increasing network lifespan are important problems in Wireless Sensor Networks (WSNs). WSNs with many small nodes can be used for monitoring and controlling the physical...
Provided by AIRCC
-
Whitepapers
Construction of Power Efficient Fault Tolerant Wireless Network Using Cone Based Algorithm
Nov 2012
Fault tolerant topology control for all to one communication holds significance in dynamic wireless networks with asymmetric wireless links. In this paper the author Investigates the various...
Provided by International Journal of Computer Technology and Applications
-
Whitepapers
Fault Tolerance-Challenges, Techniques and Implementation in Cloud Computing
Jan 2012
Fault tolerance is a major concern to guarantee availability and reliability of critical services as well as application execution. In order to minimize failure impact on the system and...
Provided by International Journal of Computer Science Issues
-
Whitepapers
An Intrusion Tolerance Approach for Internet Security
Oct 2012
The Internet has become essential to most enterprises and many private individuals. However, both network and the computer systems connected to it are still vulnerable to attacks which are...
Provided by International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE)
-
Whitepapers
Virtualisation and Cloud Computing - Optimised Power, Cooling and Management Maximises Benefits
Jan 2013
IT virtualisation, the engine behind cloud computing, can have significant consequences on the data centre physical infrastructure (DCPI). Higher power densities that often result can challenge...
Provided by Schneider Electric - AU
-
Whitepapers
Fault-Tolerant Scheduling With Dynamic Number of Replicas in Heterogeneous Systems
Sep 2010
In this paper, the authors show that it does not always lead to a higher reliability with more replicas. Besides, the more replicas implies more resource consumption and higher economic cost. To...
Provided by Institute of Electrical & Electronic Engineers
-
Webcasts
Cassandra NYC 2011: Matt Dennis - Data Modeling Workshop
Aug 2012
In this webcast, the presenter explains about DataStax which currently focused on high-level architecture, design, data models, deployment and algorithms for some of the largest, highest volume...
Provided by Oleksiy Kovyrin
-
Webcasts
Cassandra NYC 2011: Nathan Marz - The Storm and Cassandra Realtime Computation Stack
Aug 2012
In this webcast, the presenter will explain about Storm which is a distributed, reliable, and fault-tolerant stream processing system. Its use cases are so broad that the presenters consider it to...
Provided by Oleksiy Kovyrin
-
Whitepapers
A Simple Automotive Application Using Flexray Protocol
Nov 2012
FlexRay protocol is emerging as the next generation automotive communication protocol which offers high data rate, deterministic, fault tolerant, flexible in-vehicle data communication. This...
Provided by R.V. College of Engineering
-
Whitepapers
Measures of Fault Tolerance in Distributed Simulated Annealing
Dec 2012
In this paper, the authors examine the different measures of Fault Tolerance in a Distributed Simulated Annealing process. Optimization by Simulated Annealing on a distributed system is prone to...
Provided by Cornell University
-
Whitepapers
Suitable Node Deployment Based on Geometric Patterns Considering Fault Tolerance in Wireless Sensor Networks
Dec 2012
Wireless Sensor Networks (WSNs) consist of small power-constrained nodes with sensing, computation and wireless communication capabilities. These nodes are deployed in the sensing region to...
Provided by International Journal of Computer Applications
-
Whitepapers
A Generic Checkpoint-Restart Mechanism for Virtual Machines
Dec 2012
It is common today to deploy complex software inside a Virtual Machine (VM). Snapshots provide rapid deployment, migration between hosts, dependability (fault tolerance), and security (insulating...
Provided by Northeastern University
-
Whitepapers
Energy Efficient and Fault-Tolerant Broadcast Protocol in Wireless Ad-Hoc Networks
Nov 2009
To achieve efficient broadcasting with low interference and low energy consumption, each node optimizes its transmission power. In a tree based topology, the message of node user can be overheard...
Provided by Institute of Electrical & Electronic Engineers
-
Whitepapers
A Method to Construct an Attack and Fault Tolerant Scalable Distributed Network
Jul 2012
Distributed networks have attracted much attention due to their scalability and inexpensiveness as compared to traditional centralized networks. While distributed networks are appropriate to...
Provided by Institute of Electrical & Electronic Engineers
-
Whitepapers
Shark: Fast Data Analysis Using Coarse-Grained Distributed Memory
May 2012
Shark is a research data analysis system built on a novel coarse-grained distributed shared-memory abstraction. Shark marries query processing with deep data analysis, providing a unified system...
Provided by Association for Computing Machinery
-
Whitepapers
Fault Tolerant Message Efficient Coordinator Election Algorithm in High Traffic Bidirectional Ring Network
Dec 2012
Now-a-days use of distributed systems such as internet and cloud computing is growing dramatically. Coordinator existence in these systems is crucial due to processes coordinating and consistency...
Provided by mecs-press
-
Whitepapers
Vector Routing for Delay Tolerant Networks
Aug 2008
Recently, much research work has paid attention to Delay Tolerant Networks (DTNs), which are networks with a frequent occurrence of network partitioning. Since the successful establishment of an...
Provided by Institute of Electrical & Electronic Engineers
-
Whitepapers
Redundant Dissimilar Sensor Fusion With Dynamic Driver Input Classification and Graceful Degradation for Drive-by-Wire Applications
Mar 2010
Dissimilar sensor redundancy with force and displacement measurements can provide extended fault coverage for drive-by-wire applications. However, large variances occur when correlating these...
Provided by University of California, Santa Cruz
-
Whitepapers
Active Quorum Systems
Sep 2010
This paper outlines a flexible suite of object replication protocols that brings together Byzantine quorum systems registers and state machine replication. These protocols enable the...
Provided by Universidade de Coimbra
-
Whitepapers
Byzantine Consensus With Unknown Participants
Oct 2008
Consensus is a fundamental building block used to solve many practical problems that appear on reliable distributed systems. In spite of the fact that consensus is being widely studied in the...
Provided by Springer Healthcare
-
Whitepapers
An Efficient Byzantine-Resilient Tuple Space
Mar 2009
Open distributed systems are typically composed by an unknown number of processes running in heterogeneous hosts. Their communication often requires tolerance to temporary disconnections and...
Provided by Institute of Electrical & Electronic Engineers
-
Whitepapers
Spin One's Wheels - Byzantine Fault Tolerance With a Spinning Primary
Jul 2009
Most Byzantine Fault-Tolerant state machine replication (BFT) algorithms have a primary replica that is in charge of ordering the clients requests. Recently it was shown that this dependence...
Provided by Carnegie Mellon University
-
Whitepapers
Sharing Memory Between Byzantine Processes Using Policy-Enforced Tuple Spaces
Jan 2009
Despite the large amount of Byzantine fault-tolerant algorithms for message-passing systems designed through the years, only recent algorithms for the coordination of processes subject to...
Provided by Institute of Electrical & Electronic Engineers
-
Whitepapers
Efficient Middleware for Byzantine Fault Tolerant Database Replication
Apr 2011
Byzantine Fault Tolerance (BFT) enhances the reliability and availability of replicated systems subject to software bugs, malicious attacks, or other unexpected events. This paper presents...
Provided by Association for Computing Machinery
-
White Papers
Improving Scalability and Fault Tolerance in an Application Management Infrastructure
Jun 2008
This paper explores the challenges associated with distributed application management in large-scale computing environments. In particular, the authors investigate several techniques for extending...
Provided by University of California, San Diego
-
White Papers
A Light-Weight Cache-Based Fault Detection and Checkpointing Scheme for MPSoCs Enabling Relaxed Execution Synchronization
Oct 2008
While technology advances have made MPSoCs a standard architecture for embedded systems, their applicability is increasingly being challenged by dramatic increases in the amount of device failures...
Provided by Association for Computing Machinery
-
White Papers
Reducing Overhead for Soft Error Coverage in High Availability Systems
Dec 2007
High reliability/availability systems typically use redundant computation and components to achieve detection, isolation and recovery from faults. Chip multiprocessors (CMPs) incorporate multiple...
Provided by University of Wisconsin
-
White Papers
Robust and Flexible Power-Proportional Storage
Jun 2010
Power-proportional cluster-based storage is an important component of an overall cloud computing infrastructure. With it, substantial subsets of nodes in the storage cluster can be turned off to...
Provided by Association for Computing Machinery
-
White Papers
Achieving Power-Efficiency in Clusters Without Distributed File System Complexity
May 2010
Power-efficient operation is a desirable property, particularly for large clusters housed in datacenters. Recent work has advocated turning off entire nodes to achieve power-proportionality, but...
Provided by Georgia Tech
-
White Papers
Multiple-Objective Metric for Placing Multiple Base Stations in Wireless Sensor Networks
Aug 2007
The placement of base stations in wireless sensor networks affects the coverage of sensor nodes, the tolerance against faults or attacks, the energy consumption and the congestion from...
Provided by Korea University
-
White Papers
Online Failure Forecast for Fault-Tolerant Data Stream Processing
Dec 2007
In this paper, the authors present a new online failure forecast system to achieve predictive failure management for fault-tolerant data stream processing. Different from previous reactive or...
Provided by University of Illinois
-
White Papers
Tolerating File-System Mistakes With EnvyFS
May 2009
The authors introduce EnvyFS, an N-version local file system designed to improve reliability in the face of file-system bugs. EnvyFS, implemented as a thin VFS-like layer near the top of the...
Provided by University of Wisconsin
-
White Papers
Membrane: Operating System Support for Restartable File Systems
Sep 2010
The authors introduce Membrane, a set of changes to the operating system to support restartable file systems. Membrane allows an operating system to tolerate a broad class of file system failures,...
Provided by Association for Computing Machinery
-
White Papers
A Comparison of Overlay Routing and Multihoming Route Control
Jan 2011
The limitations of BGP routing in the Internet are often blamed for poor end-to-end performance and prolonged connectivity interruptions. Recent work advocates using overlays to effectively bypass...
Provided by Carnegie Mellon University
-
White Papers
Proactive Service Migration for Long-Running Byzantine Fault Tolerant Systems
Mar 2009
This paper describes a proactive recovery scheme based on service migration for long-running Byzantine fault tolerant systems. Proactive recovery is an essential method for ensuring long term...
Provided by Cleveland State University
-
White Papers
Design and Implementation of a Byzantine Fault Tolerance Framework for Web Services
Dec 2008
Many Web services are expected to run with high degree of security and dependability. To achieve this goal, it is essential to use a Web-services compatible framework that tolerates not only crash...
Provided by Cleveland State University
-
White Papers
A Game Theoretical View of Byzantine Fault Tolerance Design
Oct 2007
This paper investigates the optimal Byzantine Fault Tolerance (BFT) design strategies from a game theoretical point of view. The problem of BFT is formulated as a constant-sum game played by the...
Provided by RAMS Consultants
-
White Papers
Fault Tolerance Middleware for Cloud Computing
Mar 2010
The Low Latency Fault Tolerance (LLFT) middleware provides fault tolerance for distributed applications deployed within a cloud computing or data center environment, using the leader/follower...
Provided by Institute of Electrical and Electronics Engineers
-
White Papers
Integrity-Preserving Replica Coordination for Byzantine Fault Tolerant Systems
Jun 2008
The use of good random numbers is essential to the integrity of many mission-critical systems. However, when such systems are replicated for Byzantine fault tolerance, a serious issue arises,...
Provided by Cleveland State University
-
White Papers
Byzantine Fault Tolerant Coordination for Web Services Business Activities
Apr 2008
This paper presents a comprehensive study on the threats towards the coordination services for Web services business activities and explores the most optimal solution to mitigate such threats. A...
Provided by Cleveland State University
-
White Papers
Byzantine Fault Tolerance for Electric Power Grid Monitoring and Control
Apr 2008
The stability of the electric power grid is crucial to every nation's security and well-being. As revealed by a number of large-scale blackout incidents in North America, the data communication...
Provided by Cleveland State University
-
White Papers
A Lightweight Fault Tolerance Framework for Web Services
Aug 2007
This paper presents the design and implementation of a lightweight fault tolerance framework for Web services. With the framework, a Web service can be rendered fault tolerant by replicating it...
Provided by Cleveland State University
-
White Papers
Byzantine Fault Tolerant Coordination for Web Services Atomic Transactions
Jul 2007
This paper presents the mechanisms needed for Byzan-tine fault tolerant coordination of Web services atomic transactions. The mechanisms have been incorporated into an open-source framework...
Provided by Cleveland State University
-
White Papers
A Byzantine Fault Tolerant Distributed Commit Protocol
Jul 2007
This paper presents a Byzantine fault tolerant distributed commit protocol for transactions running over untrusted networks. The traditional two-phase commit protocol is enhanced by replicating...
Provided by Cleveland State University
-
White Papers
Byzantine Fault Tolerance for Non Deterministic Applications
Jun 2007
All practical applications contain some degree of non-determinism. When such applications are replicated to achieve Byzantine Fault Tolerance (BFT), their nondeterministic operations must be...
Provided by Cleveland State University
-
White Papers
Independent Faults in the Cloud
Jul 2010
Byzantine Fault Tolerant (BFT) protocols are replication-based solutions to the problem of tolerating the arbitrary failures of software and hardware components. The essential assumption for...
Provided by EPFL
-
White Papers
Proactive Process-Level Live Migration in HPC Environments
Jul 2008
As the number of nodes in high-performance computing environments keeps increasing, faults are becoming common place. Reactive Fault Tolerance (FT) often does not scale due to massive I/O...
Provided by North Carolina State University
-
White Papers
Hybrid Full/Incremental Checkpoint/Restart for MPI Jobs in HPC Environments
Jun 2009
As the number of cores in high-performance computing environments keeps increasing, faults are becoming common place. Checkpointing addresses such faults but captures full process images even...
Provided by North Carolina State University
-
White Papers
A Fault Tolerance Approach for Enterprise Applications
Apr 2008
Service Oriented Architectures (SOAs) have emerged as a preferred solution to tackle the complexity of large-scale, complex, distributed, and heterogeneous systems. Key to successful operation of...
Provided by University of California, San Diego
-
White Papers
Three Approximation Algorithms for Energy-Efficient Query Dissemination in Sensor Database System
Aug 2009
Sensor database is a type of database management system which offers sensor data and stored data in its data model and query languages. In this system, when a user poses a query to this sensor...
Provided by Springer Science+Business Media
-
White Papers
CIFTS: A Coordinated Infrastructure for Fault-Tolerant Systems
Jun 2009
Considerable work has been done on providing fault tolerance capabilities for different software components on large scale high-end computing systems. Thus far, however, these fault tolerant...
Provided by Indiana University
-
White Papers
An Efficient Hardware-Software Approach to Network Fault Tolerance With InfiniBand
Jul 2009
In the last decade or so, clusters have observed a tremendous rise in popularity due to excellent price to performance ratio. A variety of Interconnects have been proposed during this period, with...
Provided by Ohio State University
-
White Papers
A Software Based Approach for Providing Network Fault Tolerance in Clusters With UDAPL Interface: MPI Level Design and Performance Evaluation
Jan 2011
In the arena of cluster computing, MPI has emerged as the de facto standard for writing parallel applications. At the same time, introduction of high speed RDMA-enabled interconnects like...
Provided by Ohio State University
-
Whitepapers
A Probabilistic Bundle Relay Strategy in Two-Hop Vehicular Delay Tolerant Networks
Apr 2011
A persisting major challenge in Vehicular Delay Tolerant Networks (VDTNs) is the delay minimization of data delivery when communicating nodes are stationary, arbitrarily deployed along roadsides...
Provided by Institute of Electrical & Electronic Engineers
-
White Papers
Fault-Tolerant Architecture With Dynamic Wavelength and Bandwidth Allocation Scheme in WDM-EPON
Mar 2008
This study proposes a novel fault-tolerant architecture in WDM-EPON, Cost-based Fault-tolerant WDM-EPON (CFT-WDM-EPON), to provide overall protection. The CFT-WDM-EPON only equips a backup feeder...
Provided by Yuan Ze University
-
White Papers
A Fault Tolerant, Peer-to-Peer Replication Network
Jan 2011
A peer-to-peer, commonly abbreviated to P2P, is any distributed network architecture composed of participants that make a portion of their resources (such as processing power, disk storage or...
Provided by Institute of Electrical and Electronics Engineers
-
White Papers
On the Possibility of Consensus in Asynchronous Systems
Jan 2011
The authors demonstrate that the leader election and consensus problems are solvable in a timed asynchronous distributed system provided a majority of processes are always eventually able to...
Provided by King Fahd University of Petroleum & Minerals
-
White Papers
Algorithms for Fault-Tolerant Topology in Heterogeneous Wireless Sensor Networks
Apr 2008
This paper addresses fault-tolerant topology control in a heterogeneous wireless sensor network consisting of several resource-rich supernodes, used for data relaying, and a large number of...
Provided by Institute of Electrical and Electronics Engineers
-
White Papers
Designing Efficient Algorithms for the Eventually Perfect Failure Detector Class
Oct 2007
The concept of an unreliable failure detector was introduced by Chandra and Toueg as a mechanism that provides (possibly incorrect) information about process failures. This mechanism has been used...
Provided by Academy Publisher
-
White Papers
Prime: Byzantine Replication Under Attack
Jun 2010
Existing Byzantine-resilient replication protocols satisfy two standard correctness criteria, safety and liveness, even in the presence of Byzantine faults. The runtime performance of these...
Provided by Johns Hopkins University
-
White Papers
Intrusion-Tolerant Group Management for Mobile Ad-Hoc Networks
Mar 2009
This paper presents PICO, a generic infrastructure for secure group communication in Mobile Ad-hoc NETworks (MANETs). PICO provides an intrusion-tolerant group management service, allowing clients...
Provided by Johns Hopkins University
-
White Papers
MobiCom Poster Abstract: An Energy-Efficient Fault-Tolerant Monitoring System for Sensor Networks
Oct 2007
Because sensors are often deployed in harsh and/or adversarial environments, the sensors or the communication links may fail and hence endanger the mission of the sensor network. Although using...
Provided by Pennsylvania State University
-
White Papers
Failure Tolerance in Petascale Computers
Nov 2007
Three of the most difficult and growing problems in future High-Performance Computing (HPC) installations will be avoiding, coping and recovering from failures. The coming PetaFLOPS clusters will...
Provided by Carnegie Mellon University
-
White Papers
Reputation-Based Framework for High Integrity Sensor Networks
May 2008
Sensor network technology promises a vast increase in automatic data collection capabilities through efficient deployment of tiny sensing devices. The technology will allow users to measure...
Provided by Association for Computing Machinery
Keep Up with TechRepublic
Submit a Paper
Get your content listed in our directory!
Our directory is the largest library of vendor-supplied technical content on the Web. It’s also the first place IT decision makers turn to when researching technology solutions. Our members are already finding your competitors’ papers here - shouldn’t they find yours, too? It's FREE so click here and submit your white paper, case study, data sheet, research report, or other document today!



