An Efficient Multi-Tier Tablet Server Storage Architecture
Distributed, structured data stores such as Bigtable, HBase, and Cassandra run on a cluster of machines, each running a database-like software system called the Tablet Server Storage Layer (TSSL). The performance of the TSSL on each node directly affects the performance of the entire cluster. In this paper, the authors introduce an efficient, scalable, multi-tier storage architecture for tablet servers. Their system can use any layered mix of storage devices, such as Flash SSDs and magnetic disks. Their experiments show that a mix of technologies can improve performance for certain workloads beyond that of strictly two-tier configurations that use a single type of storage technology. To improve scalability significantly, they adapted and integrated cache-oblivious algorithms and data structures, as well as Bloom filters.
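
To make the role of Bloom filters in a layered store concrete, the following is a minimal sketch, not the authors' implementation. It assumes each tier keeps a small Bloom filter over its keys so that a read can skip any tier that definitely does not hold the key. The names BloomFilter, Tier, and lookup, the filter parameters, and the in-memory dictionaries standing in for SSDs and disks are all hypothetical and chosen only for illustration.

```python
import hashlib


class BloomFilter:
    """Fixed-size Bloom filter: no false negatives, possible false positives."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits)  # one byte per bit, for readability

    def _positions(self, key):
        # Derive num_hashes positions by salting SHA-256 with an index.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{key}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos] = 1

    def might_contain(self, key):
        return all(self.bits[pos] for pos in self._positions(key))


class Tier:
    """Hypothetical storage tier; a dict stands in for the real device."""

    def __init__(self, name):
        self.name = name
        self.data = {}
        self.filter = BloomFilter()
        self.probes = 0  # number of simulated device reads

    def put(self, key, value):
        self.data[key] = value
        self.filter.add(key)

    def get(self, key):
        if not self.filter.might_contain(key):
            return None  # definitely absent: no device read needed
        self.probes += 1
        return self.data.get(key)


def lookup(tiers, key):
    """Search tiers from fastest to slowest; None means 'not found'."""
    for tier in tiers:
        value = tier.get(key)
        if value is not None:
            return value
    return None


ssd, disk = Tier("ssd"), Tier("disk")
disk.put("row1", "value1")
print(lookup([ssd, disk], "row1"), ssd.probes, disk.probes)
```

In this sketch the read for "row1" never touches the simulated SSD tier, because that tier's filter is empty and rules the key out. A real tablet server would also have to handle deletions, compaction between tiers, and filter sizing, none of which is modeled here.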