Parallel Processing of Data From Very Large-Scale Wireless Sensor Networks
In this paper the authors explore the problems of storing and reasoning about data collected from very large-scale Wireless Sensor Networks (WSNs). Potential worldwide deployment of WSNs for, e.g., environmental monitoring purposes could yield data in amounts of petabytes each year. Distributed database solutions such as BigTable and Hadoop are capable of dealing with storage of such amounts of data. However, it is far from clear whether the associated MapReduce programming model is suitable for processing of sensor data. This is because typical applications MapReduce is used for, currently are relational in nature, whereas for sensing data one is usually interested in spatial structure of data instead.