TY - GEN
T1 - An efficient stream archiving method by operator merge and write control
AU - Abe, Taiga
AU - Kawashima, Hideyuki
AU - Kitagawa, Hiroyuki
PY - 2012
Y1 - 2012
N2 - Stream data have become ubiquitous with the development of sensing devices, and therefore stream processing technology has also gained significance. A requirement of stream data processing is to provide persistence to stream data. This is simple, however it is not a trivial problem because, to provide persistence to data, data should be written to a persistent device such as disk. Disk access speed tends to be degraded by random accesses, and multiple queries exacerbate the problem. In this paper, we propose an efficient archiving method for data streams. The first technique is referred to as goperator merge. h It merges multiple operators to a single operator. Since the number of operators accessing the disk is reduced from N to 1 for each stream, the number of disk accesses is reduced, thereby accelerating archiving. The second technique is referred to as gwrite control. h It divides archiving procedures into three parts, it then assigns threads for each part so that the threads run simultaneously. The first thread conducts operators, the second writes tuples to a shared intermediate region on disk, and the third reads tuples from it and finalizes the writing of tuples. Controlling the threads so that they do not interfere with disk accesses accelerates archiving. Experiments showed that our proposal dramatically improved performance compared with the nave method. With a tuple size of 10KB, our proposed system did not violate any deadline misses until the input rate reached 100 tuples /s.
AB - Stream data have become ubiquitous with the development of sensing devices, and therefore stream processing technology has also gained significance. A requirement of stream data processing is to provide persistence to stream data. This is simple, however it is not a trivial problem because, to provide persistence to data, data should be written to a persistent device such as disk. Disk access speed tends to be degraded by random accesses, and multiple queries exacerbate the problem. In this paper, we propose an efficient archiving method for data streams. The first technique is referred to as goperator merge. h It merges multiple operators to a single operator. Since the number of operators accessing the disk is reduced from N to 1 for each stream, the number of disk accesses is reduced, thereby accelerating archiving. The second technique is referred to as gwrite control. h It divides archiving procedures into three parts, it then assigns threads for each part so that the threads run simultaneously. The first thread conducts operators, the second writes tuples to a shared intermediate region on disk, and the third reads tuples from it and finalizes the writing of tuples. Controlling the threads so that they do not interfere with disk accesses accelerates archiving. Experiments showed that our proposal dramatically improved performance compared with the nave method. With a tuple size of 10KB, our proposed system did not violate any deadline misses until the input rate reached 100 tuples /s.
KW - Data streams
KW - data archiving
KW - multiple query optimization
UR - http://www.scopus.com/inward/record.url?scp=84860764298&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84860764298&partnerID=8YFLogxK
U2 - 10.1109/WAINA.2012.227
DO - 10.1109/WAINA.2012.227
M3 - Conference contribution
AN - SCOPUS:84860764298
SN - 9780769546520
T3 - Proceedings - 26th IEEE International Conference on Advanced Information Networking and Applications Workshops, WAINA 2012
SP - 783
EP - 788
BT - Proceedings - 26th IEEE International Conference on Advanced Information Networking and Applications Workshops, WAINA 2012
T2 - 26th IEEE International Conference on Advanced Information Networking and Applications Workshops, WAINA 2012
Y2 - 26 March 2012 through 29 March 2012
ER -