Programming platforms for big data analysis
Cao, Jiannong, Chawla, Shailey, Wang, Yuqi, and Wu, Hanqing (2017) Programming platforms for big data analysis. In: Zomaya, Albert Y., and Sakr, Sherif, (eds.) Handbook of Big Data Technologies. Springer, Cham, Switzerland, pp. 65-99.
PDF (Published Version)
- Published Version
Restricted to Repository staff only |
Abstract
Big data analysis imposes new challenges and requirements on programming support. Programming platforms need to provide new abstractions and run time techniques with key features like scalability,fault tolerance, efficient task distribution, usability and processing speed. In this chapter, we first provide a comprehensive survey of the requirements, give an overview and classify existing big data programming platforms based on different dimensions. Then, we present details of the architecture, methodology and features of major programming platforms like MapReduce, Storm, Spark, Pregel, GraphLab, etc. Last, we compare existing big data platforms, discuss the need for a unifying framework, present our proposed framework MatrixMap, and give a vision about future work.