# Performance analysis of big data frameworks on virtualized clusters > Ilham A.A. URL kanonis: https://discover.unhas.ac.id/publications/performance-analysis-of-big-data-frameworks-on-virtualized-clusters Jurnal / Konferensi: Proceedings of the 3rd International Conference on Informatics and Computing Icic 2018 Tahun terbit: 2018 DOI: https://doi.org/10.1109/IAC.2018.8780502 Citations: 0 ## Authors - Ilham A.A. ## Abstract Research on Big Data applications has become increasingly important for institutions and researchers worldwide. This trend is triggered by the increasingly use of systems and devices that leads to generate massive of electronic data each day. The implementation of conventional algorithms has been considered to be less efficient on managing and processing large datasets. In Big Data computation, Hadoop and Apache Spark are two open source frameworks that are commonly used and run on physical clusters. Since running these frameworks on a physical cluster costs more energy and rigid in management, in this research we evaluated their performance on virtualized clusters. Virtualization technology offers flexibility on managing cluster by sharing the resources to multiple instances. Our experiments show that in general Apache Spark is about 2-9 times better in execution time and throughput compared with Hadoop running on a virtualized environment. ## Keywords - SPARK (programming language) - Big data - Computer science - Virtualization - Throughput - Flexibility (engineering) - Cluster (spacecraft) - Computer cluster - Distributed computing - Operating system - Computation - Cloud computing - Wireless - Statistics - Programming language - Mathematics - Algorithm --- Sumber: Discover Unhas — RIMS Universitas Hasanuddin. Saat mengutip, gunakan DOI bila tersedia atau URL kanonis di atas.