PPS-ADS: A Framework for Privacy-Preserved and Secured Distributed System Architecture for Handling Big Data

Mohd Abdul Ahad (1), Ranjit Biswas (2)
(1) Jamia Hamdard
(2) Jamia Hamdard
Fulltext View | Download
How to cite (IJASEIT) :
Ahad, Mohd Abdul, and Ranjit Biswas. “PPS-ADS: A Framework for Privacy-Preserved and Secured Distributed System Architecture for Handling Big Data”. International Journal on Advanced Science, Engineering and Information Technology, vol. 8, no. 4, Aug. 2018, pp. 1333-42, doi:10.18517/ijaseit.8.4.5465.
The exponential expansion of Big Data in 7V’s (velocity, variety, veracity, value, variability and visualization) brings forth new challenges to security, reliability, availability and privacy of these data sets. Traditional security techniques and algorithms fail to complement this gigantic big data. This paper aims to improve the recently proposed Atrain Distributed System (ADS) by incorporating new features which will cater to the end-to-end availability and security aspects of the big data in the distributed system. The paper also integrates the concept of Software Defined Networking (SDN) in ADS to effectively control and manage the routing of the data item in the ADS. The storage of data items in the ADS is done on the basis of the type of data (structured or unstructured), the capacity of the distributed system (or coach) and the distance of coach from the pilot computer (PC). In order to maintain the consistency of data and to eradicate the possible loss of data, the concept of “forward positive” and “backward positive” acknowledgment is proposed. Furthermore, we have incorporated “Twofish” cryptographic technique to encrypt the big data in the ADS. Issues like “data ownership”, “data security, “data privacy” and data reliability” are pivotal while handling the big data. The current paper presents a framework for a privacy-preserved architecture for handling the big data in an effective manner.

R. Biswas, Atrain distributed system (ads): An infinitely scalable architecture for processing big data of any 4V, Computational Intelligence for Big Data Analysis Frontier Advances and Applications: edited by D.P. Acharjya, Satchidananda Dehuri and Sugata Sanyal, Springer International Publishing. Switzerland. 2015, 3-53.

R. Biswas, r-train (train) : A new flexible dynamic data structure, INFORMATION : An International Journal (Japan) 14 (4) (2011) 1231-1246.

R. Biswas, Heterogeneous data structure r-atrain, INFORMATION : An International Journal (Japan) 15 (2) (2012) 879-902.

B. A. Forouzan, Data Communication and Networking, 4th Edition, McGraw-Hill, 2007.

Ovsiannikov M, Rus S, Reeves D, Sutter P, Rao S, Kelly J. The quantcast file system. Proceedings of the 39th International conference on Very Large Scale Databases (VLDB Endowment); Trento. 2013. p.1092-1101.doi:10.14778/2536222.2536234

Ho S, Wu C, Zhou J, Chen W, Hsu C, Hsiao H, Chung Y. Distributed metaserver mechanism and recovery mechanism support in quantcast file system. IEEE Proceeding of 39th Annual Computer Software and Applications Conference (COMPSAC); USA. 2015. p. 758-63. doi:10.1109/ compsac.2015.109

Lakshman A, Cassandra MP. A decentralized structured storage system. Proceeding of ACM SIGOPS Operating Systems; USA. 2010. p. 35-40.

Dede E, Sendir B, Kuzlu P, Hartog J, Govindaraju M. An Evaluation of Cassandra for Hadoop. IEEE Proceeding of 6th International Conference on Cloud Computing; USA.2013. p. 494-501. doi:10.1109/cloud.2013.31

Luciani J, Brisk. Better Hadoop with Cassandra. Available from: http://www.datastax.com/wp-content/uploads/ 2011/07/Brisk_fully_distributed_Hadoop.pdf

Comparing the Hadoop Distributed File System (HDFS) with the Cassandra File System (CFS), White Paper, By Datastax Corporation. 2016. Available from: https://www.datastax.com/wp-content/uploads/2012/09/WP-DataStaxHDFSvsCFS.pdf

Shvachko K, Kuang H, Radia S, Chansler R. The Hadoop distributed file system. IEEE Proceeding of the 26th Symposium on Mass Storage Systems and Technologies (MSST); USA. 2010. p. 1-10.

Borthakur D. HDFS Architecture Guide, Apache Foundation.2016. Available from: https://hadoop.apache.org/ docs/r1.2.1/hdfs_design.pdf

Gupta L. HDFS - Hadoop Distributed File System Architecture Tutorial. Available from: http://howtodoinjava.com/big-data/hadoop/hdfs-hadoop-distributed-file-systemarchitecture-tutorial/

D Singh and C K Reddy, A survey on platforms for big data analytics, Journal of Big Data, 2014, 1:8

http://www.journalofbigdata.com/content/1/1/8

M. Chen et al., Big Data: Related Technologies, Challenges and Future Prospects, SpringerBriefs in Computer Science, DOI 10.1007/978-3-319-06245-7__4

Martin Strohbach, Jorg Daubert, Herman Ravkin, and Mario Lischka, Big Data Storage, Chapter 7, J.M. Cavanillas et al. (eds.), New Horizons for a Data-Driven Economy 2016.

J.K. Park, J. Kim, Big data storage configuration and performance evaluation utilizing NDAS storage systems, AKCE International Journal of Graphs and Combinatorics (2017), https://doi.org/10.1016/j.akcej.2017.09.003.

Wei Zhou, Dan Feng, Zhipeng Tan, Yingfei Zheng, Improving Big Data Storage Performance in Hybrid Environment, Journal of Computational Science, (2017) http://dx.doi.org/10.1016/j.jocs.2017.01.003

R. Kemp, Legal aspects of managing big data, Computer Law and Security Review, 30 (5) (2014), pp. 6482-491. Elsevier. https://doi.org/10.1016/j.clsr.2014.07.006

Bruce Schneier, John Kelsey, Twofish: A 128-bit block cipher, AES Round 1 Technical Evaluation CD-1: Documentation, National Institute of Standards and Technology.

B. Schneier, J. Kelsey, N. Ferguson, The Twofish Encryption Algorithm, A 128-Bit Block Cipher, John Wiley & Sons, 1999.

https://oauth.net/2/

Ryan Boyd, Getting Started with OAuth 2.0 2012, Published by O’Reilly Media, Inc., 1005 Gravenstein Highway North, Sebastopol, CA 95472.

S. A. Diego Kreutz, Fernando M. V. Ramos, S. Uhlig, Software-defined networking: A comprehensive survey, Proceedings of the IEEE 103 (1) (2015) 14-76.

Introduction to software defined networking (SDN), Washington University in Saint Louis, Url : http://www.cse.wustl.edu/ jain/cse570-13/.

W. Braun, M. Menth, , Software-defined networking using openflow: Protocols, applications and architectural design choices, Future Internet 6 (2014) 302-336.

Software-defined networking: The new norm for networks, Open Networking Foundation (ONF),White Paper (2012) 1-12.

Authors who publish with this journal agree to the following terms:

    1. Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
    2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
    3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).