Volume 1, issue 1 of Open Journal of Big Data(OJBD), ISSN 2365-029X

Volume 1, issue 1 of Open Journal of Big Data(OJBD), ISSN 2365-029X http://www.ronpub.com/index.php/journals/OJBD/issues?volume=1&issue=1 All papers of this issue en-us Victor Chang, Muthu Ramachandran, Robert John Walters and Gary B. Wills: Introductory Editorial, Open Journal of Big Data (OJBD), 1 (1), pages 1-3, URN: urn:nbn:de:101:1-201705194326, 2015 https://www.ronpub.com/ojbd/OJBD_2015v1i1n01_Chang.html http://nbn-resolving.de/urn:nbn:de:101:1-201705194326 The Open Journal of Big Data is a new open access journal published by RonPub, and RonPub is an academic publisher of online, open access, peer-reviewed journals. OJBD addresses aspects of Big Data, including new methodologies, processes, case studies, poofs-of-concept, scientific demonstrations, industrial applications and adoption. This editorial presents the two articles in this first issue. The first paper is on An Efficient Approach for Cost Optimization of the Movement of Big Data, which mainly focuses on the challenge of moving big data from one data center to other.The second paper is on Cognitive Spam Recognition Using Hadoop and Multicast-Update, which describes a method to make machines cognitively label spam using Machine Learning and the Naive Bayesian approach. OJBD has a rising reputation thanks to the support of research communities, which help us set up the First International Conference on Internet of Things and Big Data 2016 (IoTBD 2016), in Rome, Italy, between 23 and 25 April 2016. Prasad Teli, Manoj V. Thomas and K. Chandrasekaran: An Efficient Approach for Cost Optimization of the Movement of Big Data, Open Journal of Big Data (OJBD), 1 (1), pages 4-15, URN: urn:nbn:de:101:1-201705194335, 2015 https://www.ronpub.com/ojbd/OJBD_2015v1i1n02_Teli.html http://nbn-resolving.de/urn:nbn:de:101:1-201705194335 With the emergence of cloud computing, Big Data has caught the attention of many researchers in the area of cloud computing. As the Volume, Velocity and Variety (3 Vs) of big data are growing exponentially, dealing with them is a big challenge, especially in the cloud environment. Looking at the current trend of the IT sector, cloud computing is mainly used by the service providers to host their applications. A lot of research has been done to improve the network utilization of WAN (Wide Area Network) and it has achieved considerable success over the traditional LAN (Local Area Network) techniques. While dealing with this issue, the major questions of data movement such as from where to where this big data will be moved and also how the data will be moved, have been overlooked. As various applications generating the big data are hosted in geographically distributed data centers, they individually collect large volume of data in the form of application data as well as the logs. This paper mainly focuses on the challenge of moving big data from one data center to other. We provide an efficient algorithm for the optimization of cost in the movement of the big data from one data center to another for offline environment. This approach uses the graph model for data centers in the cloud and results show that the adopted mechanism provides a better solution to minimize the cost for data movement. Mukund YR, Sunil Sandeep Nayak and K. Chandrasekaran: Cognitive Spam Recognition Using Hadoop and Multicast-Update, Open Journal of Big Data (OJBD), 1 (1), pages 16-28, URN: urn:nbn:de:101:1-201705194340, 2015 https://www.ronpub.com/ojbd/OJBD_2015v1i1n03_YR.html http://nbn-resolving.de/urn:nbn:de:101:1-201705194340 In today's world of exponentially growing technology, spam is a very common issue faced by users on the internet. Spam not only hinders the performance of a network, but it also wastes space and time, and causes general irritation and presents a multitude of dangers - of viruses, malware, spyware and consequent system failure, identity theft, and other cyber criminal activity. In this context, cognition provides us with a method to help improve the performance of the distributed system. It enables the system to learn what it is supposed to do for different input types as different classifications are made over time and this learning helps it increase its accuracy as time passes. Each system on its own can only do so much learning, because of the limited sample set of inputs that it gets to process. However, in a network, we can make sure that every system knows the different kinds of inputs available and learns what it is supposed to do with a better success rate. Thus, distribution and combination of this cognition across different components of the network leads to an overall improvement in the performance of the system. In this paper, we describe a method to make machines cognitively label spam using Machine Learning and the Naive Bayesian approach. We also present two possible methods of implementation - using a MapReduce Framework (hadoop), and also using messages coupled with a multicast-send based network - with their own subtypes, and the pros and cons of each. We finally present a comparative analysis of the two main methods and provide a basic idea about the usefulness of the two in various different scenarios.