Volume 1 of Open Journal of Semantic Web(OJSW), ISSN 2199-336X

Volume 1 of Open Journal of Semantic Web(OJSW), ISSN 2199-336X http://www.ronpub.com/index.php/journals/OJSW/issues?volume=1&issue=ALL All papers of this volume en-us Jinwu Li, Vincent Wade and Melike Sah: Developing Knowledge Models of Social Media: A Case Study on LinkedIn, Open Journal of Semantic Web (OJSW), 1 (2), pages 1-24, URN: urn:nbn:de:101:1-201705194841, 2014 https://www.ronpub.com/ojsw/OJSW-v1i2n01_Li.html http://nbn-resolving.de/urn:nbn:de:101:1-201705194841 User Generated Content (UGC) exchanged via large Social Network is considered a very important knowledge source about all aspects of the social engagements (e.g. interests, events, personal information, personal preferences, social experience, skills etc.). However this data is inherently unstructured or semi-structured. In this paper, we describe the results of a case study on LinkedIn Ireland public profiles. The study investigated how the available knowledge could be harvested from LinkedIn in a novel way by developing and applying a reusable knowledge model using linked open data vocabularies and semantic web. In addition, the paper discusses the crawling and data normalisation strategies that we developed, so that high quality metadata could be extracted from the LinkedIn public profiles. Apart from the search engine in LinkedIn.com itself, there are no well known publicly available endpoints that allow users to query knowledge concerning the interests of individuals on LinkedIn. In particular, we present a system that extracts and converts information from raw web pages of LinkedIn public profiles into a machine-readable, interoperable format using data mining and Semantic Web technologies. The outcomes of our research can be summarized as follows: (1) A reusable knowledge model which can represent LinkedIn public users and company profiles using linked data vocabularies and structured data, (2) a public SPARQL endpoint to access structured data about Irish industry and public profiles, (3) a scalable data crawling strategy and mashup based data normalisation approach. The proposed data mining and knowledge representation proposed in this paper are evaluated in four ways: (1) We evaluate metadata quality using automated techniques, such as data completeness and data linkage. (2) Data accuracy is evaluated via user studies. In particular, accuracy is evaluated by comparison of manually entered metadata fields and the metadata which was automatically extracted. (3) User perceived metadata quality is measured by asking users to rate the automatically extracted metadata in user studies. (4) Finally, the paper discusses how the extracted metadata suits for a user interface design. Overall, the evaluations show that the extracted metadata is of high quality and meets the requirements of a data visualisation user interface. Sven Groppe, Thomas Kiencke, Stefan Werner, Dennis Heinrich, Marc Stelzner and Le Gruenwald: P-LUPOSDATE: Using Precomputed Bloom Filters to Speed Up SPARQL Processing in the Cloud, Open Journal of Semantic Web (OJSW), 1 (2), pages 25-55, URN: urn:nbn:de:101:1-201705194858, 2014 https://www.ronpub.com/ojsw/OJSW-v1i2n02_Groppe.html http://nbn-resolving.de/urn:nbn:de:101:1-201705194858 Increasingly data on the Web is stored in the form of Semantic Web data. Because of today's information overload, it becomes very important to store and query these big datasets in a scalable way and hence in a distributed fashion. Cloud Computing offers such a distributed environment with dynamic reallocation of computing and storing resources based on needs. In this work we introduce a scalable distributed Semantic Web database in the Cloud. In order to reduce the number of (unnecessary) intermediate results early, we apply bloom filters. Instead of computing bloom filters, a time-consuming task during query processing as it has been done traditionally, we precompute the bloom filters as much as possible and store them in the indices besides the data. The experimental results with data sets up to 1 billion triples show that our approach speeds up query processing significantly and sometimes even reduces the processing time to less than half. José M. Giménez-Garcia, Javier D. Fernández and Miguel A. Martínez-Prieto: MapReduce-based Solutions for Scalable SPARQL Querying, Open Journal of Semantic Web (OJSW), 1 (1), pages 1-18, URN: urn:nbn:de:101:1-201705194824, 2014 https://www.ronpub.com/ojsw/OJSW-v1i1n02_Garcia.html http://nbn-resolving.de/urn:nbn:de:101:1-201705194824 The use of RDF to expose semantic data on the Web has seen a dramatic increase over the last few years. Nowadays, RDF datasets are so big and rconnected that, in fact, classical mono-node solutions present significant scalability problems when trying to manage big semantic data. MapReduce, a standard framework for distributed processing of great quantities of data, is earning a place among the distributed solutions facing RDF scalability issues. In this article, we survey the most important works addressing RDF management and querying through diverse MapReduce approaches, with a focus on their main strategies, optimizations and results. Ismael Navas-Delgado and José F. Aldana-Montes: BioSStore: A Client Interface for a Repository of Semantically Annotated Bioinformatics Web Services, Open Journal of Semantic Web (OJSW), 1 (1), pages 19-29, URN: urn:nbn:de:101:1-201705194836, 2014 https://www.ronpub.com/ojsw/OJSW-v1i1n03_Delgado.html http://nbn-resolving.de/urn:nbn:de:101:1-201705194836 Bioinformatics has shown itself to be a domain in which Web services are being used extensively. In this domain, simple but real services are being developed. Thus, there are huge repositories of real services available (for example BioMOBY main repository includes more than 1500 services). Besides, bioinformatics repositories usually have active communities using and working on improvements. However, these kinds of repositories do not exploit the full potential of Web services (and SOA, Service Oriented Applications, in general). On the other hand, sophisticated technologies have been proposed to improve SOA, including the annotation on Web services to explicitly describe them. However, these approaches are lacking in repositories with real services. In the work presented here, we address the drawbacks present in bioinformatics services and try to improve the current semantic model by introducing the use of the W3C standard Semantic Annotations for WSDL and XML Schema (SAWSDL) and related proposals (WSMO Lite). This paper focuses on a user interface that takes advantage of a repository of semantically annotated bioinformatics Web services. In this way, we exploit semantics for the discovery of Web services, showing how the use of semantics will improve the user searches. The BioSStore is available at http://biosstore.khaos.uma.es. This portal will contain also future developments of this proposal.