User-defined Search in RonPub publications http://www.ronpub.com/publications/search.php?journal=ALL&author=Dennis+Marten&exactauthor=on&title=&abstract=&volume=&issue=&year1=&year2=&searchtype=advanced This feed contains the result of an user-defined search in RonPub publications en-us Dennis Marten and Andreas Heuer: Machine Learning on Large Databases: Transforming Hidden Markov Models to SQL Statements, Open Journal of Databases (OJDB), 4 (1), pages 22-42, URN: urn:nbn:de:101:1-2017100112181, 2017 https://www.ronpub.com/ojdb/OJDB_2017v4i1n02_Marten.html http://nbn-resolving.de/urn:nbn:de:101:1-2017100112181 Machine Learning is a research field with substantial relevance for many applications in different areas. Because of technical improvements in sensor technology, its value for real life applications has even increased within the last years. Nowadays, it is possible to gather massive amounts of data at any time with comparatively little costs. While this availability of data could be used to develop complex models, its implementation is often narrowed because of limitations in computing power. In order to overcome performance problems, developers have several options, such as improving their hardware, optimizing their code, or use parallelization techniques like the MapReduce framework. Anyhow, these options might be too cost intensive, not suitable, or even too time expensive to learn and realize. Following the premise that developers usually are not SQL experts we would like to discuss another approach in this paper: using transparent database support for Big Data Analytics. Our aim is to automatically transform Machine Learning algorithms to parallel SQL database systems. In this paper, we especially show how a Hidden Markov Model, given in the analytics language R, can be transformed to a sequence of SQL statements. These SQL statements will be the basis for a (inter-operator and intra-operator) parallel execution on parallel DBMS as a second step of our research, not being part of this paper. Dennis Marten, Holger Meyer, Daniel Dietrich and Andreas Heuer: Sparse and Dense Linear Algebra for Machine Learning on Parallel-RDBMS Using SQL, Open Journal of Big Data (OJBD), 5 (1), pages 1-34, URN: urn:nbn:de:101:1-2018122318341069172957, 2019 https://www.ronpub.com/ojbd/OJBD_2019v5i1n01_Marten.html http://nbn-resolving.de/urn:nbn:de:101:1-2018122318341069172957 While computational modelling gets more complex and more accurate, its calculation costs have been increasing alike. However, working on big data environments usually involves several steps of massive unfiltered data transmission. In this paper, we continue our work on the PArADISE framework, which enables privacy aware distributed computation of big data scenarios, and present a study on how linear algebra operations can be calculated over parallel relational database systems using SQL. We investigate the ways to improve the computation performance of algebra operations over relational databases and show how using database techniques impacts the computation performance like the use of indexes, choice of schema, query formulation and others. We study the dense and sparse problems of linear algebra over relational databases and show that especially sparse problems can be efficiently computed using SQL. Furthermore, we present a simple but universal technique to improve intra-operator parallelism for linear algebra operations in order to support the parallel computation of big data.