Design and Implementation of HSQL: A SQL-like language for Data Analysis in Distributed Systems

Guardado en:
書目詳細資料
發表在:International Journal of Advanced Computer Science and Applications vol. 12, no. 11 (2021), p. n/a
主要作者: Anurag Singh Bhadauria
其他作者: Bain, Atreya, Shetty, Jyoti, Shobha, G, Chala, Arjuna, Clements, Jeremy
出版:
Science and Information (SAI) Organization Limited
主題:
在線閱讀:Citation/Abstract
Full Text - PDF
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!

MARC

LEADER 00000nab a2200000uu 4500
001 2655113462
003 UK-CbPIL
022 |a 2158-107X 
022 |a 2156-5570 
024 7 |a 10.14569/IJACSA.2021.0121190  |2 doi 
035 |a 2655113462 
045 2 |b d20210101  |b d20211231 
100 1 |a Anurag Singh Bhadauria 
245 1 |a Design and Implementation of HSQL: A SQL-like language for Data Analysis in Distributed Systems 
260 |b Science and Information (SAI) Organization Limited  |c 2021 
513 |a Journal Article 
520 3 |a In today’s modern world, we’re experiencing a substantial increase in the use of data in various fields, and this has necessitated the use of distributed systems to consume and process Big Data. Machine learning tends to benefit from the usage of Big Data, and the models generated from such techniques tend to be more effective. However, there is a steep learning curve to getting used to handling Big Data, as traditional data management tools fail to perform well. Distributed systems have become popular, where the task of data processing is split amongst various nodes in clusters. SQL, is a popular database management language popular to data scientists. It is often given second class support, where SQL can be embedded into a primary language of use (e.g. SQL in Scala for Spark), which allows for using SQL but one still needs to know the primary language of the platform (Scala, as per the example, or ECL in HPCC Systems). It may also be present as a supported language. In either case, using useful tooling such as Visualizing data and creating and using machine learning models become difficult, as the user needs to fall back to the primary language of the system. In the proposed work, a new SQL-like language, HSQL, an open source distributed systems solution, was developed for allowing new users to get used to its distributed architecture and the ECL language, with which it primarily operates with (which was chosen as a target). Additionally, a program that could translate HSQL-based programs to ECL for use was made. HSQL was made to be completely inter-compatible with ECL programs, and it was able to provide a compact and easy to comprehend SQL-like syntax for performing general data analysis, creation of Machine learning models and visualizations while allowing a modular structure to such programs. 
653 |a Language 
653 |a Big Data 
653 |a Machine learning 
653 |a Data analysis 
653 |a Data base management systems 
653 |a Data management 
653 |a Data processing 
653 |a Learning curves 
653 |a Tooling 
653 |a Modular structures 
653 |a Computer networks 
653 |a Query languages 
700 1 |a Bain, Atreya 
700 1 |a Shetty, Jyoti 
700 1 |a Shobha, G 
700 1 |a Chala, Arjuna 
700 1 |a Clements, Jeremy 
773 0 |t International Journal of Advanced Computer Science and Applications  |g vol. 12, no. 11 (2021), p. n/a 
786 0 |d ProQuest  |t Advanced Technologies & Aerospace Database 
856 4 1 |3 Citation/Abstract  |u https://www.proquest.com/docview/2655113462/abstract/embedded/L8HZQI7Z43R0LA5T?source=fedsrch 
856 4 0 |3 Full Text - PDF  |u https://www.proquest.com/docview/2655113462/fulltextPDF/embedded/L8HZQI7Z43R0LA5T?source=fedsrch