Deduplication Methods Using Levenshtein Distance Algorithm

Bibliographic Details
Published in: Journal of Electrical Systems vol. 20, no. 7s (2024), p. 997
Main Author: Valeriano, Eugene S
Published: Engineering and Scientific Research Groups
Subjects:
Online Access: Citation/Abstract
Full Text - PDF

MARC

LEADER 00000nab a2200000uu 4500
001 3081859542
003 UK-CbPIL
022 |a 1112-5209 
035 |a 3081859542 
045 2 |b d20240101  |b d20241231 
100 1 |a Valeriano, Eugene S  |u Tarlac Agricultural University, Santa Ignacia, Philippines 
245 1 |a Deduplication Methods Using Levenshtein Distance Algorithm 
260 |b Engineering and Scientific Research Groups  |c 2024 
513 |a Journal Article 
520 3 |a The study aimed to propose methods to improve the data integrity of relational databases such as MS SQL, MySQL and PostgreSQL via duplicate record detection. The FODORS and ZAGAT Restaurant database benchmark datasets were used to facilitate the processes involved in preparing and delivering high-quality data. Furthermore, the Levenshtein distance algorithm was used to propose three (3) methods, namely default, eliminating equal string, and knowledge-based libraries, to cut duplicate records in the database. At the selected 70% threshold, the average number of detected duplicate records was 88 out of 112 records in the restaurant dataset. Finally, efficient detection of duplicate records in the database depends on the data being analyzed and the threshold selected. 
653 |a Algorithms 
653 |a Datasets 
653 |a Relational data bases 
653 |a Data integrity 
653 |a Books 
653 |a Databases 
653 |a Methods 
653 |a Libraries 
653 |a Restaurants 
653 |a Structured Query Language-SQL 
653 |a Servers 
653 |a Efficiency 
653 |a COVID-19 
773 0 |t Journal of Electrical Systems  |g vol. 20, no. 7s (2024), p. 997 
786 0 |d ProQuest  |t Engineering Database 
856 4 1 |3 Citation/Abstract  |u https://www.proquest.com/docview/3081859542/abstract/embedded/L8HZQI7Z43R0LA5T?source=fedsrch 
856 4 0 |3 Full Text - PDF  |u https://www.proquest.com/docview/3081859542/fulltextPDF/embedded/L8HZQI7Z43R0LA5T?source=fedsrch
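
Illustrative note: the abstract describes flagging duplicate records by comparing a Levenshtein-based score against a threshold (70% in the reported result). The record does not give the author's implementation, so the following is only a minimal sketch under stated assumptions: similarity is taken as 1 - distance / (length of the longer string), comparison is case-insensitive, and the function names and sample restaurant names are hypothetical. It corresponds most closely to a plain pairwise comparison and does not reproduce the "eliminating equal string" or "knowledge-based libraries" methods.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum insertions, deletions, substitutions to turn a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop to save memory
    previous = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution or match
        previous = current
    return previous[-1]


def similarity(a: str, b: str) -> float:
    """Assumed normalization: 1.0 means identical, 0.0 means nothing in common."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(a, b) / longest


def find_duplicates(records, threshold=0.70):
    """Return index pairs (i, j), i < j, whose similarity meets the threshold."""
    pairs = []
    for i in range(len(records)):
        for j in range(i + 1, len(records)):
            # Assumption: case differences should not count as edits.
            if similarity(records[i].lower(), records[j].lower()) >= threshold:
                pairs.append((i, j))
    return pairs


# Hypothetical sample values, not taken from the benchmark dataset.
names = ["Cafe Bizou", "Cafe Bizu", "Spago"]
duplicate_pairs = find_duplicates(names)
```

The pairwise loop is quadratic in the number of records, which is acceptable for a dataset of a few hundred rows; larger tables would usually need blocking or indexing before comparison. As the abstract notes, the useful threshold depends on the data being analyzed.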