SimCleaner -- Sistema de Padroniza\c{c}\~ao de Bases de Dados utilizando Fun\c{c}\~oes de Similaridade

The Knowledge Discovery in Database (KDD) process permits the detection of pattern in databases, where this analysis may be compromised if database is not consistent, making necessary the use of data cleaning techniques. This paper presents a tool based in similarity functions to help the preprocess...

Full description

Saved in:
Bibliographic Details
Main Authors: Damasceno, Carlos Diego Nascimento, Lobato, Fabio Manoel França, Moutinho, Elton Rocha, de França, Arilene Santos, de Oliveira, Ivan Ikikame, de Santana, Ádamo Lima
Format: Journal Article
Language:English
Published: 27-07-2021
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The Knowledge Discovery in Database (KDD) process permits the detection of pattern in databases, where this analysis may be compromised if database is not consistent, making necessary the use of data cleaning techniques. This paper presents a tool based in similarity functions to help the preprocessing of databases and it behaved efficiently in the standardization of a System of Public Security of the State of Par\'a database and may be reused with other databases and other data mining projects.
DOI:10.48550/arxiv.2107.12884