SimCleaner -- Sistema de Padroniza\c{c}\~ao de Bases de Dados utilizando Fun\c{c}\~oes de Similaridade
The Knowledge Discovery in Database (KDD) process permits the detection of pattern in databases, where this analysis may be compromised if database is not consistent, making necessary the use of data cleaning techniques. This paper presents a tool based in similarity functions to help the preprocess...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | Journal Article |
Language: | English |
Published: |
27-07-2021
|
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | The Knowledge Discovery in Database (KDD) process permits the detection of
pattern in databases, where this analysis may be compromised if database is not
consistent, making necessary the use of data cleaning techniques. This paper
presents a tool based in similarity functions to help the preprocessing of
databases and it behaved efficiently in the standardization of a System of
Public Security of the State of Par\'a database and may be reused with other
databases and other data mining projects. |
---|---|
DOI: | 10.48550/arxiv.2107.12884 |