Text2arff: A text representation library
Which features are the most important for the text classification tasks? In the automatic text categorization area, several studies seek answers to this question. In this paper, new version of Text2arff (a library for text representation) and its new features (word2vec, Word trajectories, etc.) are...
Saved in:
Published in: | 2016 24th Signal Processing and Communication Application Conference (SIU) pp. 197 - 200 |
---|---|
Main Authors: | , |
Format: | Conference Proceeding |
Language: | English |
Published: |
IEEE
01-05-2016
|
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Which features are the most important for the text classification tasks? In the automatic text categorization area, several studies seek answers to this question. In this paper, new version of Text2arff (a library for text representation) and its new features (word2vec, Word trajectories, etc.) are presented. Also, the software is now a java library which can be used in the user's own projects. In the experiments, the library is run on two sample datasets. The results show that the effect of text representation method is bigger than the classification method. This result also emphasizes the importance of developing new test representation methods. |
---|---|
DOI: | 10.1109/SIU.2016.7495711 |