Text2arff: A text representation library

Bibliographic Details
Published in: 2016 24th Signal Processing and Communication Application Conference (SIU), pp. 197-200
Main Authors: Can, Ender; Amasyali, Mehmet Fatih
Format: Conference Proceeding
Language: English
Published: IEEE 01-05-2016
Description
Summary: Which features are the most important for text classification tasks? In the area of automatic text categorization, several studies seek answers to this question. This paper presents a new version of Text2arff (a library for text representation) and its new features (word2vec, word trajectories, etc.). The software is also now a Java library that can be used in users' own projects. In the experiments, the library is run on two sample datasets. The results show that the text representation method has a larger effect than the classification method. This result also emphasizes the importance of developing new text representation methods.
DOI: 10.1109/SIU.2016.7495711