Applied result detail

Text Preprocessing Tool

ŠABATKA, O.; BARTÍK, V.

Original Title

Text Preprocessing Tool

Type

Software

Abstract

The tool preprocesses documents for text mining. It supports several document representations (words or N-grams as terms) and several term-weighting schemes (binary, TF, or TF-IDF). It also provides two standard text preprocessing steps: stop-word removal and stemming.
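
The representations and weighting schemes named in the abstract can be illustrated with a minimal Python sketch. It is not the tool's own code: the function names `terms` and `weight`, the tiny `STOPWORDS` set, the choice of word-level (rather than character-level) N-grams, and the unsmoothed `log(N / df)` form of IDF are all assumptions made for illustration. Stemming is omitted.

```python
import math
from collections import Counter

# Hypothetical stop-word list, for illustration only.
STOPWORDS = {"the", "a", "of", "and", "is"}


def terms(text, n=1):
    """Lowercase, split on whitespace, drop stop words, build word n-grams."""
    tokens = [t for t in text.lower().split() if t not in STOPWORDS]
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def weight(docs, scheme="tfidf", n=1):
    """Return one {term: weight} dict per document.

    scheme: "binary" (term present or not), "tf" (raw count),
    or "tfidf" (count * log(N / document frequency)); assumed variants.
    """
    counts = [Counter(terms(d, n)) for d in docs]
    # Document frequency: number of documents that contain each term.
    df = Counter(t for c in counts for t in c)
    vectors = []
    for c in counts:
        if scheme == "binary":
            vectors.append({t: 1 for t in c})
        elif scheme == "tf":
            vectors.append(dict(c))
        elif scheme == "tfidf":
            vectors.append(
                {t: f * math.log(len(docs) / df[t]) for t, f in c.items()}
            )
        else:
            raise ValueError(f"unknown weighting scheme: {scheme}")
    return vectors


docs = ["the cat sat", "the cat ran and ran"]
binary_vectors = weight(docs, "binary")
tf_vectors = weight(docs, "tf")
tfidf_vectors = weight(docs, "tfidf")
bigrams = terms("the cat sat on mat", n=2)
```

In this sketch a term that occurs in every document receives a TF-IDF weight of zero, which is why smoothed IDF variants are often preferred in practice.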

Keywords

text mining, preprocessing, document representation, N-grams, TF-IDF

Location

http://www.fit.vutbr.cz/~bartik/Arcbc/download.htm

Licence fee

In some cases, another entity may use the result without acquiring a license.
