Detail aplikovaného výsledku

Text Preprocessing Tool

ŠABATKA, O.; BARTÍK, V.

Originální název

Text Preprocessing Tool

Anglický název

Text Preprocessing Tool

Druh

Software

Abstrakt

The tool enables text preprocessing of documents for text mining. It offers several possibilities of document representation (words or N-grams as terms) and several weighting methods (binary, TF or TF-IDF). It also provides two standard pre-processing procedures of text - stopwords removal and stemming.

Abstrakt aglicky

The tool enables text preprocessing of documents for text mining. It offers several possibilities of document representation (words or N-grams as terms) and several weighting methods (binary, TF or TF-IDF). It also provides two standard pre-processing procedures of text - stopwords removal and stemming.

Klíčová slova

text mining, preprocessing, document representation, N-grams,  TF-IDF

Klíčová slova anglicky

text mining, preprocessing, document representation, N-grams,  TF-IDF

Umístění

http://www.fit.vutbr.cz/~bartik/Arcbc/download.htm

Licenční poplatek

Využití výsledku jiným subjektem je v některých případech možné bez nabytí licence

www