Detail aplikovaného výsledku

WTF-LOD Extractor

OTRUSINA, L.; SMRŽ, P.

Originální název

WTF-LOD Extractor

Anglický název

WTF-LOD Extractor

Druh

Software

Abstrakt

This software creates the Web TextFull linkage to Linked Open Data (WTF-LOD) dataset intended for large-scale evaluation of named entity recognition (NER) systems from the largest publically-available textual corpora, including Wikipedia dumps, monthly runs of the CommonCrawl, and ClueWeb09/12. The software performs de-duplication of the data and advanced cleaning procedures.

Abstrakt anglicky

This software creates the Web TextFull linkage to Linked Open Data (WTF-LOD) dataset intended for large-scale evaluation of named entity recognition (NER) systems from the largest publically-available textual corpora, including Wikipedia dumps, monthly runs of the CommonCrawl, and ClueWeb09/12. The software performs de-duplication of the data and advanced cleaning procedures.

Klíčová slova

named entity evaluation, linked open data, CommonCrawl, ClueWeb, Wikipedia

Klíčová slova anglicky

named entity evaluation, linked open data, CommonCrawl, ClueWeb, Wikipedia

Umístění

http://www.fit.vutbr.cz/research/prod/index.php?id=480

Licenční poplatek

K využití výsledku jiným subjektem je vždy nutné nabytí licence

www

Dokumenty