Detail aplikovaného výsledku

PDF DOM Parser

BURGET, R.

Originální název

PDF DOM Parser

Anglický název

PDF DOM Parser

Druh

Software

Abstrakt

Pdf2Dom is a PDF parser that converts the documents to a HTML DOM representation. The obtained DOM tree may be then serialized to a HTML file or further processed. The inline CSS definitions contained in the resulting document are used for making the HTML page as similar as possible to the PDF input. A command-line utility for converting the PDF documents to HTML is included in the distribution package. Pdf2D0m may be also used as an independent Java library with a standard DOM interface for your DOM-based applications or as an alternative parser for the CSSBox rendering engine in order to add the PDF processing capability to CSSBox.

Abstrakt aglicky

Pdf2Dom is a PDF parser that converts the documents to a HTML DOM representation. The obtained DOM tree may be then serialized to a HTML file or further processed. The inline CSS definitions contained in the resulting document are used for making the HTML page as similar as possible to the PDF input. A command-line utility for converting the PDF documents to HTML is included in the distribution package. Pdf2D0m may be also used as an independent Java library with a standard DOM interface for your DOM-based applications or as an alternative parser for the CSSBox rendering engine in order to add the PDF processing capability to CSSBox.

Klíčová slova

PDF DOM HTML parser convertor java

Klíčová slova anglicky

PDF DOM HTML parser convertor java

Umístění

http://cssbox.sourceforge.net/pdf2dom

Licenční poplatek

K využití výsledku jiným subjektem je vždy nutné nabytí licence

www