Přístupnostní navigace
E-application
Search Search Close
Detail aplikovaného výsledku
BURGET, R.
Original Title
PDF DOM Parser
English Title
Type
Software
Abstract
Pdf2Dom is a PDF parser that converts the documents to a HTML DOM representation. The obtained DOM tree may be then serialized to a HTML file or further processed. The inline CSS definitions contained in the resulting document are used for making the HTML page as similar as possible to the PDF input. A command-line utility for converting the PDF documents to HTML is included in the distribution package. Pdf2D0m may be also used as an independent Java library with a standard DOM interface for your DOM-based applications or as an alternative parser for the CSSBox rendering engine in order to add the PDF processing capability to CSSBox.
Abstrakt aglicky
Keywords
PDF DOM HTML parser convertor java
Key words in English
Location
http://cssbox.sourceforge.net/pdf2dom
Licence fee
In order to use the result by another entity, it is always necessary to acquire a license
www