Detail aplikovaného výsledku

PDF Analysis Tools

BURGET, R.

Originální název

PDF Analysis Tools

Anglický název

PDF Analysis Tools

Druh

Software

Abstrakt

A set of utilities for advanced PDF document analysis. Unlike the existing PDF to HTML convertors that focus on obtaining a DOM or HTML representation of the document that is visually as close as possible to the original document, the goal of the PDF Analysis Tools is to produce an output document that has the same logical stucture. For this purpose, the tools implement different algorithms for detecting common graphical patterns in the source PDF document that can be represented by some standard HTML elements and CSS constructions. The resulting document may not display exactly as the source PDF but it is more suitable for further analysis and/or editing.

Abstrakt aglicky

A set of utilities for advanced PDF document analysis. Unlike the existing PDF to HTML convertors that focus on obtaining a DOM or HTML representation of the document that is visually as close as possible to the original document, the goal of the PDF Analysis Tools is to produce an output document that has the same logical stucture. For this purpose, the tools implement different algorithms for detecting common graphical patterns in the source PDF document that can be represented by some standard HTML elements and CSS constructions. The resulting document may not display exactly as the source PDF but it is more suitable for further analysis and/or editing.

Klíčová slova

document analysis, PDF

Klíčová slova anglicky

document analysis, PDF

Umístění

https://github.com/FitLayout/PDFAnalyzer

Licenční poplatek

K využití výsledku jiným subjektem je vždy nutné nabytí licence

www