vasgat/pdf-table-extractor

pdf-table-extractor

PDF Table Extractor is a tool that uses the tabulapdf library to extract tables from PDF files, applies some further processing, and stores the resulting tables in CSV files.

Note: An online version of the tool is also available, along with some documentation on how to use it.

You can run pdf-table-extractor from the command line as follows:

      java -jar .\pdf-table-extractor-1.0.jar ..\pdf_examples\COGSuppliers.pdf

      java -jar .\pdf-table-extractor-1.0.jar ..\pdf_examples\Benetton.pdf WITHOUT_RULINGS UPPER_MERGE
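If you would rather launch the tool from another Java program, the same invocation can be assembled with `ProcessBuilder`. The following is a minimal sketch, not part of this project: the class `ExtractorRunner` and its methods `buildCommand` and `run` are hypothetical names introduced for illustration. It assumes only what the commands above show, namely that the jar takes the PDF path as its first argument, optionally followed by tokens such as `WITHOUT_RULINGS` and `UPPER_MERGE`, which are passed through verbatim.

```java
import java.io.IOException;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper (not part of pdf-table-extractor) that wraps the
// command-line invocation shown above.
public class ExtractorRunner {

    // Builds: java -jar <jar> <pdf> [options...]
    // The option tokens are forwarded unchanged; this sketch does not
    // interpret or validate them.
    static List<String> buildCommand(Path jar, Path pdf, String... options) {
        List<String> cmd = new ArrayList<>();
        cmd.add("java");
        cmd.add("-jar");
        cmd.add(jar.toString());
        cmd.add(pdf.toString());
        cmd.addAll(List.of(options));
        return cmd;
    }

    // Starts the process, forwards its output to this console,
    // and returns the exit code.
    static int run(Path jar, Path pdf, String... options)
            throws IOException, InterruptedException {
        Process process = new ProcessBuilder(buildCommand(jar, pdf, options))
                .inheritIO()
                .start();
        return process.waitFor();
    }

    public static void main(String[] args) throws Exception {
        if (args.length < 2) {
            System.out.println("usage: ExtractorRunner <jar> <pdf> [options...]");
            return;
        }
        // Everything after the first two arguments is treated as an option token.
        String[] options = new String[args.length - 2];
        System.arraycopy(args, 2, options, 0, options.length);
        int exitCode = run(Path.of(args[0]), Path.of(args[1]), options);
        System.out.println("extractor exited with code " + exitCode);
    }
}
```

Keeping `buildCommand` separate from `run` makes it possible to inspect or log the exact command before anything is executed.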
