Skip to content

CrazyCrud/pagexml-mets-viewer

Repository files navigation

PageXML Viewer

There are great PageXML viewers out there:

I just wanted to add another, lightweight viewer, that can read in multiple PageXML files as well as a OCR-D workspace (simple parsing of METS file and displaying the file groups).

Moreover, it's possible to:

  • edit or add transcriptions to existing TextLine elements and export the newly edited PageXML files.
  • add, remove or edit TextRegion as well as TextLine elements

myimage

Installation

Pip

pip install -r requirements

And then run python app.py. The app runs then under http://127.0.0.1:5000.

Docker

docker compose up --build

The app runs under http://127.0.0.1:8000/.

About

Web app to upload and display multiple PageXML files

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published