You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+28-4Lines changed: 28 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -15,12 +15,15 @@ Welcome to the official repository of StructEqTable-Deploy, a solution that conv
15
15
## Abstract
16
16
Table is an effective way to represent structured data in scientific publications, financial statements, invoices, web pages, and many other scenarios. Extracting tabular data from a visual table image and performing the downstream reasoning tasks according to the extracted data is challenging, mainly due to that tables often present complicated column and row headers with spanning cell operation. To address these challenges, we present TableX, a large-scale multi-modal table benchmark extracted from [DocGenome benchmark](https://unimodal4reasoning.github.io/DocGenome_page/) for table pre-training, comprising more than 2 million high-quality Image-LaTeX pair data covering 156 disciplinary classes. Besides, benefiting from such large-scale data, we train an end-to-end model, StructEqTable, which provides the capability to precisely obtain the corresponding LaTeX description from a visual table image and perform multiple table-related reasoning tasks, including structural extraction and question answering, broadening its application scope and potential.
17
17
18
+
## Release
19
+
-[2024/7/30] 🔥 We have released the first version of StructEqTable. (Current version of StructEqTable is able to process table images from scientific documents such as arXiv, Scihub papers. Times New Roman And Songti(宋体) are main fonts used in table image, other fonts may decrease the accuracy of the model's output.)
18
20
19
21
## TODO
20
22
21
23
-[x] Release inference code and checkpoints of StructEqTable.
22
24
-[x] Support Chinese version of StructEqTable.
23
-
-[ ] Improve the inference speed of StructEqTable.
25
+
-[ ] Expand more domains of table image to improve the model's general capabilities.
26
+
-[ ] Accelerated version of StructEqTable using TensorRT-LLM.
parser.add_argument('--image_path', type=str, default='demo.png', help='data path for table image')
12
12
parser.add_argument('--ckpt_path', type=str, default='U4R/StructTable-base', help='ckpt path for table model, which can be downloaded from huggingface')
13
+
parser.add_argument('-t', '--max_waiting_time', type=int, default=60, help='maximum waiting time of model inference')
13
14
parser.add_argument('--cpu', action='store_true', default=False, help='using cpu for inference')
14
-
parser.add_argument('--html', action='store_true', default=False, help='output html format table code')
0 commit comments