Skip to content

Commit 668f4ab

Browse files
committed
Support Chinese version of StructEqTable
1 parent 4ed4b38 commit 668f4ab

File tree

2 files changed

+9
-8
lines changed

2 files changed

+9
-8
lines changed

README.md

Lines changed: 8 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -19,7 +19,7 @@ Table is an effective way to represent structured data in scientific publication
1919
## TODO
2020

2121
- [x] Release inference code and checkpoints of StructEqTable.
22-
- [ ] Support Chinese version of StructEqTable.
22+
- [x] Support Chinese version of StructEqTable.
2323
- [ ] Improve the inference speed of StructEqTable.
2424

2525

@@ -34,13 +34,13 @@ pip install "git+https://github.com/UniModal4Reasoning/StructEqTable-Deploy.git"
3434

3535
```
3636

37-
## Demo
38-
- run the demo.py
37+
## Quick Demo
38+
- run the demo/demo.py
3939
```shell script
4040
cd demo
41-
python demo.py \
42-
--image_path demo/demo.png \
43-
--ckpt_path ${CKPT_PATH}
41+
42+
python demo.py \ --image_path ./demo.png \
43+
--ckpt_path ${CKPT_PATH}
4444
```
4545

4646
- Visualization Results
@@ -52,9 +52,10 @@ python demo.py \
5252

5353

5454
## Acknowledgements
55-
- [UniMERNet](https://github.com/opendatalab/UniMERNet). A Universal Network for Real-World Mathematical Expression Recognition.
5655
- [DocGenome](https://github.com/UniModal4Reasoning/DocGenome). An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models.
5756
- [ChartVLM](https://github.com/UniModal4Reasoning/ChartVLM). A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning.
57+
- [Pix2Struct](https://github.com/google-research/pix2struct). Screenshot Parsing as Pretraining for Visual Language Understanding.
58+
- [UniMERNet](https://github.com/opendatalab/UniMERNet). A Universal Network for Real-World Mathematical Expression Recognition.
5859
- [Donut](https://huggingface.co/naver-clova-ix/donut-base). The UniMERNet's Transformer Encoder-Decoder are referenced from Donut.
5960
- [Nougat](https://github.com/facebookresearch/nougat). The tokenizer uses Nougat.
6061

demo/demo.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@
99
def parse_config():
1010
parser = argparse.ArgumentParser(description='arg parser')
1111
parser.add_argument('--image_path', type=str, default='demo.png', help='data path for table image')
12-
parser.add_argument('--ckpt_path', type=str, default='', help='ckpt path for table model')
12+
parser.add_argument('--ckpt_path', type=str, default='U4R/StructTable-base', help='ckpt path for table model, which can be downloaded from huggingface')
1313
parser.add_argument('--cpu', action='store_true', default=False, help='using cpu for inference')
1414
parser.add_argument('--html', action='store_true', default=False, help='output html format table code')
1515
args = parser.parse_args()

0 commit comments

Comments
 (0)