Apache Iceberg version
0.9.1 (latest release)
Please describe the bug 🐞
When calling count() on a DataScan, limit is not respected. Seems trivial but if I set a limit of 5 I expect 5 or less rows back, at least with a scan-like implementation
The underlying ArrowScan does not get passed the limit param
https://github.com/apache/iceberg-python/blob/main/pyiceberg/table/__init__.py#L1940
This results in scans taking longer due to not respecting the limit.
The fix will involve more than just passing the limit to the ArrowScan
Willingness to contribute
Apache Iceberg version
0.9.1 (latest release)
Please describe the bug 🐞
When calling
count()on aDataScan, limit is not respected. Seems trivial but if I set a limit of 5 I expect 5 or less rows back, at least with a scan-like implementationThe underlying
ArrowScandoes not get passed the limit paramhttps://github.com/apache/iceberg-python/blob/main/pyiceberg/table/__init__.py#L1940
This results in scans taking longer due to not respecting the limit.
The fix will involve more than just passing the limit to the
ArrowScanWillingness to contribute