PyKX vs Polars vs Pandas As-Of Join Test

This notebook presents a comparison of Pandas, PyKX (kdb+) and Polars (both eager and lazy execution modes) for performing as-of joins and a basic slippage calculation on time-series market data.

Note:
This is an independent, community-driven test and is not an official KX (kdb+) or Polars benchmark. It is intended for technical discussion, reproducibility, and as a baseline for further performance exploration.

Overview

Goal:
Compare the speed and memory usage of Pandas, PyKX, and Polars for typical financial analytics (as-of joins with a slippage calculation).
Approach:
All engines operate on identical data loaded from local parquet files with consistent preprocessing (sorting, key alignment).
Metrics:
- Wall-clock runtime (multiple iterations)
- Memory usage (incremental, per run)
Environment:
Tests are performed as “warm” runs within a single Python process, reflecting typical batch analytical workflows.
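The runtime metric above can be sketched with a small timing harness. This is a minimal illustration under assumptions, not the notebook's actual code: the helper names `time_warm_runs` and `summarize`, the iteration count, and the placeholder workload are all hypothetical. An untimed warm-up call comes first so that the timed iterations are "warm" in the sense described under Environment.

```python
import statistics
import time


def time_warm_runs(fn, iterations=5, warmup=1):
    """Return wall-clock durations in seconds for `iterations` warm calls of `fn`.

    Hypothetical helper for illustration; not taken from the notebook.
    """
    for _ in range(warmup):
        fn()  # untimed call, so later runs do not pay one-off setup costs
    durations = []
    for _ in range(iterations):
        start = time.perf_counter()
        fn()
        durations.append(time.perf_counter() - start)
    return durations


def summarize(durations):
    """Reduce a list of durations to min / median / max."""
    return {
        "min": min(durations),
        "median": statistics.median(durations),
        "max": max(durations),
    }


if __name__ == "__main__":
    # Placeholder workload standing in for one engine's join-and-slippage step.
    runs = time_warm_runs(lambda: sum(range(100_000)), iterations=5)
    print(summarize(runs))
```

Reporting the median alongside the minimum and maximum makes it easier to spot a single slow iteration that would otherwise skew an average.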

Methodology & Fairness

Each engine (PyKX, Polars Eager, Polars Lazy) runs the same join and calculation logic.
Preprocessing:
- All tables sorted by sym, time.
- kdb+ g# (grouped) attribute set for appropriate columns.
- Polars DataFrames sorted.
No vendor-specific hacks or hidden optimizations.
Memory profiler limitations:
- Memory increments may be zero for highly efficient libraries or due to sampling granularity.
Limitations:
- Not a “cold start” test (no process restart between runs).
- Not measuring file scan or on-disk materialization.
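For readers unfamiliar with the operation being compared, the join and calculation logic can be sketched in Pandas as follows. Everything specific here is an assumption for illustration: the column names (`sym`, `time`, `price`, `bid`, `ask`), the sample rows, the function name `asof_slippage`, and the definition of slippage as trade price minus the prevailing mid quote. The notebook's actual schema and formula may differ.

```python
import pandas as pd

# Hypothetical sample data; the real notebook loads parquet files instead.
trades = pd.DataFrame({
    "sym": ["AAA", "BBB", "AAA", "BBB"],
    "time": pd.to_datetime([
        "2024-01-02 09:30:01", "2024-01-02 09:30:02",
        "2024-01-02 09:30:05", "2024-01-02 09:30:06",
    ]),
    "price": [100.10, 50.00, 100.30, 50.20],
})
quotes = pd.DataFrame({
    "sym": ["AAA", "BBB", "AAA", "BBB"],
    "time": pd.to_datetime([
        "2024-01-02 09:30:00", "2024-01-02 09:30:00",
        "2024-01-02 09:30:04", "2024-01-02 09:30:05",
    ]),
    "bid": [99.90, 49.90, 100.10, 50.00],
    "ask": [100.10, 50.10, 100.30, 50.20],
})


def asof_slippage(trades, quotes):
    """Attach the latest quote at or before each trade, then compute slippage.

    Slippage is ASSUMED here to be trade price minus mid quote.
    """
    # pd.merge_asof requires both inputs to be sorted by the `on` key (time).
    t = trades.sort_values("time")
    q = quotes.sort_values("time")
    joined = pd.merge_asof(t, q, on="time", by="sym", direction="backward")
    joined["mid"] = (joined["bid"] + joined["ask"]) / 2
    joined["slippage"] = joined["price"] - joined["mid"]
    # Present the result ordered by sym, then time.
    return joined.sort_values(["sym", "time"]).reset_index(drop=True)


if __name__ == "__main__":
    print(asof_slippage(trades, quotes))
```

Note that Pandas needs its inputs ordered by the time key alone, with `by="sym"` handling the per-symbol matching, so each engine's sort requirement should be checked separately when aligning the preprocessing.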

How to Reproduce

Download and install PyKX and set up its license.
Open and run the notebook in an environment with Python 3, PyKX, Polars, NumPy, Pandas, and memory_profiler installed.
Review and adjust any schema or path settings if your data differs.

Disclaimer

This test is not affiliated with, nor endorsed by, KX Systems, kdb+, or the Polars core team.
Results should be interpreted as one independent set of measurements, offered for further research and community discussion.

Feedback

Open an issue or submit a pull request to help improve the transparency and value of this comparison!

RyanSieglerKX/PyKX_test
