Hardware Matrix Decomposition

Decompose neural network weight matrices into sets of smaller matrices, each operating in a weight-stationary fashion suitable for hardware accelerator execution.

Overview

Neural network inference relies on large matrix multiplications. This project explores decomposing those weight matrices into many smaller tiles that can each be mapped to a weight-stationary compute unit — keeping weights fixed in local storage while streaming activations through.

Setup

uv sync
uv sync --extra dev  # include dev tools

Development

uv run pytest
uv run ruff check .
uv run mypy src/

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
src/hardware_matrix_decomp		src/hardware_matrix_decomp
tests		tests
.gitignore		.gitignore
.python-version		.python-version
CLAUDE.md		CLAUDE.md
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hardware Matrix Decomposition

Overview

Setup

Development

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Hardware Matrix Decomposition

Overview

Setup

Development

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages