nntel tensor library with 0 dependencies (even gemm and gemv are implemented from scratch with perfomance near to MPS)