Matrix Multiplication (matmul): `numpy` hard to beat? even by mojo? - Modular