ModularM
Modular2y ago
3 replies
tonystratum

max engine leaking?

Apple M2 Max 96G
Was trying to run a text2text onnx model, but ram usage just keeps rising (see video).
I am using https://github.com/bloomberg/memray as a profiler. Seems like it is the engine specifically
GitHub
Memray is a memory profiler for Python. Contribute to bloomberg/memray development by creating an account on GitHub.
GitHub - bloomberg/memray: Memray is a memory profiler for Python
Was this page helpful?