LXC->Docker->Portainer immich-machine-learning healthcheck

Hello, I have two LXCs with iGPU (Frigate and Immich). Sometimes the immich-machine-learning Python process gets stuck, yet http://localhost:3003/ still responds. How can I write a healthcheck that checks the health of the ONNX process directly? If I restart the immich-machine-learning container manually, the main server picks up the new instance and everything works again. My idea is to create a healthcheck and restart the container when it fails, but I have nothing to trigger the restart. Thanks for any help. This is the error from the log:
Fail: [ONNXRuntimeError] : 1 : FAIL : Non-zero status code returned while running OpenVINO-EP-subgraph_2 node.
Name:'OpenVINOExecutionProvider_OpenVINO-EP-subgraph_2_0'
Status Message: /onnxruntime/onnxruntime/core/providers/openvino/ov_interface.cc:243 void onnxruntime::openvino_ep::OVInferRequest::WaitRequest() [OpenVINO-EP]  Wait Model Failed:
Exception from src/inference/src/cpp/infer_request.cpp:245:
Exception from src/plugins/intel_gpu/src/runtime/ocl/ocl_stream.cpp:433:
[GPU] clWaitForEvents, error code: -14
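
One possible approach, as a rough sketch and not a tested solution: since the HTTP endpoint stays up while inference fails, a watchdog can look for the error text in the container log instead and restart the container itself. As far as I know, a Docker HEALTHCHECK on a standalone container only marks it unhealthy and does not restart it, which would explain the missing trigger. The script below is meant to run on the Docker host from cron. The marker strings come from the error above; the function names, the 5m log window, and the container name passed on the command line are assumptions for illustration.

```python
import subprocess
import sys

# Failure markers copied from the error in this post; extend as needed.
FAILURE_MARKERS = ("ONNXRuntimeError", "clWaitForEvents")


def log_has_gpu_failure(log_text: str) -> bool:
    """Return True if any known failure marker appears in the log text."""
    return any(marker in log_text for marker in FAILURE_MARKERS)


def recent_logs(container: str, since: str = "5m") -> str:
    """Fetch recent stdout and stderr of a container via the docker CLI."""
    result = subprocess.run(
        ["docker", "logs", "--since", since, container],
        capture_output=True,
        text=True,
        check=False,
    )
    return result.stdout + result.stderr


def check_and_restart(container: str) -> bool:
    """Restart the container if its recent log shows a failure marker."""
    if log_has_gpu_failure(recent_logs(container)):
        subprocess.run(["docker", "restart", container], check=True)
        return True
    return False


# Only acts when a container name is given on the command line.
if __name__ == "__main__" and len(sys.argv) > 1:
    restarted = check_and_restart(sys.argv[1])
    print("restarted" if restarted else "healthy")
```

Saved as, say, watchdog.py (hypothetical file name), it would be called with the container name exactly as Portainer shows it, for example `python3 watchdog.py immich_machine_learning`. The log window should not be longer than the cron interval, otherwise the same old error line triggers a second restart on the next run.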