We are currently testing various Dockerfiles to ensure the stability and reliability of the systems we’ve built. However, we’ve encountered significant problems with the logging system.
At this time, logs display correctly only about 10% of the time; in the remaining roughly 90% of cases, the log output is blank. In addition, telemetry data tends to reset whenever we open the details view for an individual worker.
Could you please let us know if this is a known issue, or if there’s a recommended approach for capturing logs and debugging errors more effectively in this environment?
Thank you,