Hey everyone,
Quick update on a logging issue some of you experienced.
What happened: For SaaS customers, a code update introduced a new metric that occasionally resulted in failed Clickhouse inserts, causing logs to not appear in the UI.
Fix: We've added validation to prevent negative values and recovered 650k+ logs from our dead-letter queue.
Please Note: Your requests were never affected. The Portkey gateway continued processing all LLM calls normally throughout. This was purely a log storage issue. This issue was also limited to customers who are on Portkey SaaS.
Potential gaps: Logs from Jan 15 1pm UTC to Jan 16 6pm UTC (~29 hours) may have some missing entries due to queue retention limits. This affects a very small number of requests.
If you notice gaps outside this window, let us know in No Access
Thank you @Unknown user & @Unknown user for pointing this out, and being patient with us while we fixed it.
