The wording around how sampling is done is unclear: which row from a sampled region gets stored? Say I'm using this to count interactions with a service. Could a user spam low-tier requests to "hide" their high-tier requests, if those are the rows that get dropped during sampling?
There are example queries on the docs page that "account for" sampling, and other queries aren't really affected by it, but if this were communicated more openly, with examples, I'm sure people would understand and accept it more readily.
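
For illustration, here is the kind of example I mean: a minimal Python sketch of "accounting for" sampling. It assumes uniform random sampling and that each stored row carries a weight saying how many original events it stands for. The field name `sample_interval` and both function names are hypothetical, made up for this sketch, and this is not a description of how the actual product samples.

```python
import random
from collections import Counter


def sample_events(events, rate, rng):
    """Keep each event with probability `rate` (assumed uniform sampling).

    Each kept row gets a hypothetical `sample_interval` weight of 1/rate,
    meaning "this row stands for that many original events".
    """
    kept = []
    for tier in events:
        if rng.random() < rate:
            kept.append({"tier": tier, "sample_interval": 1.0 / rate})
    return kept


def estimate_counts(rows):
    """Estimate original per-tier counts by summing weights, not counting rows."""
    totals = Counter()
    for row in rows:
        totals[row["tier"]] += row["sample_interval"]
    return dict(totals)


if __name__ == "__main__":
    rng = random.Random(0)
    # A user "spams" 10,000 low-tier requests around 100 high-tier ones.
    events = ["low"] * 10_000 + ["high"] * 100
    rng.shuffle(events)
    rows = sample_events(events, rate=0.1, rng=rng)
    print(estimate_counts(rows))
```

Under these assumptions the high-tier estimate is noisy but not systematically hidden, because every event has the same chance of being kept regardless of what surrounds it. Whether the real system behaves this way, or samples per key or per region, is exactly the part I'd like the docs to spell out.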