Generally I'd aim for a maximum of around 2k concurrent connections per pod.

That way a pod dying takes out 1/5th of the connections instead of 1/3rd, which matters because handling those reconnections is the most expensive part.

If you're looking at the metrics that streaming exposes and consistently seeing fewer than 1000 concurrent connections, you probably only need a single instance, maybe two for failover.
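
To make the sizing arithmetic concrete, here's a rough sketch in Python. The function names and the 10,000-connection total are my own assumptions for illustration, not anything streaming itself provides; plug in the peak concurrent connection count from your own metrics.

```python
import math


def pods_needed(total_connections: int, max_per_pod: int = 2000, min_pods: int = 1) -> int:
    """Pods required to keep each one at or below max_per_pod connections.

    min_pods lets you keep a spare instance for failover (hypothetical knob).
    """
    return max(min_pods, math.ceil(total_connections / max_per_pod))


def share_lost_per_pod(pod_count: int) -> float:
    """Fraction of all connections that must reconnect if one pod dies,
    assuming connections are spread evenly across pods."""
    return 1 / pod_count


if __name__ == "__main__":
    total = 10_000  # assumed peak concurrent connections, purely illustrative
    pods = pods_needed(total)
    print(f"{pods} pods, {share_lost_per_pod(pods):.0%} of connections per pod")

    small = 800  # an instance that stays under 1000 concurrent connections
    print(pods_needed(small), "pod needed;", pods_needed(small, min_pods=2), "with failover")
```

With the assumed 10,000 connections and a 2k cap this works out to 5 pods, so each pod carries one fifth of the load. The even-spread assumption is a simplification; real distribution depends on your load balancer.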

#mastoadmin