I’ve been running a few vNodes for a while and started upgrading them from 20201008_1 to 20210106_1. I have noticed that CPU and network usage has increased 15-20x from versions starting 20201028_1 (tried upgrading one-by-one).
I’ve compared env variables, configs, mounts, open files, TCP connections and all is the same, except obviously the updated code AND a new TCP connection which only exists on nodes starting 20201028_1:
TCP node4-878f457cb-gvl6r:9433->ns3157884.ip-51-83-237.eu:7337 (ESTABLISHED)
Not entirely sure what this is, but it looks like a highway connection, and it’s taking up most of the CPU and network bandwidth.
Usage on 20201008_1: 0.1-0.5 CPU, ~50 kbit/s up/down
Usage starting 20201028_1 to 20210106_1: 1.5-2 CPU, ~3-5mbit/s up/down
Please see the metrics I’ve captured during yesterday’s upgrade for 2 nodes:
Is there anything I can do to debug this further and reduce the CPU/network usage? the logs are the same as before and nothing else changed in terms of env, config and mounts.