Understanding the source of 100% CPU usage on data nodes

I noticed uneven CPU usage across pods. As you can see, around 30 data node pods sit at 100% all the time, while some others are mostly idle.
What is the right way to find out where the load is coming from?

Check disk I/O in atop. The most common cause is high CPU due to a busy disk.
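If atop is not available on the node, a rough cross-check is to sample /proc/stat twice and see how the CPU time splits between user, system, and iowait. The sketch below is only that, a sketch: it assumes a Linux host where /proc/stat is readable, the function names are made up for illustration, and inside a container /proc/stat usually reports node-wide counters rather than per-pod ones. A large iowait share would point toward the disk, while a large user share would point toward the process itself.

```python
import time


def parse_cpu_line(stat_text):
    """Return the aggregate 'cpu' counters from /proc/stat text as a dict."""
    # Field order of the aggregate "cpu" line on Linux (see proc(5)).
    fields = ("user", "nice", "system", "idle",
              "iowait", "irq", "softirq", "steal")
    for line in stat_text.splitlines():
        parts = line.split()
        if parts and parts[0] == "cpu":
            values = [int(v) for v in parts[1:1 + len(fields)]]
            return dict(zip(fields, values))
    raise ValueError("no aggregate 'cpu' line found")


def cpu_breakdown(before, after):
    """Return the percentage of CPU time per state between two samples."""
    deltas = {key: after[key] - before[key] for key in before}
    total = sum(deltas.values())
    if total <= 0:
        raise ValueError("samples are identical or out of order")
    return {key: 100.0 * delta / total for key, delta in deltas.items()}


def read_sample(path="/proc/stat"):
    """Read and parse one sample from the given stat file."""
    with open(path) as handle:
        return parse_cpu_line(handle.read())


if __name__ == "__main__":
    first = read_sample()
    time.sleep(5)  # sampling interval; 5 seconds is an arbitrary choice
    second = read_sample()
    for state, percent in cpu_breakdown(first, second).items():
        print(f"{state:8s} {percent:6.2f}%")
```

Run it on one of the busy nodes and on one of the idle ones and compare the iowait and user rows.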

