HighNodeCPUFrequency
Meaning
This alert fires when the frequency of any CPU on a node exceeds 80% of that
CPU's maximum frequency for more than 5 minutes.
Impact
High CPU frequency can indicate:
- High CPU utilization pushing the processor to higher frequencies
- Potential thermal issues causing frequency scaling
- Power management concerns affecting system stability
- Reduced CPU lifespan due to sustained high-frequency operation
Diagnosis
- Identify the affected node and CPU from the details of the firing alert.
- Check current CPU frequency on the node:
kubectl debug node/<node-name> -it --image=registry.redhat.io/ubi8/ubi
Then run inside the debug pod:
grep -i "cpu mhz" /proc/cpuinfo
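The MHz values from /proc/cpuinfo do not show how close each CPU is to its maximum. The sketch below, meant to be run inside the debug pod, compares the two values per CPU. It assumes the node exposes the Linux cpufreq interface under /sys/devices/system/cpu, which many virtual machines do not, and that `scaling_cur_freq` is an acceptable stand-in for whatever source the alert's metric uses. The helper name `cpu_freq_report` is hypothetical.

```shell
# Print "<cpu> <current kHz> <max kHz> <percent of max>" for every CPU
# that exposes cpufreq data. The sysfs root is an optional argument.
cpu_freq_report() (
  root="${1:-/sys/devices/system/cpu}"
  for dir in "$root"/cpu[0-9]*/cpufreq; do
    # Skip CPUs without cpufreq files (this also covers an unmatched glob).
    [ -r "$dir/scaling_cur_freq" ] && [ -r "$dir/cpuinfo_max_freq" ] || continue
    cur=$(cat "$dir/scaling_cur_freq")   # current frequency in kHz
    max=$(cat "$dir/cpuinfo_max_freq")   # hardware maximum in kHz
    [ "$max" -gt 0 ] || continue
    cpu=${dir%/cpufreq}                  # strip the trailing /cpufreq
    cpu=${cpu##*/}                       # keep only the cpuN component
    echo "$cpu $cur $max $(( cur * 100 / max ))%"
  done
)

cpu_freq_report
```

CPUs that report more than 80% in the last column are the ones that match the alert condition at the moment of sampling. The alert itself requires the condition to hold for 5 minutes, so a single sample is only a spot check.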
- Monitor CPU utilization and temperature:
kubectl top pods --all-namespaces --sort-by=cpu
Check system temperature (if available):
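One possible way to read temperatures inside the debug pod is the kernel's thermal zone interface. This is a sketch that assumes the node exposes zones under /sys/class/thermal; on hardware or hypervisors that do not, it prints nothing. The helper name `thermal_report` is hypothetical.

```shell
# Print "<zone> <type> <temperature>" for each readable thermal zone.
# The sysfs root is an optional argument.
thermal_report() (
  root="${1:-/sys/class/thermal}"
  for zone in "$root"/thermal_zone*; do
    # Some zones exist but cannot be read; skip them.
    [ -r "$zone/temp" ] || continue
    milli=$(cat "$zone/temp" 2>/dev/null) || continue   # millidegrees Celsius
    type=$(cat "$zone/type" 2>/dev/null || echo unknown)
    echo "${zone##*/} $type $(( milli / 1000 ))C"
  done
)

thermal_report
```

Safe temperature limits depend on the hardware, so compare the readings against the vendor's specification instead of a fixed number.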
- Review node resource allocation:
kubectl describe node <node-name>
- Check for CPU-intensive processes on the node (run inside the debug pod, which shares the host PID namespace):
ps aux --sort=-%cpu | head -20
Mitigation
- Immediate actions:
  - Monitor the CPU temperature to ensure it’s within safe limits
  - Check if the high frequency is due to legitimate high CPU demand
  - Verify CPU power management settings (for example, the frequency scaling governor) if needed
- If caused by high CPU utilization:
  - Identify and analyze CPU-intensive pods
  - Consider redistributing workloads across nodes
  - Evaluate pod resource limits and requests
  - Scale horizontally if needed
- If thermal throttling is suspected:
  - Check system cooling and ventilation
  - Monitor ambient temperature
  - Consider reducing workload temporarily
  - Verify proper thermal management settings
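For the CPU settings check in the immediate actions above, one setting worth inspecting is the frequency scaling governor, since a governor such as `performance` keeps CPUs near their maximum frequency regardless of load. The sketch below counts how many CPUs use each governor. It assumes the node exposes the cpufreq interface under /sys/devices/system/cpu and is run inside a node debug pod. The helper name `governor_summary` is hypothetical.

```shell
# Print a count of CPUs per scaling governor, for example "8 performance".
# The sysfs root is an optional argument.
governor_summary() (
  root="${1:-/sys/devices/system/cpu}"
  for f in "$root"/cpu[0-9]*/cpufreq/scaling_governor; do
    # Skip CPUs without a governor file (also covers an unmatched glob).
    [ -r "$f" ] && cat "$f"
  done | sort | uniq -c
)

governor_summary
```

Whether a given governor is appropriate depends on the node's tuning profile, so treat the output as input to that decision and not as a finding by itself.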
If you cannot resolve the issue, see the following resources: