Device High Temperature Incident
Focus
Focus

Device High Temperature Incident

Table of Contents

Device High Temperature Incident

Lets see the device high temperature incident and monitor in prisma sd-wan.
There are four temperature sensors in the ION 1200-S-5G device for CPU, ACPITZ, Cellular Modem, and PSE. An incident DEVICEHW_TEMPERATURE_SENSOR is raised when one or more thermal sensors report temperatures beyond the operationally safe threshold value. This incident is helpful to monitor the temperature sensor trends in a device. If a high thermal condition persists for a longer time, the device is shut down in the following cases:
  • If the device has a high CPU temperature, an incident is raised. The system will monitor the temperatures every 5 minutes. If the high CPU temperature persists for 3 continuous readings, the system will log an error and trigger a system shutdown.
  • If there are any 2 temperature sensors other than the CPU that cross the defined thresholds, the system will be shut down. The system will monitor the temperatures every 5 minutes, if there are any 2 sensors that cross the threshold 3 times in a set of 5 continuous readings, the system will be shut down.
The incident is cleared only if all sensors are within the threshold. It may take up to 25 minutes (if multiple sensors reported high temperatures before falling within the threshold) to clear the incident after all sensors are within the threshold value. If the shutdown was system-initiated, then the following actions must be taken:
  • Initiate a system shutdown in order to prevent any further damage to the device or the surroundings.
  • Set the potential reboot reason as Thermal condition shutdown.
  • After the system is shut down, you will need to manually bring the device back up.
When operating in high temperatures, it is suggested to monitor the device temperature activity on the Prisma SD-WAN web interface regularly. If the high-temperature condition persists for 15-30 mins, the device will be shut down. You need to bring the device back up manually.
  1. Navigate to Monitor ION DevicesDevice Activity.
  2. Filter to view your device.
  3. View the time series Device Temperature chart.
    The ION 1200-S-5G has four sensors, you can see the time-series temperature mapping for each sensor CPU, ACPITZ, Cellular Modem, and PSE. The temperature threshold for the various sensors is:
    • CPU - 90C.
    • ACPITZ - 67C.
    • Cellular modem - <-30C or >105C.
    • PSE - <-30C or > 115C.
    • POE shutdown when CPU temp reaches 85C, no power to the PDs. If CPU temp comes down to 75, power to PDs.
    • Reduced performance when temperature between 100C - 118C.
    • RF activity is suspended when the temperature is at 118C or -45C.
    • Device shutdown when the temperature reaches more than 135C.
    • Incident generation threshold when the temperature is <-30C or >105C.
    • Incident clear threshold when the temperature is > -25C or <100C.
    Monitor the temperature trends around the device. Make sure the device is away from direct sunlight and keep the temperature under control. Check for any external heat sources. Call Palo Alto Networks support if the issue persists.