Built-in shutdown logic protects the card from damage by removing power to the device when either electrical or thermal limits (given in the following table) reach or exceed their respective card shutdown thresholds. Thermal management is implemented in the RPU which monitors the external inlet, outlet, and FPGA temperature sensors while the voltage regulator module (VRM) monitors the VCCINT current and temperature. When any of the thresholds are exceeded, card power is removed. A cold reboot of the server hosting the card is subsequently necessary to reload the device configuration and re-enumerate the card in the server.
The following table lists the card shutdown thermal and electrical thresholds. The thresholds apply equally with and without AUX power connected.
|Sensor Description||Card Shutdown Threshold|