TY - GEN
T1 - Implementation and evaluation of fine-grain run-time power gating for a multiplier
AU - Usami, Kimiyoshi
AU - Nakata, Mitsutaka
AU - Shirai, Toshiaki
AU - Takeda, Seidai
AU - Seki, Naomi
AU - Amano, Hideharu
AU - Nakamura, Hiroshi
PY - 2009/12/1
Y1 - 2009/12/1
N2 - In a 32bx32b multiplier, when the bit width of both operands is less than 16-bit, the upper array of the multiplier computing the upper bits of the product does not need to operate and hence consumes wasteful leakage energy. We propose a technique to control run-time power gating (RTPG) for the upper array by dynamically detecting the operand width. Since RTPG suffers from energy overhead due to turning on/off power switches, the sleep time at each sleep event should be longer than the break-even time (BET) to gain in energy savings. Using an analytical model we built, we show that BET reduces exponentially with higher temperature. Since the chip temperature goes up during the operation, the sleep time becomes more likely to exceed the shortened BET, leading to the increase of energy savings. We evaluated our technique through designing a 32bx32b multiplier and implementing in a commercial 90nm CMOS technology. Post-layout simulation results showed that BET reduces from 32 cycles at 25°C to 10 cycles at 65°C and to 3 cycles at 100°C at 100MHz. We also simulated energy dissipation by incorporating our multiplier into a MIPS R3000 based CPU and running a JPEG encoding program. Results showed that our technique reduces energy by 5% at 65°C and by 39% at 100°C over the PG-disabled case even counting the overhead. In contrast, energy was increased by 36% at 25°C. The ground bounce at the wakeup was effectively suppressed to 91mV by using delay-skewed buffering for power switches, while achieving the wakeup time of 1.44ns.
AB - In a 32bx32b multiplier, when the bit width of both operands is less than 16-bit, the upper array of the multiplier computing the upper bits of the product does not need to operate and hence consumes wasteful leakage energy. We propose a technique to control run-time power gating (RTPG) for the upper array by dynamically detecting the operand width. Since RTPG suffers from energy overhead due to turning on/off power switches, the sleep time at each sleep event should be longer than the break-even time (BET) to gain in energy savings. Using an analytical model we built, we show that BET reduces exponentially with higher temperature. Since the chip temperature goes up during the operation, the sleep time becomes more likely to exceed the shortened BET, leading to the increase of energy savings. We evaluated our technique through designing a 32bx32b multiplier and implementing in a commercial 90nm CMOS technology. Post-layout simulation results showed that BET reduces from 32 cycles at 25°C to 10 cycles at 65°C and to 3 cycles at 100°C at 100MHz. We also simulated energy dissipation by incorporating our multiplier into a MIPS R3000 based CPU and running a JPEG encoding program. Results showed that our technique reduces energy by 5% at 65°C and by 39% at 100°C over the PG-disabled case even counting the overhead. In contrast, energy was increased by 36% at 25°C. The ground bounce at the wakeup was effectively suppressed to 91mV by using delay-skewed buffering for power switches, while achieving the wakeup time of 1.44ns.
UR - http://www.scopus.com/inward/record.url?scp=77950330455&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77950330455&partnerID=8YFLogxK
U2 - 10.1109/ICICDT.2009.5166253
DO - 10.1109/ICICDT.2009.5166253
M3 - Conference contribution
AN - SCOPUS:77950330455
SN - 9781424429332
T3 - 2009 IEEE International Conference on Integrated Circuit Design and Technology, ICICDT 2009
SP - 7
EP - 10
BT - 2009 IEEE International Conference on Integrated Circuit Design and Technology, ICICDT 2009
T2 - 2009 IEEE International Conference on Integrated Circuit Design and Technology, ICICDT 2009
Y2 - 18 May 2009 through 20 May 2009
ER -