CIRCUITS
SESSION 12 – Tapa 1
Technology/Circuits
Joint Focus Session - Design Enablement in Scaled CMOS
Thursday, June 14, 1:30 p.m.
Chairpersons: K.
Wilcox, AMD
K.
Nose, Renesas Electronics Corp.
12.1 - 1:30 p.m.
A
22nm Dynamically Adaptive Clock Distribution for Voltage Droop Tolerance, K. Bowman, C. Tokunaga, T. Karnik, V.
De, J. Tschanz, Intel
An all-digital dynamically adaptive clock distribution
mitigates the impact of high-frequency supply voltage (Vcc) droops on
microprocessor performance and energy efficiency. Silicon measurements from a
test chip in a 22nm tri-gate technology demonstrate simultaneous throughput
gains and energy reductions ranging from 14% and 3% at 1.0V to 31% and 15% at
0.6V, respectively, for a 10% Vcc droop.
12.2 - 1:55 p.m.
Voltage
Droop Reduction Using Throttling Controlled by Timing Margin Feedback, M. Floyd, A. Drake*, R. Berry, H.
Chase, R. Willaman, J. Pena, IBM System and Technology Group, *IBM Austin
Research Lab
An active processor throttling control loop was enabled in
the shipping POWER7™ based P775 supercomputer to mitigate voltage droop.
Critical path measurement circuits built into the POWER7 processor chips are
used to dynamically measure and react to loss of timing margin. This technique
was used to save power without dropping frequency and to only engage if a worst-case
droop event occurred in the system. As a result, worst-case workload-induced
voltage droop events are reduced by around 50% compared to the system operating
without the control loop. The reduction in operating voltage afforded by this
technique translates to significant yield improvement, reduced failure rates
(around 60% FIT reduction), and improved power efficiency (32W per processor
chip, which translates into more than $600 per node per year, which is well
more than $250,000 per year in a proposed 512 node installation).
12.3 - 2:20 p.m.
An
On-Die All-Digital Delay Measurement Circuit with 250fs Accuracy, M. Mansuri, B. Casper, F. O'Mahony,
Intel Corporation
This paper demonstrates an in-situ delay measurement circuit
which precisely characterizes key clocking circuits such as full phase rotation
interpolators. This on-die all-digital circuit produces a digital output value
proportional to the relative delay between two clocks, normalized to the clock
period. This circuit requires no calibration for variation or process, voltage,
temperature (PVT) and measures the delay with 250fs absolute accuracy and
repeatability of 10fs-rms.
12.4 - 2:45 p.m.
A
47% Access Time Reduction with a Worst-Case Timing-Generation Scheme Utilizing
a Statistical Method for Ultra Low Voltage SRAMs, A.
Kawasumi, Y. Takeyama, O. Hirabayashi, K. Kushida, F. Tachibana, Y. Niki, S.
Sasaki, T. Yabe, Toshiba
A variation tolerant sense amplifier timing generator which
utilizes a statistical method is proposed. The circuit monitors all the bitline
delays and generates the worst timing from the delay distribution. The timing
generating circuits have been implemented in 28nm and 40nm SRAMs. The 47%
access time improvement at 0.5V has been confirmed in measured results.