© 2014 IEEE

Proceedings of the 29th Applied Power Electronics Conference and Exposition (APEC 2014), Texas, Houston, USA, March 16-20, 2014

# A Deep Trench Capacitor Based 2:1 and 3:2 Reconfigurable On-Chip Switched Capacitor DC-DC Converter in 32 nm SOI CMOS

T. Andersen, F. Krismer, J. W. Kolar, T. Toifl, C Menolfi, L. Kull, T. Morf, M. Kossel, M. Brändli, P. Buchmann, P. Francese

This material is published in order to provide access to research results of the Power Electronic Systems Laboratory / D-ITET / ETH Zurich. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the copyright holder. By choosing to view this document, you agree to all provisions of the copyright laws protecting it.



# A Deep Trench Capacitor Based 2:1 and 3:2 Reconfigurable On-Chip Switched Capacitor DC-DC Converter in 32 nm SOI CMOS

Toke M. Andersen\*<sup>†</sup>, Florian Krismer\*, Johann W. Kolar\*, Thomas Toifl<sup>†</sup>, Christian Menolfi<sup>†</sup>, Lukas Kull<sup>†</sup>,

Thomas Morf<sup>†</sup>, Marcel Kossel<sup>†</sup>, Matthias Brändli<sup>†</sup>, Peter Buchmann<sup>†</sup>, Pier Andrea Francese<sup>†</sup>,

\* Power Electronic Systems Laboratory, ETH Zurich, Zurich, Switzerland

<sup>†</sup> IBM Research – Zurich, Rüschlikon, Switzerland

Abstract-On-chip switched capacitor (SC) converters for multicore microprocessor power delivery have the potential to reduce the overall energy consumption of future multicore microprocessor systems by independently regulating the voltage supply of each core. This paper describes an on-chip SC converter that can be reconfigured between a 2:1 and a 3:2 voltage conversion ratio to support a wide output voltage range from a single input supply. Regarding SC converter analysis and modeling, this paper extends an existing state space model framework to include the flying capacitors' parasitic bottom plate capacitors, which for on-chip SC converters significantly influence both the capacitor currents and the converter efficiency. A reconfigurable SC converter that supports an output voltage range of 700 mV to 1150 mV from a 1.8 V input supply is implemented in a 32 nm SOI CMOS technology that features the high-density deep trench capacitor. The converter achieves a maximum efficiency of 85.2% at  $2.1 \,\mathrm{W/mm^2}$  power density in the 2:1 configuration and a maximum efficiency of 84.1% at  $3.2 \,\mathrm{W/mm^2}$  in the 3:2 configuration.

# I. INTRODUCTION

High-performance microprocessor systems could benefit significantly on critical aspects such as total energy consumption by incorporating on-chip voltage regulators (OCVR). An OCVR is a fully integrated voltage regulator that generates the microprocessor's desired supply voltage (e.g. 0.9 V [1]) from a higher-than-nominal supply voltage (e.g. 1.8 V). Furthermore, the OCVR is integrated on the same chip die as the microprocessor itself, thereby acting as a true point of load (POL) converter.

From a package point of view, an OCVR can reduce the number of power/ground pins that carry the high supply currents required by modern high-performance microprocessors [2–4]. Reducing the number of power/ground pins is extremely attractive because more than half the total number of package pins in today's microprocessors are reserved for power/ground [1], and trend analyses confirm these characteristics also for future microprocessor systems [4]. Furthermore, OCVRs enable per-core regulation in multicore microprocessor. Having one dedicated OCVR per microprocessor core facilitates new power management architectures in which the supply voltage of each core can be regulated according to its independent need. Applying ultra-fast dynamic voltage and frequency scaling (DVFS), which extends traditional DVFS by

capturing within-workload supply voltage variations, has the potential to reduce the overall microprocessor system energy by up to 21% [5].

Traditionally, buck converters are used as POL converters for microprocessor power delivery. Research in microfabricated inductors have focused on achieving high inductor quality factors at small footprints. The current state of the art targets 3D chip integration, where the buck converter is implemented on an interposer in close proximity to the microprocessor chip die [6–9]. However, buck converters are typically not integrated on the same deep submicron chip die as the microprocessor. Inductors using only metals available in the chip metal stack (air core inductors) achieve poor quality factors because of the small metal thicknesses defined by the fabrication process [10]. Furthermore, magnetic materials that increase the quality factor and the inductance typically are not readily available in deep submicron processes.

In contrast, switched capacitor (SC) converters can be implemented using only switches and capacitors that are readily available in the deep submicron semiconductor technologies. For this reason, this paper focuses on on-chip SC converters.

A widely accepted model framework for SC converters was introduced in [11] and further developed in [12]. This model framework, which can be applied on any realizable SC converter topology, can be used to derive a switching frequency dependent equivalent output resistance  $R_{eq}$  that accounts for the converter's conduction losses. However, it has two disadvantages regarding OCVR applications: firstly, an approximation is used to calculate  $R_{eq}$ , and this approximation is least accurate when the SC converter is operated at its highest efficiency [13]. Secondly, it does not include switching losses, which are mainly associated with the parasitic bottom plate capacitors of the flying capacitors. Switching losses due to the parasitic bottom plate capacitor may not be of major concern for discrete SC converters since the parasitic bottom plate capacitors of discrete capacitors can often be neglected. However, they cannot be neglected for on-chip SC converters and may therefore have significant influence on the converter's output current and efficiency. In [13], a SC model framework based on conventional circuit analysis put into a state space model representation is used. Once all node equations have

been put into matrix form,  $R_{\rm eq}$  can be calculated accurately. However, also this model framework does not account for switching losses. This paper extends the state space modeling in [13] to take the effect of the parasitic bottom plate capacitor (i.e., switching losses) on capacitor currents and converter efficiency into account.

The voltage conversion ratio of a SC converter is determined by the topology, i.e. by the configuration of switches and capacitors. Prior SC converter art overcomes this limitation by using reconfigurable (gearbox) power stages, which can switch between voltage conversion ratios to increase the input/output voltage range [11, 12, 14–16]. However, the efficiency and power density performance of the these designs are limited by the MOS or MIM integrated capacitors available in the semiconductor processes used. Recent SC converter designs, which are implemented using integrated deep trench capacitors having high capacitance density and low parasitic bottom plate capacitance, have shown much improved efficiency and power density performance than SC converters using conventional MOS or MIM capacitors [2, 3]. However, these designs are single voltage conversion ratio only.

This paper presents an on-chip SC converter that can be reconfigured to having a 2:1 or a 3:2 step-down voltage conversion ratio. From a 1.8 V input supply, an output voltage range of 700 mV to 1150 mV is supported, thus making this design suited for ultra-fast DVFS in high-performance microprocessor applications. The SC converter is implemented in a 32 nm SOI CMOS technology with deep trench capacitors for high efficiency and high power density.

Section II treats the concept of the 2:1 and 3:2 reconfigurable SC converter. Section III introduces the improved model framework which includes switching losses. The model is verified against Matlab Simulink simulations. Section IV details the implementation of the reconfigurable SC converter in the 32 nm SOI CMOS semiconductor process. In Section V, measurement results of the prototype reconfigurable SC converter are presented and compared with prior art, revealing a more than twofold improvement in power density at an overall higher efficiency, as concluded in Section VI.

# II. RECONFIGURABLE SWITCHED CAPACITOR CONVERTERS

A SC converter is often perceived as a converter with a fixed voltage conversion ratio. However, this is not the complete picture, as the output voltage of a SC converter can be operated below the voltage resulting from the conversion ratio. For instance, a 2:1 conversion ratio SC converter can support output voltages below half the input voltage. For a microprocessor application using DVFS, the output voltage range required can exceed the range covered by the 2:1 converter. Instead, a 3:2 conversion ratio SC converter may be more suitable, covering the output voltage below two-thirds of the input voltage. However, a characteristic of SC converters is that the efficiency drops linearly with the output voltage, as will be discussed in more detail in Section III. Hence it is undesirable to operate SC converters at an output voltage far



Fig. 1. Efficiency of a 2:1 and a 3:2 reconfigurable SC converter with  $V_{\rm in} = 1.8$  V. Reconfigurable SC converters are a means to efficiently cover a wide output voltage range from a fixed input voltage.



Fig. 2. The 2:1 and 3:2 reconfigurable SC converter power stage including the switch configuration in the charging and the discharging phase.

off the conversion ratio. A solution is a SC converter that can be reconfigured between the 2:1 and the 3:2 voltage conversion ratio to efficiently cover a wide output voltage range.

As an example: with  $V_{\rm in} = 1.8$  V, the 2:1 converter covers the output voltage below 900 mV, and the 3:2 converter covers an output voltage below 1.2 V. By changing between the 2:1 and the 3:2 configuration, efficiency can be kept high over a wide output voltage range, as depicted in **Fig. 1**.

The output current (and thereby the output power) for a given output voltage depends on the dimensioning of capacitors and switches in the power stage. Output current regulation capability can be done by changing the switching frequency as in [2, 3, 14–17].

#### A. 2:1 and 3:2 reconfigurable power stage

The basic operating principle of the reconfigurable SC converter power stage is shown in **Fig. 2** [11, 12, 15, 16]. In each configuration, two flying capacitors are sequentially switched between a charging and a discharging phase at 50%

duty cycle. The rate at which the converter switches phase is denoted by the switching frequency  $f_{\rm sw}$ . The implementation of this converter in a 32 nm SOI CMOS technology will be detailed further in Section IV.

# III. STATE SPACE MODEL REPRESENTATION

Applying a model framework on a SC converter, as depicted in **Fig. 3**, translates the SC converter into an equivalent model that captures the steady state converter operation and power losses (efficiency) The transformer winding ratio 1:*M* models the voltage conversion ratio,  $R_{\rm eq}$  models the equivalent output resistance that governs the conduction losses, and  $R_{\rm bp}$ models the equivalent bottom plate resistance that governs the switching losses. Both  $R_{\rm eq}$  and  $R_{\rm bp}$  are functions of  $f_{\rm sw}$ .

The model framework from [11, 12] can be used to estimate  $R_{\rm eq}$ . However, as mentioned in the introduction, this model framework is not directly applicable for on-chip SC converters because of the presence of the flying capacitor's parasitic bottom plate capacitor  $C_{\rm bp}$ , which significantly influences both steady state operation and efficiency. In other words, the influence of  $C_{\rm bp}$  on  $R_{\rm eq}$  and the inclusion of  $R_{\rm bp}$  in Fig. 3 are not taken into account in [11, 12]. These effects have been indicated in e.g. [15, 17], but not in a comprehensive manner that considers steady state operation.

## A. State space model framework including $C_{\rm bp}$

The state space model from [13] is extended in the following to include  $C_{bp}$ , thereby being applicable in the design of on-chip SC converters. As opposed to [13], we have 2ncapacitors (*n* flying capacitors and *n* appertaining bottom plate capacitors), which are put as diagonal elements into a 2n diagonal matrix C. The input and the output voltage are composed into vector **u**. Vectors **v** and **i** collect all capacitor voltages and currents, respectively, with **v** and **i** being related by

$$\mathbf{i} = \mathbf{C} \dot{\mathbf{v}},\tag{1}$$

where  $\dot{\mathbf{v}}$  is the time derivative of  $\mathbf{v}$ .

For the charging phase (phase 1), Kirchhoff's voltage and current laws (KVL and KCL, respectively) are applied to determine 2n independent equations of the form

$$\mathbf{E}_1 \mathbf{i} + \mathbf{F}_1 \mathbf{v} + \mathbf{G}_1 \mathbf{u} = \mathbf{0}.$$
 (2)

When KVL is applied, rows in  $E_1$  are resistances (transistor on-state resistances and/or flying capacitor equivalent series resistances), and rows in  $F_1$  and  $G_1$  are -1, 0, or 1. When KCL is applied, rows in  $E_1$  are -1, 0, or 1 and rows in  $F_1$ and  $G_1$  are all 0. Letting v represent the system states, (1) and (2) can be combined into

$$\dot{\mathbf{v}} = \mathbf{A}_1 \mathbf{v} + \mathbf{B}_1 \mathbf{u}$$
  
 $\mathbf{A}_1 = -\mathbf{C}^{-1} \mathbf{E}_1^{-1} \mathbf{F}_1$  (3)  
 $\mathbf{B}_1 = -\mathbf{C}^{-1} \mathbf{E}_1^{-1} \mathbf{G}_1$ ,

where C is always invertible because it is a diagonal matrix and  $E_1$  is invertible when KVL and KCL have been applied correctly [13]. The general solution to the system of differential equations in (3) is

$$\mathbf{v}(t) = \underbrace{e^{\mathbf{A}_{1}(t-t_{0})}}_{\mathbf{\Phi}_{1}(t)} \mathbf{v}(t_{0}) + \left[\underbrace{\int_{t_{0}}^{t} e^{\mathbf{A}_{1}(t-\tau)} \mathbf{B}_{1} \mathrm{d}\tau}_{\mathbf{\Gamma}_{1}(t)}\right] \mathbf{u}, \quad (4)$$

where we have utilized that **u** is independent of  $\tau$ .  $\Phi_1(t)$  is known as the state transition matrix.

Using the same approach for the discharging phase (phase 2) results in  $A_2$  and  $B_2$ , as well as  $\Phi_2(t)$  and  $\Gamma_2(t)$ .

With 50% duty cycle,  $t_1 = 1/(2f_{sw})$  is the duration of the charging phase, and  $t_2 = 1/(2f_{sw})$  is the duration of the discharging phase. Hence, assuming the charging phase begins at  $t_0 = 0$ , the system states (capacitor voltages) at the end of each switching phase equals

$$\mathbf{v}(t_1) = \mathbf{\Phi}_1(t_1)\mathbf{v}(0) + \mathbf{\Gamma}_1(t_1)\mathbf{u}$$
(5)

$$\mathbf{v}(t_1+t_2) = \mathbf{\Phi}_2(t_2)\mathbf{v}(t_1) + \mathbf{\Gamma}_2(t_2)\mathbf{u}.$$
 (6)

In steady state,  $\mathbf{v}(0) = \mathbf{v}(t_1 + t_2)$  applies, which, using (5) and (6), gives the initial condition

$$\mathbf{v}(0) = \left(\mathbf{I} - \boldsymbol{\Phi}_2(t_2)\boldsymbol{\Phi}_1(t_1)\right)^{-1} \left(\boldsymbol{\Phi}_2(t_2)\boldsymbol{\Gamma}_1(t_1) + \boldsymbol{\Gamma}_2(t_2)\right) \mathbf{u}, \quad (7)$$

where I is the 2n identity matrix. The charge delivered by each capacitor per switching phase is determined as

$$\mathbf{q}_1 = \mathbf{C} \left( \mathbf{v}(t_1) - \mathbf{v}(0) \right) \tag{8}$$

$$\mathbf{q}_2 = \mathbf{C} \left( \mathbf{v}(t_1 + t_2) - \mathbf{v}(t_1) \right) = -\mathbf{q}_1, \tag{9}$$

where the last equality holds because of charge conservation.

## B. Example 2:1 SC converter analysis

In the following, the above state space model is applied on the 2:1 SC converter shown in **Fig. 4a**, where the equivalent circuit is shown in its charging and its discharging phase in Fig. 4b and Fig. 4c, respectively. In the equivalent circuit, each switch is replaced by an on-state resistance  $R_{\rm on}$  when on and an open circuit when off, and the flying capacitor model includes its equivalent series resistance  $R_{\rm esr}$  and the bottom plate capacitor  $C_{\rm bp}$ .

The application of KVL and KCL put into the form of (2) yields the system matrices

$$\begin{split} \mathbf{C} &= \begin{pmatrix} C & 0 \\ 0 & C_{\rm bp} \end{pmatrix}, \ \mathbf{i} = \begin{pmatrix} i_C \\ i_{C \rm bp} \end{pmatrix}, \ \mathbf{v} = \begin{pmatrix} v_C \\ v_{C \rm bp} \end{pmatrix}, \ \mathbf{u} = \begin{pmatrix} V_{\rm in} \\ V_{\rm out} \end{pmatrix}, \\ \mathbf{E}_1 &= \begin{pmatrix} R_{\rm on1} + R_{\rm esr} & 0 \\ -R_{\rm on3} & R_{\rm on3} \end{pmatrix}, \ \mathbf{E}_2 &= \begin{pmatrix} R_{\rm on2} + R_{\rm esr} & 0 \\ R_{\rm on4} & -R_{\rm on4} \end{pmatrix}, \\ \mathbf{F}_1 &= \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}, \qquad \mathbf{F}_2 &= \begin{pmatrix} 1 & 1 \\ 0 & -1 \end{pmatrix}, \\ \mathbf{G}_1 &= \begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix}, \qquad \mathbf{G}_2 &= \begin{pmatrix} 0 & -1 \\ 0 & 0 \end{pmatrix}. \end{split}$$

Now the procedure described above can be applied to calculate the capacitor charges in (8) and (9). From Fig. 4, the output charge in each phase can be found as

$$q_{\rm out1} = q_{C1} - q_{C\rm bp1},\tag{10}$$

$$q_{\text{out2}} = -q_{C2} = q_{C1},\tag{11}$$



Fig. 3. Using a model framework to describe a SC converter by an equivalent model. The model framework presented in this paper takes into account the flying capacitors' parasitic bottom plate capacitors, whose effect significantly influence both steady state operation and efficiency.



Fig. 4. Basic 2:1 SC converter consisting of 4 switches and 1 flying capacitor analyzed using the state space model framework presented in this paper. The converter is shown in both the charging and the discharging phase, including the transistor on-state resistances  $R_{on1-4}$ , the equivalent series resistance  $R_{esr}$ , and the parasitic bottom plate capacitor  $C_{bp}$ .

and the total average output current over a full switching period becomes

$$I_{\text{out}} = \frac{q_{\text{out}1} + q_{\text{out}2}}{t_1 + t_2} = \left(2q_{C1} - q_{C\text{bp}1}\right) f_{\text{sw}}.$$
 (12)

Likewise, the total average input current is

$$I_{\rm in} = \frac{q_{\rm in1} + q_{\rm in2}}{t_1 + t_2} = q_{C1} f_{\rm sw}.$$
 (13)

Using (12) and (13), the total efficiency of the 2:1 SC converter can be calculated as

$$\eta = \frac{P_{\text{out}}}{P_{\text{in}}} = \frac{V_{\text{out}}I_{\text{out}}}{V_{\text{in}}I_{\text{in}}} = \frac{V_{\text{out}}}{V_{\text{in}}} \left(2 - \frac{q_{C\text{bp1}}}{q_{C1}}\right).$$
(14)

To port this analysis to the equivalent model from Fig. 3, the resistances can be determined to be

$$R_{\rm eq} = \frac{MV_{\rm in} - V_{\rm out}}{I_{\rm out}} = \frac{\frac{1}{2}V_{\rm in} - V_{\rm out}}{\left(2q_{C1} - q_{C\rm bp1}\right)f_{\rm sw}} \qquad (15)$$

$$R_{\rm bp} = \frac{MV_{\rm in}}{\frac{1}{M}I_{\rm in} - I_{\rm out}} = \frac{1}{2}\frac{V_{\rm in}}{q_{C\rm bp1}f_{\rm sw}}$$
(16)

where M = 1/2 is the voltage conversion ratio.

A similar analysis is carried out for the 3:2 SC converter, but the details have been omitted for space reasons.

#### C. Model verification

The state space model of the reconfigurable SC converter is verified against simulations using the Matlab Simulink environment. For the verification,  $V_{\rm in} = 1.8$  V,  $R_{\rm on1-9} = R_{\rm esr1,2} = 1 \Omega$ , and  $C_{1,2} = 1$  nF. When sweeping the output voltage, the switching frequency is arbitrarily chosen to equal  $f_{\rm sw} = 100$  MHz, and when sweeping the switching frequency, the output voltage is arbitrarily chosen to equal  $V_{\rm out} = 850$  mV. In the 2:1 configuration,  $I_{\rm out}$  is doubled and  $R_{\rm eq}$  and  $R_{\rm bp}$  are halved since the power stage from Fig. 2 consists of two 2:1 SC converters in parallel.

**Fig. 5** shows the model and simulation results for various ratios of bottom plate capacitor to flying capacitor

$$\alpha = C_{\rm bp}/C. \tag{17}$$

As can be seen, the state space model framework is able to accurately capture the influence of the bottom plate capacitors on the converter's steady state operation and efficiency. For  $\alpha = 0\%$ , which corresponds to omitting  $C_{\rm bp}$ , the efficiency shown in Fig. 5a approaches 100% as  $I_{\rm out}$  shown in Fig. 5b approaches 0 and  $V_{\rm out}$  goes towards  $\frac{1}{2}V_{\rm in}$  ( $\frac{2}{3}V_{\rm in}$ ) in the 2:1 (3:2) configuration. For  $\alpha > 0\%$ , the efficiency drops because of the switching losses. Moreover, the transition voltage between the 2:1 and 3:2 configurations is adjusted for each value of  $\alpha$  to ensure a continuous efficiency over the entire voltage range. Regarding  $R_{\rm eq}$  for  $\alpha = 0\%$  shown in Fig. 5c, the well-known characteristics of a  $1/f_{\rm sw}$  behavior at low switching



Fig. 5. Verification of (a) efficiency  $\eta$ , (b) output current  $I_{out}$ , (c) equivalent output resistance  $R_{eq}$ , and (d) equivalent bottom plate resistance  $R_{bp}$  resulting from the state space model framework. The simulated results (red dots) match the model results (blue lines) over both output voltage and switching frequency for various values of  $\alpha = C_{bp}/C$ .

frequencies and a constant behavior at high switching frequencies are observed [12]. For  $\alpha > 0\%$ , the decrease in  $I_{\rm out}$  and the increase and upward bend at high switching frequencies in  $R_{\rm eq}$  are associated with the presence of  $R_{\rm bp}$  in Fig. 5d. From Fig. 3,  $R_{\rm bp}$  sinks a current ( $I_{\rm bp}$ ) that would otherwise have been delivered to the output, thereby affecting both the efficiency and the output current. This behavior is not captured by the existing model frameworks [11–13].

#### D. Power loss distribution

The state space model facilitates an investigation of the distribution of conduction losses  $P_{\rm eq}$  and switching losses  $P_{\rm bp}$ , which are the power losses associated with  $R_{\rm eq}$  and  $R_{\rm bp}$ , respectively. Using the same model parameter values as above, the distribution of power losses for various values of  $\alpha$  are shown in **Fig. 6**. For  $\alpha = 0\%$ , there are no switching losses  $(P_{\rm bp} = 0)$  and conduction losses constitute all power losses in the converter  $(P_{\rm loss} = P_{\rm eq})$ . For  $\alpha > 0\%$ , the ratio between the losses is constant at low  $f_{\rm sw}$ , since, from Fig. 5c and Fig. 5d, both  $R_{\rm eq}$  and  $R_{\rm bp}$  scale with  $1/f_{\rm sw}$ . This leads to a constant ratio of  $R_{\rm bp}/R_{\rm eq} \propto I_{\rm bp}/I_{\rm out} \propto P_{\rm bp}/P_{\rm eq}$ . For higher  $f_{\rm sw}$ , switching losses constitute an increasing fraction of the total power losses as  $R_{\rm eq}$  ceases while  $R_{\rm bp}$  continues to



Fig. 6. Distribution of conduction losses  $P_{eq}$  and switching losses  $P_{bp}$  or various values of  $\alpha$ . The total power loss is  $P_{loss} = P_{eq} + P_{bp}$ .

scale with  $1/f_{\rm sw}$ . This leads to an increased ratio of  $I_{\rm bp}/I_{\rm out}$ and thereby an increased ratio of  $P_{\rm bp}/P_{\rm eq}$ . For  $\alpha > 3\%$ , switching losses constitute more than two-thirds of the total power losses.



Fig. 7. Transistor level circuit diagram of the reconfigurable SC converter. The parasitic bottom plate capacitors are explicitly shown in gray.



Fig. 8. Level-shifted and non-overlapping (deadtime) gate signals generated by the gate driver.

#### IV. IMPLEMENTATION IN 32 NM SOI CMOS

The reconfigurable SC converter described in Section II and modeled in Section III is implemented in a 32 nm SOI CMOS process from IBM. This particular process features the deep trench capacitor, which, for OCVR applications, has shown superior efficiency and power density performance compared with other on-chip capacitor technologies [2, 3]. The performance benefits are a result of the deep trench capacitor's high capacitance density and low  $\alpha$ .

The transistor level circuit diagram of the reconfigurable converter is shown in **Fig. 7**. For each transistor, its gate signal, which is derived from the level-shifted non-overlapping clock signals shown in **Fig. 8**, is listed in **Tab. I** for both the 2:1 and the 3:2 configuration. The change between clock feeds is implemented using multiplexers (not shown) set by a separate control signal. The gate driver used to generate the level-shifted non-overlapping clock signals in Fig. 8 is done as

 TABLE I

 Gate signals for all transistors in the 2:1 and the 3:2

 configuration.

|              | 2:1                 | 3:2  |  |  |  |
|--------------|---------------------|------|--|--|--|
| $v_{g1}$     | $v_{ m g,pH}$       |      |  |  |  |
| $v_{g2}$     | $v_{ m g,nH}$       |      |  |  |  |
| $v_{\rm g3}$ | $v_{\mathrm{g,pL}}$ |      |  |  |  |
| $v_{g4}$     | $v_{\rm g,pH}$      | gnd  |  |  |  |
| $v_{g5}$     | gnd                 | Vout |  |  |  |
| $v_{g6}$     | v <sub>g,pH</sub>   |      |  |  |  |
| $v_{\rm g7}$ | $v_{\rm g,nH}$      | Vout |  |  |  |
| $v_{\rm g8}$ | $v_{\mathrm{g,pL}}$ |      |  |  |  |
| $v_{g9}$     | $v_{ m g,nL}$       |      |  |  |  |



Fig. 9. Chip micrograph of the reconfigurable SC converter implemented in a 32 nm SOI CMOS technology, which features the high capacitance density and low  $\alpha$  deep trench capacitor.

in [3]. In the literature, earlier implementations of this power stage use gate drivers that depend on either a) internal node voltages as in [15] or b) external voltage supplies as in [16]. In this implementation, all gate signals are based on  $V_{\text{out}}$ ,  $V_{\text{in}}$ , and gnd only, see Fig. 8 and Tab. I. This greatly simplifies the gate driver design.

A chip photo with a layout view of the reconfigurable SC converter is shown in **Fig. 9**. The deep trench capacitors take up 72.1%, the transistors 27.3%, and the gate driver 0.6% of the total converter area. The pad labeled gear is used to externally configure between the 2:1 and the 3:2 configuration.

#### V. EXPERIMENTAL RESULTS

Measurements are carried out on the unpackaged chip die mounted on a probe station. Keithley SourceMeters are used to measure the input and output currents by acting as both the input supply and the output sink. The input and output voltages are measured using Kelvin contacts to account for the voltage drops of cable and contact resistances. For each measurement point, the Keithley SourceMeters are configured such that the on-chip input and output voltages are at the desired levels.

An insufficient on-chip decoupling capacitance is implemented because of chip area limitations, so a discrete 33 nF capacitor is added externally to the chip to reduce the output voltage ripple. However, the extra decoupling capacitance is



Fig. 10. Measured efficiency and power density results over the entire output voltage range for  $f_{\rm sw}=100\,{\rm MHz}.$ 

not included in the power density measures as the converter presented here is intended to be used in interleaved on-chip SC converters, in which the need for output decoupling can be drastically reduced or even completely omitted [15–17].

The measurement results for  $f_{\rm sw} = 100 \,\rm MHz}$  are shown in **Fig. 10**. The converter achieves a maximum efficiency of 85.2% at 2.1 W/mm<sup>2</sup> power density in the 2:1 configuration and a maximum efficiency of 84.1% at 3.2 W/mm<sup>2</sup> in the 3:2 configuration. The efficiency across the entire voltage range of 700 mV to 1150 mV is above 70%. Moreover, the power density can be as high as  $6 \,\rm W/mm^2$ , but at reduced efficiencies only.

## A. Comparison of measurement and model framework results

The model presented in Section III is applied on the reconfigurable SC converter using model parameter values ( $R_{on1-9}$ , C,  $\alpha$ , and  $R_{esr}$ ) that are extracted from the technology models provided by IBM. Thereafter, the model is sought fitted to the measurement results in order to investigate parameter variations between the expected and measured results.

The transistors' on-state resistance and the capacitance of the deep trench capacitor depend on the output voltage, thereby affecting the values of  $R_{on1-9}$  and C. However, both  $R_{on}$  and C are to a first order independent on switching frequency.

 $\label{eq:comparison} \begin{array}{l} \mbox{TABLE II} \\ \mbox{Comparison of model results using parameters extracted from} \\ \mbox{Technology models and parameters fitted to the measurement} \\ \mbox{Results with } V_{in} = 1.8 \, \mathrm{V} \mbox{ and } f_{sw} = 100 \, \mathrm{MHz}. \end{array}$ 

| 2:1                                                                                     | $R_{ m on1}$                                                                                                | $R_{ m on2}$                                                                                                       | $R_{ m on3}$                                                                                                      | $R_{ m on4}$                                                                                   | $R_{ m on5}$                                                                                                        | $R_{ m on6}$                                                                                                                                                                                   |
|-----------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Model(extract)                                                                          | $0.6\Omega$                                                                                                 | $0.6\Omega$                                                                                                        | $0.7\Omega$                                                                                                       | $0.7\Omega$                                                                                    | -                                                                                                                   | $1.1\Omega$                                                                                                                                                                                    |
| Model(meas)                                                                             | $1.1\Omega$                                                                                                 | $1.1\Omega$                                                                                                        | $1.2\Omega$                                                                                                       | $1.2\Omega$                                                                                    | -                                                                                                                   | $1.9\Omega$                                                                                                                                                                                    |
|                                                                                         | $R_{ m on7}$                                                                                                | $R_{ m on8}$                                                                                                       | $R_{ m on9}$                                                                                                      | C                                                                                              | $\alpha$                                                                                                            | $R_{ m esr}$                                                                                                                                                                                   |
| Model(extract)                                                                          | $1.3\Omega$                                                                                                 | $0.7\Omega$                                                                                                        | $0.7\Omega$                                                                                                       | 1.0 nF                                                                                         | 1.8%                                                                                                                | $0.7\Omega$                                                                                                                                                                                    |
| Model(meas)                                                                             | $2.3\Omega$                                                                                                 | $1.2\Omega$                                                                                                        | $1.2\Omega$                                                                                                       | $0.8\mathrm{nF}$                                                                               | 1.7%                                                                                                                | $0.5\Omega$                                                                                                                                                                                    |
|                                                                                         | $V_{ m out}$                                                                                                | Iout                                                                                                               | $R_{ m eq}$                                                                                                       | $R_{ m bp}$                                                                                    | $\eta$                                                                                                              | ρ                                                                                                                                                                                              |
| Model(extract)                                                                          | $0.85\mathrm{V}$                                                                                            | $25\mathrm{mA}$                                                                                                    | $1.7\Omega$                                                                                                       | $286\Omega$                                                                                    | 84.6%                                                                                                               | $2.5\mathrm{W/mm^2}$                                                                                                                                                                           |
| Model(meas)                                                                             | $0.85\mathrm{V}$                                                                                            | $22\mathrm{mA}$                                                                                                    | $2.3\Omega$                                                                                                       | $369\Omega$                                                                                    | 85.0%                                                                                                               | $2.2\mathrm{W/mm^2}$                                                                                                                                                                           |
|                                                                                         |                                                                                                             |                                                                                                                    |                                                                                                                   |                                                                                                |                                                                                                                     |                                                                                                                                                                                                |
| 3:2                                                                                     | $R_{ m on1}$                                                                                                | $R_{ m on2}$                                                                                                       | $R_{ m on3}$                                                                                                      | Ron4                                                                                           | $R_{ m on5}$                                                                                                        | $R_{ m on6}$                                                                                                                                                                                   |
| 3:2<br>Model(extract)                                                                   | <b>R</b> on1<br>1.0 Ω                                                                                       | <b>R</b> on2<br>1.0 Ω                                                                                              | <b>R</b> on3<br>0.5 Ω                                                                                             | $R_{on4}$ –                                                                                    | <b>R</b> on5<br>3.8 Ω                                                                                               | <b>R</b> on6<br>2.0 Ω                                                                                                                                                                          |
| 3:2<br>Model(extract)<br>Model(meas)                                                    | <b>R</b> on1<br>1.0 Ω<br>1.4 Ω                                                                              | <b>R</b> on2<br>1.0 Ω<br>1.4 Ω                                                                                     | $\frac{\boldsymbol{R_{\text{on3}}}}{0.5\Omega}$                                                                   | $R_{ m on4}$ $ -$                                                                              | $\frac{\boldsymbol{R_{\text{on5}}}}{3.8\Omega}$ $5.2\Omega$                                                         | Ron6           2.0 Ω           2.7 Ω                                                                                                                                                           |
| 3:2<br>Model(extract)<br>Model(meas)                                                    | Ron1           1.0 Ω           1.4 Ω           Ron7                                                         | R <sub>on2</sub> 1.0 Ω           1.4 Ω           R <sub>on8</sub>                                                  | Ron3           0.5 Ω           0.7 Ω           Ron9                                                               | $R_{ m on4}$ $  C$                                                                             | $ \begin{array}{c} R_{\text{on5}} \\ 3.8 \Omega \\ 5.2 \Omega \end{array} \\  \begin{array}{c} \alpha \end{array} $ | R <sub>on6</sub> 2.0 Ω           2.7 Ω           R <sub>esr</sub>                                                                                                                              |
| 3:2<br>Model(extract)<br>Model(meas)<br>Model(extract)                                  | Ron1           1.0 Ω           1.4 Ω           Ron7                                                         | Ron2           1.0 Ω           1.4 Ω           Ron8           0.5 Ω                                                | Ron3           0.5 Ω           0.7 Ω           Ron9           0.5 Ω                                               | Ron4           -           -           C           1.0 nF                                      | Ron5           3.8 Ω           5.2 Ω           α           1.8%                                                     |                                                                                                                                                                                                |
| 3:2<br>Model(extract)<br>Model(meas)<br>Model(extract)<br>Model(meas)                   | Ron1           1.0 Ω           1.4 Ω           Ron7           -           -                                 | Ron2           1.0 Ω           1.4 Ω           Ron8           0.5 Ω           0.7 Ω                                | Ron3           0.5 Ω           0.7 Ω           Ron9           0.5 Ω           0.7 Ω                               | Ron4           -           C           1.0 nF           0.7 nF                                 | Ron5           3.8 Ω           5.2 Ω           α           1.8%           1.7%                                      | $     \begin{array}{r} R_{on6} \\             2.0 \Omega \\             2.7 \Omega \\             R_{esr} \\             0.7 \Omega \\             1.4 \Omega \\             \end{array}     $ |
| 3:2<br>Model(extract)<br>Model(meas)<br>Model(extract)<br>Model(meas)                   | Ron1           1.0 Ω           1.4 Ω           Ron7           -           -           Vout                  | Ron2           1.0 Ω           1.4 Ω           Ron8           0.5 Ω           0.7 Ω           Iout                 | Ron3           0.5 Ω           0.7 Ω           0.5 Ω           0.7 Ω           0.7 Ω           Req                | Ron4           -           C           1.0 nF           0.7 nF           R <sub>bp</sub>       | Ron5           3.8 Ω           5.2 Ω           α           1.8%           1.7%                                      |                                                                                                                                                                                                |
| 3:2<br>Model(extract)<br>Model(meas)<br>Model(extract)<br>Model(meas)<br>Model(extract) | Ron1           1.0 Ω           1.4 Ω           Ron7           -           -           Vout           1.09 V | Ron2           1.0 Ω           1.4 Ω           Ron8           0.5 Ω           0.7 Ω           Iout           30 mA | Ron3           0.5 Ω           0.7 Ω           Ron9           0.5 Ω           0.7 Ω           Req           3.5 Ω | Ron4           -           C           1.0 nF           0.7 nF           R <sub>bp</sub> 501 Ω | Ron5           3.8 Ω           5.2 Ω           α           1.8%           1.7%           η           84.5%          | Ron6           2.0 Ω           2.7 Ω           Resr           0.7 Ω           1.4 Ω           ρ           3.8 W/mm <sup>2</sup>                                                                |

For these reasons, the model is fitted to the measurement results with  $V_{\text{out}} = 0.85 \text{ V}$  for the 2:1 configuration and  $V_{\text{out}} = 1.09 \text{ V}$  for the 3:2 configuration over the switching frequency range from 40 MHz to 200 MHz. Furthermore,  $V_{\text{in}} = 1.8 \text{ V}$  applies.

The fitting algorithm is designed for a best fit of the measured and modeled  $R_{eq}$  and  $R_{bp}$  over a switching frequency range. The error function S to be minimized is defined as the sum of the normalized root mean square error between measured and modeled values of  $R_{eq}$  and  $R_{bp}$ .

$$S = \frac{\sqrt{\frac{1}{N}\sum_{i=1}^{N} \left(R_{\rm eq,meas,f_{\rm sw}i} - R_{\rm eq,model,f_{\rm sw}i}\right)^2}}{\frac{\frac{1}{N}\sum_{i=1}^{N} R_{\rm eq,meas,f_{\rm sw}i}}{\frac{\sqrt{\frac{1}{N}\sum_{i=1}^{N} \left(R_{\rm bp,meas,f_{\rm sw}i} - R_{\rm bp,model,f_{\rm sw}i}\right)^2}}{\frac{1}{N}\sum_{i=1}^{N} R_{\rm bp,meas,f_{\rm sw}i}}}, \quad (18)$$

where N is the number of measurement points considered.

The extracted model parameter values are labeled 'Model(extract)' in **Tab II**, and the evaluation of the model using these parameters are listed. To reduce the number of model parameters to fit, the ratios between the extracted onstate resistance values in Tab. II are assumed to be valid when fitting to the measurement results. Using a Matlab script, the model framework is evaluated with different values for each model parameter. Thereafter, the best fit parameters are found by minimizing S from (18). The best fit results for both output voltages are listed as 'Model(meas)' in Tab. II, and the following observations are made from the comparison between extracted and fitted model parameter values:

• The increase in  $R_{\rm on}$  is attributed to wire resistances that are not included in the schematic parameter extraction.



Fig. 11. Efficiency and power density performance comparison between the SC converter presented in this paper and prior art, showing the performance benefits facilitated by the deep trench (DT) capacitor.

• The lower C at higher V<sub>out</sub> could indicate that the nonlinear voltage dependency on capacitance is more severe for the test chip than anticipated from the technology models.

# B. Comparison with prior art

In **Fig. 11**, the performance of the converter presented is compared with that of other on-chip SC converters having (at least) the 2:1 and 3:2 configurations. The values compared are the maximum efficiency and appertaining power density in both configurations. The comparison in Fig. 11 clearly shows the deep trench capacitor's outstanding efficiency and power density performance compared with MOS or MIM capacitors. The power density is more than twice that of prior art at an overall higher efficiency.

# VI. CONCLUSION

This paper extends an existing model framework for switched capacitor (SC) converters to include the parasitic bottom plate capacitor, which significantly influences both the operation and performance of on-chip SC converters. The model framework presented is verified using Matlab Simulink simulations. It is used to design a SC converter that can be configured to provide either a 2:1 or a 3:2 voltage conversion ratio, thereby efficiently extending the output voltage range supported from a fixed input supply. The reconfigurable SC converter is implemented in a 32 nm SOI CMOS technology that features the deep trench capacitor, which has superior capacitance density and low parasitic bottom plate capacitance compared with MIM and MOS capacitors.

The measured efficiency across the entire voltage range of 700 mV to 1150 mV stays above 70%. The converter achieves a maximum efficiency of 85.2% at  $2.1 \text{ W/mm}^2$  power density in the 2:1 configuration and a maximum efficiency of 84.1% at  $3.2 \text{ W/mm}^2$  in the 3:2 configuration. The power density is more than twice that of prior art at an overall higher efficiency. With these efficiency and power density figures, on-chip SC

converters using deep trench capacitors are viable as on-chip voltage regulators for multicore microprocessor power delivery applications.

#### REFERENCES

- "International technology roadmap for semiconductors," 2011. [Online]. Available: www.itrs.net
- [2] L. Chang, R. K. Montoye, B. L. Ji, A. J. Weger, K. G. Stawiasz, and R. H. Dennard, "A fully-integrated switched-capacitor 2:1 voltage converter with regulation capability and 90% efficiency at 2.3A/mm<sup>2</sup>," in *Proc. of the IEEE Symp. on VLSI Circuits (VLSIC)*, Honolulu, Hawaii, Jun. 2010, pp. 55–56.
- [3] T. M. Andersen, F. Krismer, J. W. Kolar, T. Toifl, and et al., "A 4.6W/mm<sup>2</sup> power density 86% efficiency on-chip switched capacitor DC-DC converter in 32 nm SOI CMOS," in *Proc. of the IEEE Applied Power Electronics Conference and Exposition (APEC)*, Long Beach, CA, USA, Mar. 2013, pp. 692–699.
- [4] P. Stanley-Marbell, V. C. Cabezas, and R. P. Luijten, "Pinned to the walls – impact of packaging and application properties on the memory and power walls," in *Proc. of the IEEE Int. Symp. on Low Power Electronics* and Design (ISLPED), Fukuoka, Japan, Aug. 2011, pp. 51–56.
- [5] W. Kim, M. S. Gupta, G.-Y. Wei, and D. Brooks, "System level analysis of fast, per-core DVFS using on-chip switching regulators," in *Proc. of the IEEE Symp. on High Performance Computer Architecture (HPCA)*, Salt Lake City, UT, USA, Feb. 2008, pp. 123–134.
- [6] N. Sturcken, E. J. O'Sullivan, N. Wang, P. Herget, and et al., "A 2.5D integrated voltage regulator using coupled-magnetic-core inductors on silicon interposer," *IEEE Journal of Solid-State Circuits*, vol. 48, no. 1, pp. 244–254, Jan. 2013.
- [7] F. Waldron, R. Foley, J. Slowey, A. N. Alderman, B. C. Narveson, and S. C. O. Mathuna, "Technology roadmapping for power supply in package (PSiP) and power supply on chip (PwrSoC)," *IEEE Trans. on Power Electronics*, vol. 28, no. 9, pp. 4137–4145, Sep. 2013.
- [8] N. Wang, J. Barry, J. Hannon, S. Kulkarni, and et al., "High frequency DC-DC converter with co-packaged planar inductor and power IC," in *Proc. of the IEEE Electronic Components and Technology Conf. (ECTC)*, Las Vegas, NV, USA, May 2013, pp. 1946–1952.
- [9] T. J. DiBene, "400 A fully integrated silicon voltage regulator with indie magnetically coupled embedded inductors," in *Proc. of the IEEE Applied Power Electronics Conference and Exposition (APEC)*, Palm Spring, CA, USA, Feb. 2010, p. Special Presentation.
- [10] T. M. Andersen, C. M. Zingerli, F. Krismer, J. W. Kolar, and C. O'Mathuna, "Inductor optimization procedure for power supply in package and power supply on chip," in *Proc. of the IEEE Energy Conversion Congress and Exposition (ECCE)*, Phoenix, AZ, USA, Sep. 2011, pp. 1320–1327.
- [11] M. S. Makowski and D. Maksimovic, "Performance limits of switchedcapacitor DC-DC converters," in *Proc. of the IEEE Power Electronics Specialists Conference (PESC)*, vol. 2, Atlanta, GA, USA, Jun. 1995, pp. 1215–1221 vol.2.
- [12] M. D. Seeman, "A design methodology for switched-capacitor DC-DC converters," Ph.D. dissertation, University of California, Berkeley, 2009.
- [13] J. M. Henry and J. W. Kimball, "Practical performance analysis of complex switched-capacitor converters," *IEEE Trans. on Power Electronics*, vol. 26, no. 1, pp. 127–136, 2011.
- [14] R. Jain, B. Geuskens, M. Khellah, S. Kim, J. Kulkarni, J. Tschanz, and V. De, "A 0.45-1V fully integrated reconfigurable switched capacitor step-down DC-DC converter with high density MIM capacitor in 22nm tri-gate CMOS," in *Proc. of the IEEE Symp. on VLSI Circuits (VLSIC)*, Kyoto, Japan, Jun. 2013, pp. 174–175.
- [15] H.-P. Le, S. R. Sanders, and E. Alon, "Design techniques for fully integrated switched-capacitor DC-DC converters," *IEEE Journal of Solid-State Circuits*, vol. 46, no. 9, pp. 2120–2131, 2011.
- [16] G. V. Piqué, "A 41-phase switched-capacitor power converter with 3.8mV output ripple and 81% efficiency in baseline 90nm CMOS," in *Proc. of the IEEE Solid-State Circuits Conference (ISSCC)*, San Francisco, CA, USA, Feb. 2012, pp. 98–100.
- [17] H. Meyvaert, T. Van Breussegem, and M. Steyaert, "A 1.65 W fully integrated 90nm bulk CMOS intrinsic charge recycling capacitive DC-DC converter: Design & techniques for high power density," in *Proc. of the IEEE Energy Conversion Congress and Exposition (ECCE)*, Phoenix, AZ, USA, Sep. 2011, pp. 3234–3241.