Abstract
Silicon (Si) based complementary metal-oxide-semiconductor (CMOS) technology has been the driving force of the information-technology revolution. However, scaling of CMOS technology as per Moore’s law has reached a serious bottleneck. Among the emerging technologies, memristive devices are promising for both memory and computing applications. Hybrid CMOS/memristor circuits with CMOL (CMOS + “Molecular”) architecture have been proposed to combine the extremely high density of memristive devices with the robustness of CMOS technology, leading to terabit-scale memory and an extremely efficient computing paradigm. In this work, we demonstrate a hybrid 3D CMOL circuit with 2 layers of memristive crossbars monolithically integrated on a pre-fabricated CMOS substrate. The integrated crossbars can be fully operated through the underlying CMOS circuitry. The memristive devices in both layers exhibit analog switching behavior with controlled tunability and stable multi-level operation. We perform dot-product operations with the 2D and 3D memristive crossbars to demonstrate the applicability of such 3D CMOL hybrid circuits as a multiply-add engine. To the best of our knowledge, this is the first demonstration of a functional 3D CMOL hybrid circuit.
Introduction
Resistance switching or memristive devices are metal-insulator-metal structures that can switch between at least two different resistance states upon application of an electrical impulse (voltage or current). Although the phenomenon of resistance switching has been known since the 1960s, research interest has grown significantly in the last decade after resistance switching devices were identified not only as one of the leading candidates for next generation memory1,2,3,4,5,6,7 but also for analog computation8,9,10, neuromorphic circuits11,12,13,14,15, reconfigurable logic
Figure 5c(ii) shows a gradual increase of the channel 1 (layer 1) weight from 0.39 mS to 0.48 mS in 5 steps, using the tuning algorithm. The evolution of the output waveform is shown in Fig. 5c(i), which confirms the correct operation. The peak-to-peak amplitude of the output current for the layer 1 component changes from 72 μA to 95 μA as a result (see Supplementary Figure 14b). In the next step, the state of the device in layer 1 is kept unchanged while the device in layer 2 is gradually turned on (Fig. 5c(v)). The corresponding output waveforms are displayed in Fig. 5c(iv). As expected, as the weight for channel 2 increases, the amplitude of the high-frequency component in the output waveform increases. As shown in Supplementary Figure 14f, the peak-to-peak layer 2 component changes from 10 μA to 50 μA. The margins for the change in output current components for both the layer 1 and layer 2 devices are ~5 μA or more (Supplementary Figure 14), and this margin can be easily adjusted by the tuning procedure. These results indicate that the devices in each layer of the 3D CMOL crossbars can be controlled independently and used for matrix multiplication. However, practical implementation of high-bandwidth multiply-add operation using 3D CMOS/memristor crossbars will also require overcoming the challenges due to finite line resistance, sneak paths and other sources of noise.
Increasing the number of layers in a 3D crossbar is equivalent to increasing the size of a 2D crossbar array, and therefore also increases the sneak currents33. One way to address this is to train the hardware through supervised or unsupervised learning schemes14,34. A detailed analysis of the effect of sneak paths in 3D hybrid memristor/CMOS circuits and accurate benchmarking are beyond the scope of this work. Development of a selector technology is another possible solution. We will investigate these avenues in our future work.
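As a rough illustration of why a larger array means larger sneak currents, the sketch below evaluates a common textbook idealization rather than the fabricated circuit. It assumes an n × n crossbar of identical ohmic devices, unselected lines left floating, and zero line resistance; none of these assumptions, nor the function name or the conductance value, are taken from this work. Under those assumptions the sneak network reduces to three groups of parallel devices connected in series.

```python
def sneak_conductance(n, g):
    """Sneak-path conductance in parallel with the selected device.

    Assumes an n x n crossbar of identical ohmic devices (conductance g,
    in siemens), floating unselected lines and zero line resistance.
    """
    if n < 2:
        return 0.0  # a single device has no sneak path
    m = n - 1
    # By symmetry the sneak network is three groups in series:
    # m devices, then m*m devices, then m devices (each group in parallel).
    return 1.0 / (1.0 / (m * g) + 1.0 / (m * m * g) + 1.0 / (m * g))


g_device = 0.4e-3  # siemens; illustrative value only, not a measured one
for n in (2, 4, 8, 16):
    ratio = sneak_conductance(n, g_device) / g_device
    print(f"{n:>2} x {n:<2} array: sneak/selected conductance = {ratio:.2f}")
```

In this idealized model the sneak conductance grows roughly linearly with the array dimension, which is why selectors, biasing schemes or training-based compensation become more important as arrays (or layer counts) grow.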
Figure 5: (a) Schematic of the set-up for the dot-product operation utilizing two devices in two different layers of the 3D crossbar; (b) an example of the dot-product operation with two sinusoidal inputs applied to two devices in a 3D crossbar, with the device in layer 1 being programmed to decreasing conductance values; (c) (i) evolution of the output waveform as the weight of the device in layer 1 changes (ii) while the weight of the device in layer 2 is kept constant (iii); (iv) evolution of the output waveform when the weight of the device in layer 1 is kept constant (v) while the layer 2 device changes its state (weight) (vi).
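The two-device dot-product operation described above can be sketched as a weighted sum of two sinusoidal inputs, I(t) = G1·V1(t) + G2·V2(t). This is a minimal sketch assuming ideal ohmic devices and no line resistance or sneak currents. The input amplitude, the two frequencies, the layer 2 weight and the evenly spaced weight steps are illustrative assumptions and are not taken from the measurements; only the 0.39 mS and 0.48 mS end points for layer 1 come from the text, so the printed currents are not expected to reproduce the measured values.

```python
import math


def output_current(t, g1, g2, v_amp, f1, f2):
    """Ideal summed output current I(t) = G1*V1(t) + G2*V2(t), in amperes."""
    v1 = v_amp * math.sin(2 * math.pi * f1 * t)  # input to the layer 1 device
    v2 = v_amp * math.sin(2 * math.pi * f2 * t)  # input to the layer 2 device
    return g1 * v1 + g2 * v2


def component_peak_to_peak(g, v_amp):
    """Peak-to-peak current contributed by one device driven by a sinusoid."""
    return 2 * g * v_amp


V_AMP = 0.1            # volts; assumed input amplitude
F1, F2 = 1e3, 10e3     # hertz; assumed, layer 2 carries the faster signal
G2_FIXED = 0.1e-3      # siemens; assumed constant layer 2 weight

# Layer 1 weight stepped from 0.39 mS to 0.48 mS (even spacing is assumed).
g1_steps = [0.39e-3 + i * (0.48e-3 - 0.39e-3) / 4 for i in range(5)]

for g1 in g1_steps:
    pp = component_peak_to_peak(g1, V_AMP)
    i_sample = output_current(0.25e-3, g1, G2_FIXED, V_AMP, F1, F2)
    print(f"G1 = {g1 * 1e3:.4f} mS: layer 1 peak-to-peak = {pp * 1e6:.1f} uA, "
          f"I(t = 0.25 ms) = {i_sample * 1e6:.1f} uA")
```

Because the two inputs sit at different frequencies, each device's contribution can be read off separately from the summed waveform, which is how the layer 1 and layer 2 components are distinguished in the discussion above.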
In summary, we demonstrate the first 3D CMOL hybrid circuit with 3D memristive crossbars monolithically integrated on a CMOS substrate. High integration yield in terms of good electrical contact between the memristive components and the CMOS substrate was achieved by planarization of the CMOS chips. The integrated 3D crossbars can be fully controlled by the underlying CMOS circuitry. The memristive devices display forming-free switching with low voltage operation. They are analog tunable using a write-and-verify algorithm. The multi-level control of the states for the memristive devices allows them to be used in multiply-add operations where their conductance values can be used as controllable weights. Demonstration of multiply-add operation utilizing memristive devices both in the 1st and 2nd layer of the 3D crossbars opens up promise for ultra-high bandwidth multiply-add engines with high density memristor/CMOS 3D hybrid circuits.
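The write-and-verify tuning mentioned above can be illustrated with a generic closed-loop sketch: read the conductance, compare it with the target, apply a pulse of the appropriate polarity, and repeat until the state is within tolerance. This is not the tuning algorithm used in this work. The `ToyMemristor` class, its step size, its noise level and the tolerance are all hypothetical stand-ins for a real device and measurement set-up.

```python
import random


class ToyMemristor:
    """Hypothetical device: each pulse nudges the conductance by a noisy step."""

    def __init__(self, g_init, step=5e-6, noise=0.2, seed=0):
        self.g = g_init          # conductance in siemens
        self.step = step         # nominal change per pulse (assumed)
        self.noise = noise       # relative pulse-to-pulse variation (assumed)
        self.rng = random.Random(seed)

    def read(self):
        return self.g

    def pulse(self, polarity):
        # polarity = +1 increases conductance, -1 decreases it
        delta = self.step * (1 + self.noise * self.rng.uniform(-1, 1))
        self.g += polarity * delta


def write_and_verify(device, g_target, tolerance, max_pulses=1000):
    """Pulse the device until its read conductance is within tolerance."""
    pulses = 0
    while abs(g_target - device.read()) > tolerance:
        if pulses >= max_pulses:
            raise RuntimeError("target conductance not reached")
        device.pulse(+1 if g_target > device.read() else -1)
        pulses += 1
    return pulses


device = ToyMemristor(g_init=0.39e-3)
n = write_and_verify(device, g_target=0.48e-3, tolerance=5e-6)
print(f"reached {device.read() * 1e3:.3f} mS after {n} pulses")
```

The tolerance has to be at least as large as half the largest single-pulse change for the loop to be guaranteed to settle; in a real system the pulse amplitude or width would typically be adapted as the target is approached.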
Methods
Preparation of planarization holder
The CMOS chips used in this experiment measure 5 mm × 5 mm and are 256 μm thick. To make the holder, a 4 inch Si wafer with an initial thickness of ~260 μm (+0/−4 μm) is thinned by 6–10 μm (depending on the initial thickness) using deep-Si Reactive Ion Etching (DRIE) to a final thickness of ~254 μm. The wafer is then polished by CMP in SF1 slurry (alkaline colloidal silica) for 4 min to remove the roughness generated by the DRIE process. A 3 μm SiO2 film is then deposited on the wafer by Plasma Enhanced Chemical Vapor Deposition (PECVD). A 5.5 mm window is patterned on the oxide by photolithography with negative resist (AZnLOF2020), using a 5.5 mm Si piece as the mask. The oxide in the window region is etched back with CHF3 plasma. The wafer is then subjected to DRIE to completely etch the Si within the window, leaving a 5.5 mm × 5.5 mm hole in the Si wafer.
Chemical mechanical planarization of the chip
The as-received chip has a 1.3 μm scratch-protect oxynitride layer of unknown composition (not disclosed by the foundry). Because the composition and quality of the oxynitride are unknown, it is difficult to precisely control the processing of this layer. We therefore remove this layer completely and use a planarization dielectric of known quality and composition. After the oxynitride layer is completely removed by dry etching in CHF3 plasma, the organic residues are removed by cleaning in AZ300T for 15 min. Next, the CMOS pads in the active region are covered with Ti/Au (10/100 nm) to prevent oxidation of the Al. A 2.5 μm SiO2 layer is then deposited by Inductively Coupled Plasma based PECVD (ICP-PECVD) at low temperature (50 °C). The planarization holder is then crystal bonded onto a second Si substrate and the CMOS chip is placed in the middle of the holder. The entire ensemble is then polished by CMP with SF1 slurry for 4 min. After CMP, the final topography of the chip is verified by atomic force microscopy (AFM). The oxide thickness on top of the CMOS pads after CMP is measured with a reflectance measurement unit. The post-CMP oxide thickness is ~1.5 μm across the chip. Next, the planarization oxide (SiO2) is etched back to the desired thickness (180 nm).
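The oxide thicknesses quoted above imply a simple thickness budget, sketched below as bookkeeping. The thickness and time values are the ones given in the text; the average removal rate is only an estimate that assumes a uniform rate over the full CMP time and roughly 2.5 μm of oxide over the pads before polishing, neither of which is stated in the text.

```python
deposited_nm = 2500.0   # SiO2 deposited by ICP-PECVD (from the text)
post_cmp_nm = 1500.0    # approximate oxide over the pads after CMP (from the text)
target_nm = 180.0       # desired final planarization oxide (from the text)
cmp_time_min = 4.0      # CMP duration (from the text)

# Oxide removed over the pads during CMP and by the subsequent etch-back.
removed_by_cmp_nm = deposited_nm - post_cmp_nm
etch_back_nm = post_cmp_nm - target_nm

# Rough average removal rate; assumes a constant rate, which is an estimate.
avg_cmp_rate_nm_per_min = removed_by_cmp_nm / cmp_time_min

print(f"removed by CMP: {removed_by_cmp_nm:.0f} nm")
print(f"etch-back needed: {etch_back_nm:.0f} nm")
print(f"average CMP rate (estimate): {avg_cmp_rate_nm_per_min:.0f} nm/min")
```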
3D memristive crossbar fabrication
Via holes (4 × 4 μm²) for contact between the bottom electrodes (BEs) of the crossbars and the CMOS pads are first created by photolithography and CHF3 plasma etching through the 180 nm planarization oxide. Next, Ta/Pt (5/60 nm) BEs for the layer 1 devices (width 2 μm) are patterned by photolithography and E-beam evaporation. The Al2O3/TiOx (3/30 nm) switching stack is deposited by reactive sputtering in Ar/O2 plasma. The stoichiometry of the TiOx layer is controlled by adjusting the O2 flow. Next, via holes (4 × 4 μm²) for contact between the top electrodes (TEs) and the CMOS pads are created by photolithography and dry etching in CHF3 plasma. TEs of Ti/Pt (15/60 nm) are defined by optical lithography and E-beam evaporation. After fabrication of the first crossbar layer, a 200 nm isolation oxide is deposited by ICP-PECVD. The 2nd layer of crossbars is fabricated by repeating the steps used for layer 1, namely patterning of the BE via holes, deposition of the Ta/Pt BEs, deposition of the switching oxide stack, patterning of the TE via holes and definition of the Ti/Pt TEs. In a final lithography step, via holes are opened over the wire-bonding pads by photolithography and CHF3 plasma etching. The chip is then annealed at 300 °C for 15 min in forming gas (N2 + H2). The processed chip is wire bonded and packaged in a commercial facility before electrical measurements are performed.
Additional Information
How to cite this article: Chakrabarti, B. et al. A multiply-add engine with monolithically integrated 3D memristor crossbar/CMOS hybrid circuit. Sci. Rep. 7, 42429; doi: 10.1038/srep42429 (2017).
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Change history
27 July 2017
A correction has been published and is appended to both the HTML and PDF versions of this paper. The error has been fixed in the paper.
27 July 2017
Scientific Reports 7: Article number: 42429; published online: 14 February 2017; updated: 27 July 2017. M. Payvand, A. Madhavan, A. Ghofrani and L. Theogarajan were omitted from the author list in the original version of this Article. This has been corrected in the PDF and HTML versions of the Article, as well as in the Supplementary Information that now accompanies the Article.
References
Chen, Y. S. et al. Highly scalable hafnium oxide memory with improvements of resistive distribution and read disturb immunity. IEDM Tech. Dig. 5.5. 1–4 http://dx.doi.org/10.1109/IEDM.2009.5424411 (2009).
Hsu, C. W. et al. 3D vertical TaOx/TiO2 RRAM with over 10³ self-rectifying ratio and sub-μA operating current. IEDM Tech. Dig. 10.4. 1–4 http://dx.doi.org/10.1109/IEDM.2013.6724601 (2013).
Govoreanu, B. et al. 10 × 10 nm² Hf/HfOx crossbar resistive RAM with excellent performance, reliability and low-energy operation. IEDM Tech. Dig. 31.6. 1–4 http://dx.doi.org/10.1109/IEDM.2011.6131652 (2011).
Lee, S. R. et al. Multi-level switching of triple-layered TaOx RRAM with excellent reliability for storage class memory. Dig. Tech. pap. - VLSI Technol. (VLSIT), 2012 Symp. 52, 71–72 (2012).
Sheu, S. S. et al. A 4 Mb embedded SLC resistive-RAM macro with 7.2 ns read-write random-access time and 160 ns MLC-access capability. Dig. Tech. pap.- Int. Solid-State Circuits Conf. (ISSCC) 11.2. 200–202 http://dx.doi.org/10.1109/ISSCC.2011.5746281 (2011).
Sheu, S. S. et al. A 5 ns fast write multi-level non-volatile 1 K bits RRAM memory with advance write scheme. Dig. Tech. pap.- VLSI Circuits, 2009 Symp. 82–83 (2009).
Kim, Y. B. et al. Bi-layered RRAM with unlimited endurance and extremely uniform switching. Dig. Tech. pap.- VLSI Technol. (VLSIT), 2011 Symp. 52–53 (2011).
Laiho, M. & Lehtonen, E. Arithmetic operation within memristor based analog memory. Proc. Int. Workshop CNNA 1–4 http://dx.doi.org/10.1109/CNNA.2010.5430319 (2010).
Merrikh-Bayat, F. & Shouraki, S. B. Memristor-based circuits for performing basic arithmetic operations. Procedia Comp. Sci. 3, 128–132 (2011).
Shin, S., Kim, K. & Kang, S. M. Memristor Applications for Programmable Analog ICs. IEEE Trans. Nanotech. 10, 266–274 (2011).
Gaba, S., Sheridan, P., Zhou, J., Choi, S. & Lu, W. Stochastic memristive devices for computing and neuromorphic applications. Nanoscale 5, 5872–5878 (2013).
Jo, S. H. et al. Nanoscale Memristor Device as Synapse in Neuromorphic Systems. Nano lett. 10, 1297–1301 (2010).
Kim, K.-H. et al. A Functional Hybrid Memristor Crossbar-Array/CMOS System for Data Storage and Neuromorphic Applications. Nano lett. 12, 389–395 (2012).
Prezioso, M. et al. Training and operation of an integrated neuromorphic network based on metal-oxide memristors. Nature 521, 61–64 (2015).
Yu, S., Wu, Y., Jeyasingh, R., Kuzum, D. & Wong, H. S. P. An Electronic Synapse Device Based on Metal Oxide Resistive Switching Memory for Neuromorphic Computation. IEEE Trans. Electron Dev. 58, 2729–2737 (2011).
Xia, Q. et al. Memristor−CMOS Hybrid Integrated Circuits for Reconfigurable Logic. Nano lett. 9, 3640–3645 (2009).
Yang, J. J., Borghetti, J., Murphy, D., Stewart, D. R. & Williams, R. S. A Family of Electronically Reconfigurable Nanodevices. Adv. Mat. 21, 3754–3758 (2009).
Gao, Y., Ranasinghe, D. C., Al-Sarawi, S. F., Kavehei, O. & Abbott, D. Memristive crypto primitive for building highly secure physical unclonable functions. Sci. Rep. 5, 12785 (2015).
Gao, L., Alibart, F. & Strukov, D. B. Programmable CMOS/Memristor Threshold Logic. IEEE Trans. Nanotech. 12, 115–119 (2013).
Likharev, K. K. & Strukov, D. B. CMOL: Devices, Circuits, and Architectures. Lect. Notes Phys. 680, 447–477 (Springer, 2005).
Likharev, K. K. Neuromorphic CMOL circuits. Proc. IEEE-NANO. 339–342 http://dx.doi.org/10.1109/NANO.2003.1231787 (2003).
Strukov, D. B. & Likharev, K. K. Prospects for terabit-scale nanoelectronic memories. Nanotech. 16, 137–148 (2005).
Liu, T. Y. et al. A 130.7 mm² 2-layer 32-Gb ReRAM memory device in 24-nm technology. IEEE J. Solid-State Circuits 49, 140–153 (2014).
Li, H. et al. Four-layer 3D vertical RRAM integrated with FinFET as a versatile computing unit for brain-inspired cognitive information processing. Dig. Tech. pap.- VLSI Technol. (VLSIT), 2016 Symp. 1–2 (2016).
Lin, P., Pi, S. & Xia, Q. 3D integration of planar crossbar memristive devices with CMOS substrate. Nanotech. 25, 405202 (2014).
Adam, G. C. et al. Highly-uniform multi-layer ReRAM crossbar circuits. Proc. ESSDERC 436–439 (2016).
Strukov, D. B. & Williams, R. S. Four-dimensional address topology for circuits with stacked multilayer crossbar arrays. Proc. Nat. Academy of Sci. 106, 20155–20158 (2009).
Lee, H. D., Miller, M. H. & Bifano, T. G. Planarization of a CMOS die for an integrated metal MEMS. SPIE Proc. 4979, 137–144 (2003).
Lastras-Montaño, M. A., Ghofrani, A. & Cheng, K.-T. Architecting energy efficient crossbar-based memristive random-access memories. Int. Symp. Nanoscale Archit. (NANOARCH) 1–6 http://dx.doi.org/10.1109/NANOARCH.2015.7180575 (2015).
Payvand, M. et al. A configurable CMOS memory platform for 3D-integrated memristors. Int. Symp. Circuits and Syst. (ISCAS) 1378–1381 http://dx.doi.org/10.1109/ISCAS.2015.7168899 (2015).
Yang, J. J., Strukov, D. B. & Stewart, D. R. Memristive devices for computing. Nat. Nanotech. 8, 13–24 (2013).
Xia, L. et al. Technological Exploration of RRAM Crossbar Array for Matrix-Vector Multiplication. J. Comp. Sc. Tech. 31, 3–19 (2016).
Hu, M. et al. Dot-product engine for neuromorphic computing: programming 1T1M crossbar to accelerate matrix-vector multiplication. Proc. Design Automation Conf. (DAC) 1–6 http://dx.doi.org/10.1145/2897937.2898010 (2016).
Liu, B. et al. Vortex: variation-aware training for memristor x-bar. Proc. Design Automation Conf. (DAC) 1–6 http://dx.doi.org/10.1145/2744769.2744930 (2015).
Acknowledgements
This work was supported by the Air Force Office of Scientific Research (AFOSR) under the MURI grant FA9550-12-1-0038 and DARPA under Contract No. HR0011-13-C-0051 UPSIDE via BAE Systems.
Author information
Contributions
B. C. wrote the manuscript and fabricated the hybrid CMOS/3D memristor chip. M. A. L.-M. and A. G. designed the architecture and the digital circuitry of the CMOS chip. M. P. (Payvand) designed the analog circuitry, and M. P. (Payvand) and A. M. designed the overall layout of the CMOS chip. M. A. L.-M. created the user interface for electrical measurements. B. C. and M. A. L.-M. conducted the electrical characterization and analyzed the data. G. A. contributed to the chemical mechanical planarization of the CMOS chip as well as the tuning operation of the memristors. M. P. (Prezioso) contributed to developing strategies for electrical characterization of the memristor crossbars. B. H. was involved in the non-stoichiometric TiO2-x thin film depositions. L. T., K. T. C. and D. B. S. supervised the overall project. All authors have seen and approved the manuscript before submission.
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Rights and permissions
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/