Research Progress of Deep Learning-Enabled Micro-Nano Photonics Material Design

Peng FU (付朋); Wenze LAN (蓝文泽); Yang GUO (郭阳); Changzhi GU (顾长志)

doi:10.13922/j.cnki.cjvst.202302005

1. Beijing National Laboratory for Condensed Matter Physics, Institute of Physics, Chinese Academy of Sciences, Beijing, 100190, China
2. School of Physical Sciences, CAS Key Laboratory of Vacuum Physics, University of Chinese Academy of Sciences, Beijing, 100049, China

Corresponding authors: Yang GUO, yangguo@iphy.ac.cn; Changzhi GU, czgu@iphy.ac.cn

CLC number: O43

Abstract: Photonic structure design is at the core of research on micro-nano optical devices and systems. Many artificially designed photonic structures, such as metamaterials, photonic crystals, and plasmonic nanostructures, are widely used in high-speed visible communication, high-sensitivity sensing, and efficient energy harvesting and conversion. However, the standard design methods in this field rely on simplified analytical physical models and rule-based numerical simulation. They proceed by repeated trial and error, which is inefficient and likely to miss the optimal design parameters. Rapidly capturing the underlying correlation between design parameters and spectral response is therefore the key to efficient photonic device design. Over the past few years, deep learning (DL) has advanced rapidly in fields such as speech recognition, machine vision, and natural language processing. The distinctive advantage of DL is that it is data-driven: models automatically discover useful information in massive amounts of data, which offers a new route to the photonic structure design problems described above. Starting from different application scenarios of micro-nano photonic structure design, this review introduces the scope of application and selection criteria of different DL models in photonic design, and summarizes the future opportunities and challenges in this field.

Keywords: Deep learning; Micro-nano photonics; Forward prediction network; Inverse design
Figure 1. Micro-nano photonic structures. (a) Metamaterials^[1]; (b) Plasmonic nanostructures^[3]; (c) Photonic crystals^[2]; (d) A subwavelength grating^[4]

Figure 2. Comparison of the efficiency of different design methods for micro-nano photonic structures as structural complexity varies

Figure 3. The history of deep learning

Figure 4. Several deep neural network models. (a) A perceptron; (b) A multilayer perceptron; (c) A convolutional neural network; (d) A recurrent neural network

Figure 5. Forward prediction of multi-degree-of-freedom spectra by deep learning. (a) Forward prediction of spectral lineshapes of plasmonic nanostructures^[22]; (b) Auxiliary neural network for secondary training on spectral data at extremely narrow bandwidths^[23]; (c) Principle and application of transferring an image classification problem to a phase prediction problem^[24]; (d) Forward fully connected network for accurate simultaneous prediction of spectrum and amplitude^[25]

Figure 6. Deep learning models for inverse design over the global design space of photonic devices. (a) Tandem network model^[34]; (b) Generative adversarial network model that generates meta-atom structures meeting multifunctional design goals^[35]; (c) Deep convolutional generative adversarial network framework that predicts metasurface topology, material properties, and out-of-plane structural parameters across multiple classes of metasurfaces^[36]; (d) Machine learning strategy for guiding the device fabrication process and optimizing device performance^[37]

[1]	Smith D R,Pendry J B,Wiltshire M C K. Metamaterials and negative refractive index[J]. Science,2004,305(5685):788−792 doi: 10.1126/science.1096796
[2]	Joannopoulos J D, Johnson S G, Winn J N, et al. Photonic crystals: molding the flow of light (2nd edition)[M]. Princeton: Princeton University Press, 2008
[3]	Toussaint Jr K C,Roxworthy B J,Michaud S,et al. Plasmonic nanoantennas: from nanotweezers to plasmonic photography[J]. Optics and Photonics News,2015,26(6):24−31 doi: 10.1364/OPN.26.6.000024
[4]	Liang Y Z,Zhang S,Cao X,et al. Free-standing plasmonic metal-dielectric-metal bandpass filter with high transmission efficiency[J]. Scientific Reports,2017,7(1):4357 doi: 10.1038/s41598-017-04540-9
[5]	Bose J C. On the rotation of plane of polarisation of electric wave by a twisted structure[J]. Proceedings of the Royal Society of London,1898,63(389-400):146−152 doi: 10.1098/rspl.1898.0019
[6]	Lindell I V,Sihvola A H,Kurkijarvi J. Karl F. Lindman: the last hertzian, and a harbinger of electromagnetic chirality[J]. IEEE Antennas and Propagation Magazine,1992,34(3):24−30 doi: 10.1109/74.153530
[7]	Veselago V G. The electrodynamics of substances with simultaneously negative values of ϵ and µ[J]. Soviet Physics Uspekhi,1968,10(4):509−514 doi: 10.1070/PU1968v010n04ABEH003699
[8]	Pendry J B,Schurig D,Smith D R. Controlling electromagnetic fields[J]. Science,2006,312(5781):1780−1782 doi: 10.1126/science.1125907
[9]	Schurig D,Mock J J,Justice B J,et al. Metamaterial electromagnetic cloak at microwave frequencies[J]. Science,2006,314(5801):977−980 doi: 10.1126/science.1133628
[10]	Liu Y M,Zhang X. Metamaterials: a new frontier of science and technology[J]. Chemical Society Reviews,2011,40(5):2494−2507 doi: 10.1039/c0cs00184h
[11]	Maier S A. Plasmonics: fundamentals and applications[M]. New York: Springer, 2007
[12]	Li W B,Meng F,Chen Y F,et al. Topology optimization of photonic and phononic crystals and metamaterials: a review[J]. Advanced Theory and Simulations,2019,2(7):1900017 doi: 10.1002/adts.201900017
[13]	Campbell S D,Sell D,Jenkins R P,et al. Review of numerical optimization techniques for meta-device design [Invited][J]. Optical Materials Express,2019,9(4):1842−1863 doi: 10.1364/OME.9.001842
[14]	Nair V, Hinton G E. Rectified linear units improve restricted Boltzmann machines[C]//Proceedings of the 27th International Conference on International Conference on Machine Learning, Haifa: Omnipress, 2010
[15]	Srivastava N,Hinton G,Krizhevsky A,et al. Dropout: a simple way to prevent neural networks from overfitting[J]. The Journal of Machine Learning Research,2014,15(1):1929−1958
[16]	Ioffe S, Szegedy C. Batch normalization: accelerating deep network training by reducing internal covariate shift[C]//Proceedings of the 32nd International Conference on Machine Learning, Lille: JMLR. org, 2015: 448-456
[17]	Sanchez-Lengeling B,Aspuru-Guzik A. Inverse molecular design using machine learning: Generative models for matter engineering[J]. Science,2018,361(6400):360−365 doi: 10.1126/science.aat2663
[18]	Gawehn E,Hiss J A,Schneider G. Deep learning in drug discovery[J]. Molecular Informatics,2016,35(1):3−14 doi: 10.1002/minf.201501008
[19]	Zeng M X,Yuan S,Huang D L,et al. Accelerated design of catalytic water-cleaning nanomotors via machine learning[J]. ACS Applied Materials & Interfaces,2019,11(43):40099−40106
[20]	Rahmani B,Loterie D,Konstantinou G,et al. Multimode optical fiber transmission with a deep learning network[J]. Light:Science & Applications,2018,7:69
[21]	Asano T,Noda S. Optimization of photonic crystal nanocavities based on deep learning[J]. Optics Express,2018,26(25):32704−32717 doi: 10.1364/OE.26.032704
[22]	Malkiel I,Mrejen M,Nagler A,et al. Plasmonic nanostructure design and characterization via Deep Learning[J]. Light:Science & Applications,2018,7:60
[23]	Ma W,Cheng F,Liu Y M. Deep-learning-enabled on-demand design of chiral metamaterials[J]. ACS Nano,2018,12(6):6326−6334 doi: 10.1021/acsnano.8b03569
[24]	Zhu D Y,Liu Z C,Raju L,et al. Building multifunctional metasystems via algorithmic construction[J]. ACS Nano,2021,15(2):2318−2326 doi: 10.1021/acsnano.0c09424
[25]	An S S,Fowler C,Zheng B W,et al. A deep learning approach for objective-driven all-dielectric metasurface design[J]. ACS Photonics,2019,6(12):3196−3207 doi: 10.1021/acsphotonics.9b00966
[26]	Molesky S,Lin Z,Piggott A Y,et al. Inverse design in nanophotonics[J]. Nature Photonics,2018,12(11):659−670 doi: 10.1038/s41566-018-0246-9
[27]	Liu Z C,Zhu D Y,Lee K T,et al. Compounding meta-atoms into metamolecules with hybrid artificial intelligence techniques[J]. Advanced Materials,2020,32(6):1904790 doi: 10.1002/adma.201904790
[28]	Wiecha P R,Muskens O L. Deep learning meets nanophotonics: a generalized accurate predictor for near fields and far fields of arbitrary 3D nanostructures[J]. Nano Letters,2020,20(1):329−338 doi: 10.1021/acs.nanolett.9b03971
[29]	Peurifoy J,Shen Y C,Jing L,et al. Nanophotonic particle simulation and inverse design using artificial neural networks[J]. Science Advances,2018,4(6):eaar4206 doi: 10.1126/sciadv.aar4206
[30]	Guo Q,Shi Z J,Huang Y W,et al. Compact single-shot metalens depth sensors inspired by eyes of jumping spiders[J]. Proceedings of the National Academy of Sciences of the United States of America,2019,116(46):22959−22965 doi: 10.1073/pnas.1912154116
[31]	Martins A,Li K Z,Li J T,et al. On metalenses with arbitrarily wide field of view[J]. ACS Photonics,2020,7(8):2073−2079 doi: 10.1021/acsphotonics.0c00479
[32]	Zhang Q,Liu C,Wan X,et al. Machine-learning designs of anisotropic digital coding metasurfaces[J]. Advanced Theory and Simulations,2019,2(2):1800132 doi: 10.1002/adts.201800132
[33]	Xu D,Luo Y,Luo J,et al. Efficient design of a dielectric metasurface with transfer learning and genetic algorithm[J]. Optical Materials Express,2021,11(7):1852−1862 doi: 10.1364/OME.427426
[34]	Liu D J,Tan Y X,Khoram E,et al. Training deep neural networks for the inverse design of nanophotonic structures[J]. ACS Photonics,2018,5(4):1365−1369 doi: 10.1021/acsphotonics.7b01377
[35]	An S S,Zheng B W,Tang H,et al. Multifunctional metasurface design with a generative adversarial network[J]. Advanced Optical Materials,2021,9(5):2001433 doi: 10.1002/adom.202001433
[36]	Yeung C,Tsai R,Pham B,et al. Global inverse design across multiple photonic structure classes using generative deep learning[J]. Advanced Optical Materials,2021,9(20):2100548 doi: 10.1002/adom.202100548
[37]	Chen X Y,Xie Y F,Sheng Y C,et al. Wafer-scale functional circuits based on two dimensional semiconductors with fabrication optimized by machine learning[J]. Nature Communications,2021,12(1):5953 doi: 10.1038/s41467-021-26230-x
[38]	Khoram E,Chen A,Liu D J,et al. Nanophotonic media for artificial neural inference[J]. Photonics Research,2019,7(8):823−827 doi: 10.1364/PRJ.7.000823
[39]	Shen Y C,Harris N C,Skirlo S,et al. Deep learning with coherent nanophotonic circuits[J]. Nature Photonics,2017,11(7):441−446 doi: 10.1038/nphoton.2017.93
[40]	Feldmann J,Youngblood N,Wright C D,et al. All-optical spiking neurosynaptic networks with self-learning capabilities[J]. Nature,2019,569(7755):208−214 doi: 10.1038/s41586-019-1157-8
[41]	Lin X,Rivenson Y,Yardimci N T,et al. All-optical machine learning using diffractive deep neural networks[J]. Science,2018,361(6406):1004−1008 doi: 10.1126/science.aat8084
[42]	Bao Q L,Zhang H,Ni Z H,et al. Monolayer graphene as a saturable absorber in a mode-locked laser[J]. Nano Research,2011,4(3):297−307 doi: 10.1007/s12274-010-0082-9
[43]	Tait A N,de Lima T F,Zhou E,et al. Neuromorphic photonic networks using silicon photonic weight banks[J]. Scientific Reports,2017,7(1):7430 doi: 10.1038/s41598-017-07754-z
[44]	Williamson I A D,Hughes T W,Minkov M,et al. Reprogrammable electro-optic nonlinear activation functions for optical neural networks[J]. IEEE Journal of Selected Topics in Quantum Electronics,2020,26(1):7700412
[45]	Shastri B J,Nahmias M A,Tait A N,et al. Spike processing with a graphene excitable laser[J]. Scientific Reports,2016,6:19126 doi: 10.1038/srep19126
[46]	Estakhri N M,Edwards B,Engheta N. Inverse-designed metastructures that solve equations[J]. Science,2019,363(6433):1333−1338 doi: 10.1126/science.aaw2498
[47]	Hughes T W,Williamson I A D,Minkov M,et al. Wave physics as an analog recurrent neural network[J]. Science Advances,2019,5(12):eaay6946 doi: 10.1126/sciadv.aay6946


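The major technological revolutions in human history have mostly come from breaking physical limits, typically through breakthroughs in materials and devices. When building a new machine or system, we are usually constrained by the properties of the available materials and by our understanding of physical phenomena. The physical laws of the universe are essentially immutable, but by "designing" raw materials we can control and manipulate natural phenomena to serve human needs. In the early days of optics, the propagation of light was analyzed in terms of rays, which gave rise to the laws of reflection and refraction, the most important laws of geometric optics. In geometric optics, light propagation is controlled mainly by designing and machining the overall shape of natural materials, which led to optical components and systems such as plane mirrors, concave and convex lenses, telescopes, and microscopes. As optics matured, light came to be analyzed as a wave in order to explain propagation behavior that geometric optics could not account for, and wave optics was born. Interference is a defining feature of waves: structures with dimensions at the wavelength or subwavelength scale strongly affect the interference of light and can therefore control its propagation effectively, and the concept of micro-nano photonics emerged accordingly. It has been shown that light propagation is affected not only by the overall shape of a material but also, strongly, by its microstructure at the wavelength or subwavelength scale. Over the past decade, carefully designed micro-nano photonic structures with feature sizes comparable to the wavelength, such as metamaterials^[1], photonic crystals^[2], plasmonic structures^[3], and subwavelength gratings^[4] (Figure 1), have made it possible to manipulate the properties of light, including phase, resonance, angular momentum, and chirality, with unprecedented precision. Micro-nano photonic structures offer properties and functions unavailable in natural materials and have brought revolutionary changes to modern optical engineering fields such as virtual/augmented reality, ultrahigh-resolution imaging, ultrasensitive biosensing, desktop-scale optical systems, and high-speed broadband optical communication. Research in micro-nano photonics has now extended into many disciplines and is effectively advancing each of them.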

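Optical structure design plays a central role in micro-nano photonics research. Because the equations that quantitatively describe light-matter interaction at the subwavelength scale are difficult to solve directly, and general solutions in analytical form are unavailable, the response of an unknown photonic structure has traditionally been predicted by solving Maxwell's equations iteratively with the finite element method (FEM) or the finite-difference time-domain (FDTD) method. To date, photonic structures with specific functions have mainly been designed in a "bottom-up" manner, with the unit cell optimized step by step through trial and error. This approach depends on the designer's expertise and wastes time on repeated trials. It is also subject to human error and to the inherent limits of empirical knowledge, so its results usually stop at a local optimum. Such intuition-guided design therefore rarely discovers photonic structures with unconventional functions or extremely high efficiency. Efficiently establishing the correspondence between photonic structures, material parameters, and their spectral responses is thus one of the core scientific problems in the design of micro-nano photonic devices and materials. With the rise of deep learning, scientific and engineering fields such as computer vision, speech recognition, and decision making have advanced rapidly. Unlike rule-based optimization, deep learning is a data-driven approach that uses training data as samples, aims to describe the design space comprehensively, and can generalize within a given design space. With deep learning, a design can be completed quickly and accurately without time-consuming numerical calculations for each individual case. A well-trained deep learning model can directly establish the mapping from a photonic structure to its optical response, and vice versa.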
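The structure-to-response mapping described above can be illustrated with a small forward-prediction sketch. It is not taken from the reviewed works: the function `toy_spectrum` is a hypothetical analytic stand-in for an FEM/FDTD solver (a single Lorentzian dip), the two design parameters and all hyperparameters are assumptions chosen only for illustration, and the network is a plain one-hidden-layer perceptron written with NumPy.

```python
import numpy as np


def toy_spectrum(params, wavelengths):
    """Hypothetical stand-in for an FEM/FDTD solver: one Lorentzian dip per sample.

    params[:, 0] is the resonance position and params[:, 1] the linewidth,
    both in the same normalized units as `wavelengths`.
    """
    center = params[:, :1]
    width = params[:, 1:2]
    detuning = wavelengths[None, :] - center
    return 1.0 - width**2 / (detuning**2 + width**2)


def train_forward_mlp(x, y, hidden=32, epochs=500, lr=0.2, seed=0):
    """Fit a one-hidden-layer tanh network x -> y with full-batch gradient descent."""
    rng = np.random.default_rng(seed)
    w1 = rng.normal(0.0, 1.0 / np.sqrt(x.shape[1]), (x.shape[1], hidden))
    b1 = np.zeros(hidden)
    w2 = rng.normal(0.0, 1.0 / np.sqrt(hidden), (hidden, y.shape[1]))
    b2 = np.zeros(y.shape[1])
    losses = []
    for _ in range(epochs):
        h = np.tanh(x @ w1 + b1)               # hidden activations
        err = h @ w2 + b2 - y                  # prediction error
        losses.append(float(np.mean(err**2)))  # mean squared error before the update
        g_out = 2.0 * err / err.size           # d(MSE)/d(prediction)
        g_h = (g_out @ w2.T) * (1.0 - h**2)    # backpropagate through tanh
        w2 -= lr * (h.T @ g_out)
        b2 -= lr * g_out.sum(axis=0)
        w1 -= lr * (x.T @ g_h)
        b1 -= lr * g_h.sum(axis=0)

    def predict(x_new):
        """Return predicted spectra for new design parameters."""
        return np.tanh(x_new @ w1 + b1) @ w2 + b2

    return predict, losses


# Build a synthetic data set (assumed parameter ranges) and train the surrogate.
rng = np.random.default_rng(1)
params = np.column_stack([rng.uniform(0.3, 0.7, 200), rng.uniform(0.02, 0.1, 200)])
wavelengths = np.linspace(0.0, 1.0, 50)
spectra = toy_spectrum(params, wavelengths)
predict, losses = train_forward_mlp(params, spectra)
print(f"training MSE: {losses[0]:.4f} -> {losses[-1]:.4f}")
```

Once trained, `predict` returns a spectrum for new parameters without calling the solver again, which is the source of the speed-up discussed in the text. A real study would replace `toy_spectrum` with simulated or measured spectra and would evaluate the model on held-out structures.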

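This review introduces recent progress in deep learning for micro-nano photonic structure design and summarizes the main application scenarios of forward prediction and inverse design. The first part covers the development of micro-nano photonic devices and traditional structure design methods. The second part introduces the basic concepts and classification of deep neural networks and focuses on applications of deep learning to forward prediction and inverse design of micro-nano photonic structures. Finally, the opportunities and challenges of deep learning in photonics research are summarized and discussed.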

3. Summary and Outlook

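The rapid development of deep learning offers a new route to solving photonic design problems. "Big data" is the catalyst of deep learning: it keeps revolutionizing artificial intelligence research, but it has also driven exponential growth in computing demand and energy consumption. At present, most deep learning algorithms run on conventional computers with the von Neumann architecture, whose serial nature is an intrinsic obstacle to running neural networks efficiently. Photonic platforms for deep learning, which promise better speed and power consumption than electronic platforms, are therefore under active investigation; they include nanophotonic scatterers^[38], integrated silicon photonic chips^[39-40], and 3D-printed diffractive layers^[41]. The great advantage of running deep learning models on optical platforms is their ability to process optical signals in parallel. Nonlinear activation functions for optical platforms can be implemented, in a fixed or reprogrammable manner^[44], through the saturable absorption of two-dimensional materials^[42], nonlinear electro-optic modulation in silicon^[43], or simply an external digital processor^[39]. Optical elements in an artificial neural network (ANN) do not need to compute operations such as matrix multiplication and nonlinear activation explicitly; each element itself embodies a mathematical operation. For example, optical spiking neural networks use on-chip optical elements such as waveguides, wavelength-division multiplexers, and ring resonators^[40,43,45] to emulate naturally the basic integrate-and-fire function of biological neurons. These neuromorphic photonic platforms have an overwhelming advantage in speed and parallelism when processing information. The interplay between new photonic structures and deep learning may overcome the limitations of current computing methods and systems and could open new horizons for artificial intelligence research. Indeed, inverse-designed metamaterials have been shown to solve integral equations using electromagnetic fields^[46]. Conversely, wave physics can itself be viewed as an analog recurrent neural network^[47]. Deep learning has already shown great potential for optimizing entire optical systems, and it will continue to be used to explore new ways of accelerating optical measurements and even discovering new optical effects. In short, the potential of new photonic structures for unconventional computing and artificial intelligence deserves further exploration.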

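This review has introduced deep learning methods for forward prediction and inverse design in photonic design. These remarkable developments have all been demonstrated within the past few years, and further leaps are expected as researchers from different backgrounds contribute to this emerging field. Artificial intelligence researchers should work together with optical scientists to develop unconventional, physics-driven algorithms and networks that are robust, generative, and interpretable while requiring less data, providing unconventional routes to devices with unparalleled optical functionality. This interdisciplinary approach, which combines artificial intelligence with micro-nano photonics, will enable the design of large-scale photonic structures with unique functions as well as new optical characterization methods, paving the way for transformative advances in high-speed super-resolution imaging, real-time detection and manipulation, efficient energy conversion systems, and quantum measurement and metrology. Along this path, the photonics community should take as its ultimate goal the construction of a comprehensive dataset covering photonic concepts, architectures, components, and photonic materials.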


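Contents

1.1. Review of the development of micro-nano photonic structures

1.2. Traditional design methods for micro-nano photonic structures

2.1. Overview of deep learning

2.2. Basic classification of deep neural networks

2.3. Forward prediction of multi-degree-of-freedom optical field responses

2.4. Inverse design over the global design space of photonic devices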
