Quantum dynamics of machine learning

王鹏; 麦麦提尼亚孜·麦麦提阿卜杜拉

doi:10.7498/aps.74.20240999

School of Computer Science and Engineering, Southwest Minzu University, Chengdu 610225, China

Corresponding author: E-mail: wp002005@163.com

PACS: 07.05.Mh, 03.65.-w, 05.40.-a, 02.30.Jr

Abstract: Based on a first-principles approach, quantum dynamics is used to model the iterative motion of machine learning. A generalized objective function is defined in the parameter space of machine learning, and the quantum dynamical equation of the learning process is obtained from the Schrödinger equation by treating this function as an equivalent potential energy. A Wick rotation then establishes the relation between quantum dynamics and thermodynamics, which makes it possible to study the iterative process of machine learning with physical and mathematical theory. This work recasts the iterative process as a time-dependent partial differential equation, giving it a precise mathematical formulation; the equation indicates that machine learning may involve a multi-scale annealing process together with time evolution at a fixed scale. Using the quantum dynamical equation, we prove that machine learning converges under time evolution, explain diffusion models in machine learning as mappings of the quantum dynamical equation under the classical approximation and a low-order Taylor approximation, and derive the Softmax and Sigmoid functions commonly used in artificial intelligence. These results show that the quantum dynamics method is effective for studying the theory of machine learning.

Keywords: quantum dynamics, machine learning, diffusion model, Schrödinger equation

Extended abstract: To address the current lack of rigorous theoretical models of the machine learning process, this paper models the iterative motion of machine learning with a quantum dynamics method based on first-principles thinking. The approach treats the iterative evolution of an algorithm as a physical motion process, defines a generalized objective function in the parameter space of the machine learning algorithm, and regards the iterative process of machine learning as the search for the optimal value of this generalized objective function. In physical terms, this corresponds to the system reaching its ground state. Since the dynamical equation of a quantum system is the Schrödinger equation, the quantum dynamical equation describing the iterative process of machine learning is obtained by treating the generalized objective function as the potential energy term in the Schrödinger equation. Machine learning is therefore the process of seeking the ground state of a quantum system constrained by the generalized objective function. The quantum dynamical equation transforms the iterative process into a time-dependent partial differential equation, giving a precise mathematical representation and enabling physical and mathematical theories to be applied to the iterative process of machine learning. This provides theoretical support for implementing the iterative process of machine learning on quantum computers. To further explain the iterative process of machine learning on classical computers, a Wick rotation is used to transform the quantum dynamical equation into a thermodynamic equation, which demonstrates the convergence of the time evolution in machine learning: the system approaches its ground state as time tends to infinity.
Because the generalized objective function has no analytical expression in the parameter space, it is approximated by a Taylor expansion. Under the zero-order Taylor approximation, the quantum dynamical equation and the thermodynamic equation of machine learning reduce to the free-particle equation and the diffusion equation, respectively. This indicates that the most basic dynamical processes during the iteration of machine learning on quantum computers and on classical computers are wave packet dispersion and wave packet diffusion, respectively, which explains, from a dynamical perspective, the basic principle of the diffusion models that have been used successfully in generative neural networks in recent years. Diffusion models indirectly realize the thermal diffusion process in the parameter space by adding Gaussian noise to an image and then removing it, thereby optimizing the generalized objective function in the parameter space. The diffusion process is the dynamical process under the zero-order approximation of the generalized objective function. The thermodynamic equation of machine learning is also used to derive the Softmax and Sigmoid functions commonly used in artificial intelligence. These results show that the quantum dynamics method is an effective theoretical approach to the iterative process of machine learning, providing a rigorous mathematical and physical model for studying that process on both quantum and classical computers.
Figure 1. Quantum dynamical framework for optimization problems

Figure 2. Process of wave packet dispersion

Figure 3. Transition from wave packet dispersion to classical diffusion

Figure 4. Evolution of the Sigmoid function over time

Figure 5. Quantum dynamical interpretation of diffusion models

Figure 6. Sampling mapping of parameter space

Figure 7. Inference structure based on diffusion models

[1]	Metropolis N, Rosenbluth A W, Rosenbluth M N, Teller A H, Teller E 1953 J. Chem. Phys. 21 1087 doi: 10.1063/1.1699114
[2]	Kirkpatrick S, Gelatt C D, Vecchi M P 1983 Science 220 671 doi: 10.1126/science.220.4598.671
[3]	Finnila A B, Gomez M A, Sebenik C, Stenson C, Doll J D 1994 Chem. Phys. Lett. 219 343 doi: 10.1016/0009-2614(94)00117-0
[4]	Wang F, Wang P 2024 Quantum Inf. Process. 23 66 doi: 10.1007/s11128-024-04274-4
[5]	Wang P, Xin G 2023 Acta Autom. Sin. 49 2396 (in Chinese) doi: 10.16383/j.aas.c190761
[6]	Wang P, Huang Y, Ren C, Guo Y 2013 Acta Electron. Sin. 41 2468 (in Chinese) doi: 10.3969/j.issn.0372-2112.2013.12.023
[7]	Wang P, Wang F 2022 J. Univ. Electron. Sci. Technol. (Nat. Sci. Ed.) 51 2 (in Chinese) doi: 10.12178/1001-0548.2021345
[8]	Johnson M W, Amin M H S, Gildert S 2011 Nature 473 194 doi: 10.1038/nature10012
[9]	Sohl-Dickstein J, Weiss E, Maheswaranathan N, Ganguli S 2015 Proceedings of the 32nd International Conference on Machine Learning Lille, France, July 7–9, 2015 p2256
[10]	Song Y, Sohl-Dickstein J, Kingma D P, Kumar A, Ermon S, Poole B 2020 arXiv: 2011.13456 [cs.LG]
[11]	Xin G, Wang P, Jiao Y 2021 Expert. Syst. Appl. 185 115615 doi: 10.1016/j.eswa.2021.115615
[12]	Jin J, Wang P 2021 Swarm Evol. Comput. 65 100916 doi: 10.1016/j.swevo.2021.100916
[13]	Wick G C 1954 Phys. Rev. 96 1124 doi: 10.1103/PhysRev.96.1124
[14]	Dhariwal P, Nichol A 2021 Advances in Neural Information Processing Systems (NeurIPS 2021) December 7–10, 2021 (Virtual-only Conference) p8780
[15]	Ho J, Jain A, Abbeel P 2020 Advances in Neural Information Processing Systems (NeurIPS 2020) December 6–12, 2020 (Virtual-only Conference) p6840
[16]	Nichol A Q, Dhariwal P 2021 Proceedings of the 38th International Conference on Machine Learning July 18–24, 2021 (Virtual-only Conference) p8162
[17]	Lim S, Yoon E, Byun T, Kang T, Kim S, Lee K, Choi S 2023 Advances in Neural Information Processing Systems (NeurIPS 2023) New Orleans, USA, December 10–16, 2023 p37799
[18]	Anderson J B 1975 J. Chem. Phys. 63 1499 doi: 10.1063/1.431514
[19]	Kosztin I, Faber B, Schulten K 1996 Am. J. Phys. 64 633 doi: 10.1119/1.18168
[20]	Haghighi M K, Lüchow A 2017 J. Phys. Chem. A 121 6165 doi: 10.1021/acs.jpca.7b05798
[21]	Jeong J, Shin J 2023 Advances in Neural Information Processing Systems (NeurIPS 2023) New Orleans, USA, December 10–16, 2023 p67374
[22]	Morawietz T, Artrith N 2021 J. Comput. Aid. Mol. Des. 35 557 doi: 10.1007/s10822-020-00346-6

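1. Introduction

Machine learning is a typical optimization problem: learning is an iterative search for an optimum in parameter space, so it is natural to regard the iterative motion of such an algorithm as a dynamical process. After long development, dynamical theories form a mature framework that includes quantum dynamics, Newtonian dynamics, thermodynamics, electrodynamics, and molecular dynamics, each of which describes the laws of motion theoretically through a set of dynamical equations. Establishing a dynamical theory of machine learning is expected to address the lack of theoretical models in this field and thereby advance both the theory and the applications of machine learning.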

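Theoretical modeling of optimization problems by dynamical methods dates back to the acceptance criterion for solutions proposed by Metropolis et al.^[1] in 1953 on the basis of thermodynamics, which Kirkpatrick et al.^[2] later applied to optimization in the simulated annealing algorithm; this was an early attempt to apply thermodynamics to optimization problems. In 1994, Finnila et al.^[3] proposed treating the objective function as the potential energy in the Schrödinger equation, thereby converting an optimization problem into the problem of finding the ground-state wave function of a bound quantum system; this was the first application of quantum dynamics to optimization problems. In 2013 we began to develop a quantum dynamical framework for optimization problems, and the results show that the Schrödinger equation can effectively describe the basic iterative process of optimization algorithms^[4–7]. Further evidence that quantum dynamics can be applied successfully in artificial intelligence is D-Wave, the first commercially deployed quantum computer, which uses quantum annealing to solve intelligent optimization problems^[8].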

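In 2015, Sohl-Dickstein et al.^[9] proposed the diffusion probabilistic model, drawing on ideas from non-equilibrium thermodynamics. Diffusion models have since developed rapidly and are now widely used in artificial intelligence; they are a model example of the successful application of dynamical theory to machine learning. In recent years, dynamical theory has shown growing theoretical and practical value in machine learning, and new dynamical methods based on stochastic differential equations have been developed^[10], whose effectiveness continues to be confirmed by experimental results.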

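The most important task for a dynamical theory of machine learning is to establish the dynamical equation of the learning process. Machine learning is a probabilistic motion process, and quantum dynamics and thermodynamics describe the probabilistic motion of the physical world from the microscopic and the macroscopic perspectives, respectively; quantum dynamics in particular reflects the most fundamental laws of motion of the physical world. The core dynamical equation of quantum dynamics is the Schrödinger equation, a deterministic time-dependent partial differential equation that describes the probabilistic laws of motion found throughout the physical world. This suggests that quantum dynamics can serve as a first principle for describing the motion of machine learning. This paper establishes the quantum dynamical equation of machine learning and studies the iterative process of machine learning from the perspective of quantum dynamics.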
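As a rough illustration of the kind of equation meant here, the following sketch writes a Schrödinger-type equation with the generalized objective function as the potential, together with its Wick-rotated counterpart. The notation is assumed for illustration and is not taken from the paper's own numbered equations: x is a point in parameter space, f(x) is the generalized objective function, and physical constants are absorbed into a single scale parameter D.

```latex
i\,\frac{\partial \psi(x,t)}{\partial t}
  = \left[-D\,\nabla^{2} + f(x)\right]\psi(x,t)
\quad\xrightarrow{\ \tau \,=\, i t\ }\quad
\frac{\partial \psi(x,\tau)}{\partial \tau}
  = \left[D\,\nabla^{2} - f(x)\right]\psi(x,\tau)
```

Under these assumptions, replacing f(x) by a constant (a zero-order Taylor approximation) leaves a free-particle equation on the left and a diffusion equation on the right, each up to a uniform phase or decay factor, which is consistent with the reduction described in the abstract.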

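5. Conclusion

Starting from the Schrödinger equation, this paper establishes the quantum dynamical equation for the iterative process of machine learning and connects it to the thermodynamic equation through a Wick rotation. With this equation, first, the basic iterative process of machine learning is obtained by means of the Wick rotation and a Taylor approximation; second, the Softmax and Sigmoid functions commonly used in machine learning are obtained from the general solution of the equation, which explains their dynamical meaning in physical and mathematical terms; finally, the quantum dynamical equation is used to give a dynamical interpretation of the basic iterative process of diffusion models.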
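The two ideas summarized above can be tried numerically. The following minimal sketch is illustrative only and is not the paper's own derivation or code. It assumes a one-dimensional parameter space, a toy objective f(x) = x², and an imaginary-time equation of the form ∂ψ/∂τ = D ψ'' − f(x) ψ; the function names `relax`, `softmax_weights`, and `sigmoid`, the grid, and all parameter values are hypothetical choices. The first function shows the distribution settling around the minimum of f; the second shows that dropping the diffusion term leaves weights proportional to exp(−f τ), which is a Softmax form and reduces to a Sigmoid for two states.

```python
import math


def relax(f, x_min=-2.0, x_max=2.0, n=41, D=0.5, dt=0.005, steps=2000, start=5):
    """Explicit finite-difference integration of d(psi)/d(tau) = D*psi'' - f(x)*psi.

    psi is held at 0 on both ends of the grid and renormalised to sum to 1
    after every step, so only its shape (not its overall decay) is tracked.
    The default dt keeps the explicit scheme stable for f(x) = x*x on this grid.
    """
    dx = (x_max - x_min) / (n - 1)
    xs = [x_min + i * dx for i in range(n)]
    psi = [0.0] * n
    psi[start] = 1.0  # all weight starts far from the minimum of f
    for _ in range(steps):
        new = [0.0] * n
        for i in range(1, n - 1):
            lap = (psi[i - 1] - 2.0 * psi[i] + psi[i + 1]) / (dx * dx)
            new[i] = psi[i] + dt * (D * lap - f(xs[i]) * psi[i])
        total = sum(new)
        psi = [v / total for v in new]
    return xs, psi


def softmax_weights(f_values, tau):
    """Normalised exp(-f_i * tau): the shape left when the diffusion term is dropped."""
    scaled = [-v * tau for v in f_values]
    top = max(scaled)  # subtract the maximum for numerical stability
    weights = [math.exp(s - top) for s in scaled]
    total = sum(weights)
    return [w / total for w in weights]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


xs, psi = relax(lambda x: x * x)
peak = max(range(len(psi)), key=lambda i: psi[i])
print("peak of relaxed distribution at x =", round(xs[peak], 6))

p = softmax_weights([0.3, 1.1], tau=2.0)
print("two-state weight:", p[0], "sigmoid:", sigmoid((1.1 - 0.3) * 2.0))
```

In this toy setting, τ plays the role of an inverse temperature: larger τ sharpens the weights toward the state with the smallest objective value.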

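Establishing the quantum dynamical equation of machine learning converts the iterative process of machine learning into a time-dependent partial differential equation and thus gives the learning process a precise mathematical description, so that the mature theoretical frameworks of quantum mechanics and mathematics can be brought to bear on machine learning. This work offers a new perspective and line of thought for building a rigorous theoretical foundation for machine learning, and may also provide a theoretical basis for implementing artificial intelligence algorithms on quantum computers in the future^[22].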
