搜索

x

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

深度学习代理模型的容性耦合氩等离子体流体模拟: 非对称推理与定量可信边界

李靖宇 蒋星照 何倩 张逸凡 吴桐 姜森钟 宋远红 贾文柱

引用本文:
Citation:

深度学习代理模型的容性耦合氩等离子体流体模拟: 非对称推理与定量可信边界

李靖宇, 蒋星照, 何倩, 张逸凡, 吴桐, 姜森钟, 宋远红, 贾文柱

Capacitively coupled argon plasmas fluid simulations with deep learning surrogates: Asymmetric inference and quantitative trust boundaries

LI Jingyu, JIANG Xingzhao, HE Qian, ZHANG Yifan, WU Tong, JIANG Senzhong, SONG Yuanhong, JIA Wenzhu
Article Text (iFLYTEK Translation)
PDF
HTML
导出引用
在线预览
  • 容性耦合等离子体(CCP)的流体模拟对于理解放电物理机制非常重要, 但其高昂的计算成本制约了大范围参数化探索. 为突破该限制, 本研究开发了一种深度学习代理模型, 旨在以近瞬时推理速度复现一维CCP流体模型的输出结果. 该模型精确预测了容性耦合氩等离子体流体模拟中关键等离子体参数的空间分布, 包括电子密度、电子温度及电场分布, 并将所需计算时间从数小时压缩至毫秒量级. 除加速优势外, 代理模型学习过程还揭示了根植于等离子体物理的非对称推理能力. 代理模型可从复杂的低压物理域外推至更简单的高压物理域, 反之则不可行, 表明低压状态具有更完整的物理信息. 进一步, 本研究建立一个模型推理的置信边界, 确保预测结果的物理可靠性. 最终, 本研究为创建高保真、超快速的流体模拟等离子体替代提供了方案.
    Fluid simulations of capacitively coupled plasmas (CCP) are crucial for understanding their discharge physics, yet the high computational cost poses a major bottleneck. To overcome this limitation, we have developed a deep learning-based surrogate model designed to replicate the output of a one-dimensional CCP fluid model with near-instantaneous inference speed. Through a systematic evaluation of three architectures—Feedforward Neural Network (FNN), Attention-enhanced Long Short-Term Memory network (ALSTM), and Convolutional-Transformer hybrid network (CTransformer)—we found that the sequence-structured ALSTM model achieves the optimal balance between speed and accuracy, with an overall prediction error of only 1.73% for electron density, electric field, and electron temperature in argon discharge. This study not only achieves simulation acceleration but also reveals that the model can accurately extrapolate from low-pressure conditions dominated by complex non-local effects to high-pressure conditions governed by simple local effects, whereas the reverse extrapolation fails. This phenomenon suggests that training under low-pressure conditions enables the model to capture more comprehensive data features. From the perspective of model weights, both low-pressure and high-pressure models assign key weights to the sheath region, but the low-pressure model exhibits higher weight peaks in the sheath, indicating a stronger ability to capture the key physical process of sheath dynamics. In contrast, the high-pressure model, due to its lower weights in the sheath region, may fail to adequately resolve the complex dynamics of the sheath when predicting new operating conditions, thereby limiting its ability for high-fidelity extrapolation. To ensure the reliability of this data-driven model in practical applications, we established a trust boundary of 5% normalized mean absolute spatial error for model performance through systematic extrapolation experiments. When the model's extrapolation error falls below this threshold, the spatial distribution curves of predicted parameters such as electron density and electron temperature closely match the true physical distributions. However, once the error exceeds this critical point, systematic deviations such as morphological distortion and amplitude discrepancies begin to appear in the predicted spatial distributions, significantly diverging from the true physical laws. In the future, we will develop neural network models capable of processing high-dimensional spatial data and incorporating multi-dimensional input features such as various discharge gases, ultimately realizing a dedicated AI model for the field of capacitively coupled plasmas.
  • 图 1  神经网络架构示意图 (a) 前馈神经网络(FNN)结构; (b) 注意力增强型长短期记忆网络(ALSTM)结构, 将等离子体状态演化视为序列预测任务; (c) 卷积-Transformer混合网络(CTransformer)结构, 采用Transformer编码器与CNN解码器架构

    Fig. 1.  Schematic of the neural network architectures: (a) feedforward neural network (FNN) structure; (b) attention-enhanced long short-termmemory (ALSTM) structure, which treats plasma state evolution as a sequence prediction task; (c) hybrid convolutional-transformer (CTransformer) structure, employing a Transformer encoder and a CNN decoder.

    图 2  (a)和(b)分别展示了不同气压和电压下, 时间平均电子密度、电场以及电子温度的轴向空间分布, 红色系为流体模型仿真结果, 蓝色系为PIC模型仿真结果

    Fig. 2.  Time-averaged spatial profiles of electron density, electric field, and temperature under scans of (a) gas pressure and (b) driving voltage, used for dataset generation. The red-colored series represents fluid model simulation results, while the blue-colored series represents PIC model results.

    图 3  FNN、ALSTM与CTransformer模型对电子密度($ n_e $)、电场(E)和电子温度($ T_e $)在气压和电压数据集上的训练损失收敛曲线, 其中损失代表了模型的误差情况. 横轴代表训练迭代次数(Number of iterations), 纵轴表示损失的负对数值($ -\log_{10} \text{Loss} $)

    Fig. 3.  Convergence curves of training loss for the FNN, ALSTM, and CTransformer models on the electron density ($ n_e $), electric field (E), and electron temperature ($ T_e $) datasets, where the loss values reflect the models' error performance. The horizontal axis represents the number of training iterations. The vertical axis represents the negative logarithm of the loss ($ -\log_{10} \text{Loss} $).

    图 4  时间平均的电子密度、电场和温度的空间分布 (a), (e) 显示了流体/MC模拟的真实值. FNN (b), (f)、ALSTM (c), (g)和CTransformer (d), (h)模型的预测结果与真实值在不同气压(左列, b—d)和电压(右列, f—h)条件下的对比. 各模型的子图中均展示了预测结果及其归一化绝对误差(NAE)分布. 垂直虚线将训练数据(左侧)与测试数据(右侧)分隔开来

    Fig. 4.  Time-averaged spatial profiles of electron density, electric field, and temperature. Panels (a), (e) show the ground truth from a Fluid/MC simulation. Predictions from the FNN (b), (f), ALSTM (c), (g), and CTransformer (d), (h) models are compared against the ground truth under varying pressure (left columns, b–d) and voltage (right columns, f–h). Each panel for the models shows the prediction and its normalized absolute error (NAE) distribution. The vertical dashed line separates the training data (left) from the test data (right).

    图 5  FNN、ALSTM 和 CTransformer 模型在(a)气压数据集和(b)电压数据集上对电子密度、电场和电子温度的时间平均空间分布的外推性能(极板间距为0—3 cm). 蓝色阴影区域表示外推集, 灰色阴影区域表示训练集

    Fig. 5.  Extrapolation performance of FNN, ALSTM, and CTransformer for time-averaged spatial profiles of electron density, electric field, electron temperature on (a) the pressure set and (b) the voltage set(Electrode spacing is 0–3 cm). The blue-shaded area represents the extrapolation set, while the gray-shaded area is the training set.

    图 6  ALSTM的输入层权重分布图, 其中蓝线和黄线分别为基于低气压和高气压数据训练的ALSTM模型的输入层权重分布, 红色区域为鞘层权重分布

    Fig. 6.  The weight distribution map of the input layer for the ALSTM model, where the blue and yellow lines represent the input layer weight distributions of the ALSTM models trained using low-pressure and high-pressure data respectively, with the red region indicating the sheath layer weight distribution.

    图 7  FNN、ALSTM和CTransformer模型在不同电压数据集(总电压范围50—400 V的10%—50%子集)上训练时在不同外推距离下预测的电子密度(a)和电子温度(b)空间分布. 而外推距离是指从各自训练集边界开始计算的总电压参数范围的10%、20%、30%、40%和50%. GroundTruth是指流体模拟的数据

    Fig. 7.  Spatial distributions of electron density (a) and electron temperature (b) predicted by FNN, ALSTM, and CTransformer models at different extrapolation distances when trained on distinct voltage datasets (10–50% subsets of the total voltage range 50–400 V). These extrapolation distances correspond to 10%, 20%, 30%, 40%, and 50% of the total voltage parameter range calculated from the respective training set boundaries. GroundTruth refers to data from fluid simulations.

    图 8  FNN、ALSTM和CTransformer模型在不同电压数据集(总电压范围50—400 V的10%—50%子集)上训练时电子密度(a)和电子温度(b)的归一化平均空间绝对误差随电压的变化函数. 图中垂直虚线标记了图7(a), (b)中所示的具体外推距离

    Fig. 8.  The normalized mean spatial absolute error of electron density (a) and electron temperature (b) as a function of voltage for FNN, ALSTM, and CTransformer models trained on different voltage datasets (10–50% subsets spanning the total voltage range of 50–400 V). Vertical dashed lines in the figure mark the specific extrapolation distances shown in Figure 7(a), (b)

    表 1  神经网络与传统模型在低温等离子体仿真中的对比

    Table 1.  Comparison of Neural Network and Traditional Models in Low-Temperature Plasma Simulation

    算法整体误差/%求解时间/s推理时间/s
    Fluid Model$ 0 $$ > 4680 $
    PIC Model$ 42.53 $$ > 7200 $
    FNN$ 2.78 $$ 36.0 $$ 0.00041 $
    ALSTM$ 1.73 $$ 93.6 $$ 0.00849 $
    CTransformer$ 3.30 $$ 75.6 $$ 0.00158 $
    下载: 导出CSV
    Baidu
  • [1]

    Kim H C, Iza F and Yang S S, Radmilović-Radjenović M and Lee J K 2005 J. Phys. D: Appl. Phys. 38 R283Google Scholar

    [2]

    Wang X C and Zhang Y T 2023 J. Appl. Phys. 133 143301Google Scholar

    [3]

    Zhang Y T, Gao S H and Zhu Y Y 2023 J. Appl. Phys. 133 053303Google Scholar

    [4]

    Sethi S P, Das D P and Behera S K 2023 IEEE Trans. Plasma Sci. 51 1434Google Scholar

    [5]

    Kim B, Im S and Yoo G 2021 Electronics 10 49

    [6]

    Xiao T Q, Wu Z, Christofides P D, Armaou A and Ni D 2021 Ind. Eng. Chem. Res. 61 638

    [7]

    Liau L C K, Huang C J, Chen C C, Huang C S, Chen T, Lin S C and Kuo L C 2002 Sol. Energy Mater. Sol. Cells 71 169Google Scholar

    [8]

    Pan J, Liu Y, Zhang S, Hu X C, Liu Y D and Shao T 2023 Energy Convers. Manage. 277 116620Google Scholar

    [9]

    Wan C G, Yu Z, Wang F, Liu X J and Li J G 2021 Nucl. Fusion 61 066015Google Scholar

    [10]

    Yang Y, Yang S, Li C and Yu Z H 2021 IEEE Access 9 67232Google Scholar

    [11]

    Borghei M and Ghassemi M 2022 IEEE Trans. Dielectr. Electr. Insul. 29 319Google Scholar

    [12]

    Seo J, Kim S, Jalalvand A, Conlin R, Rothstein A, Abbate J, Erickson K, Wai J, Shousha R and Kolemen E 2024 Nature 626 746Google Scholar

    [13]

    Dave B, Patel S, Shivani R, Purohit S and Chaudhury B 2022 Contrib. Plasma Phys. 63 e202200051

    [14]

    Marvin M and Seymour P 1969 MIT Press 6 318

    [15]

    Riedmiller M 1994 Computer Standards & Interfaces 16 265

    [16]

    Krizhevsky A, Sutskever I and Hinton G E 2012 Advances in neural information processing systems 25

    [17]

    Elman J L 1990 Cognitive science 14 179Google Scholar

    [18]

    Hpchreiter S and Schmidhuber J 1997 Neural computation 9 1735Google Scholar

    [19]

    Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez A N, Kaiser Ł and Polosukhin 2017 Advances in neural information processing systems 30

    [20]

    Liu Z M, Wang Y X, Vaidya S, Ruehle F, Halverson J, Soljačić M, Hou Y T and Tegmark M 2024 2404.19756v5[cs.LG]

    [21]

    Von R L, Maryer S, Beckh K, Georgiev B, Giesselbach S, Heese R, Kirsch B, Pfrommer J, Pick A, Ramamurthy R, Walczak M, Garcke J, Bauckhage C and Schuecker J. 2021 IEEE Trans. Knowl. Data Eng. 35 614

    [22]

    Willard J, Jia X, Xu S M, Steinbach M and Kumar V 2020 arXiv: 2003.04919v6[physics.comp-ph]

    [23]

    Basir S 2022 arXiv: 2209.09988v3[cs.LG]

    [24]

    Luo X, Yuan S Y, Tang H W, Xu D, Ran Q H, Cen Y H and Ling D F 2024 Hydrol. Processes 38 e15143Google Scholar

    [25]

    Li J, Wu X and Li Z 2025 Appl. Ocean Res. 161 104661Google Scholar

    [26]

    Guo Y, Li L, Xiang Z, Gui J, Shi S, Lei Z and Xu X 2025 Atomic Energy Science and Technology 59 1085

    [27]

    Li W K and Zhang Y T 2025 J. Appl. Phys. 137 203304Google Scholar

    [28]

    Noh H, Lee J and Yoon E 2025 J. Comput. Phys. 523 113665Google Scholar

    [29]

    Shi H Y, Wang S and Wang P Y 2024 J. Environ. Chem. Eng. 12 112998Google Scholar

    [30]

    Marcus G 2018 arXiv: 1801.00631v1[cs.AI]

    [31]

    Battaglia P W, Hamrick J B, Bapst V, Sanchez-Gonzalez A, Zambaldi V, Malinowski M, Tacchetti A, Raposo D, Santoro A, Faulkner R, Gulcehre C, Song F, Ballard A, Gilmer J, Dahl G, Vaswani A, Allen K, Nash C, Langston V, Dyer C, Heess N, Wierstra D, Kohli P, Botvinick M, Vinyals O, Li Y, Pascanu R. 2018 arXiv: 1806.01261v3[cs.LG]

    [32]

    Xu K, Zhang M, Li J, Du S S, Kawarabayashi K I and Jegelka S 2020 arXiv: 2009.11848v5[cs.LG]

    [33]

    Wu Y, Zhu Z, Liu F, Chrysos G and Cevher V 2022 Advances in neural information processing systems 35 26980

    [34]

    Hestness J, Narang S, Ardalani N, Diamos G, Jun H, Kianinejad H, Patwary M M A, Yang Y and Zhou Y Q 2017 arXiv: 1712.00409v1[cs.LG]

    [35]

    Manzhos S and Ihara M 2023 J. Phys. Chem. A 127 7823Google Scholar

    [36]

    Jain S and Wallace B C 2019 arXiv: 1902.10186v3[cs.CL]

    [37]

    Donko Z, Derzsi A, Vass M, Horvth B, Wilczek S, Horvath B and Hartmann P 2021 Plasma Sources Sci. Technol 30 095017Google Scholar

    [38]

    Jia W Z, Zhang Q Z, Wang X F, Song Y H and Wang Y N 2019 J. Phys. D: Appl. Phys 52 015206Google Scholar

  • [1] 王正君, 陈长城, 云雄飞, 韩昭, 拓娅莉, 杜钰玺, 张欣会, 张春玲, 关晓宁, 谢江舟, 刘刚, 芦鹏飞. 深度学习与密度泛函协同优化无铅双钙钛矿太阳能电池性能.  , doi: 10.7498/aps.75.20251302
    [2] 安吉, 郑君, 陈民, 远晓辉, 盛政明. 利用深度学习从质子成像反演激光等离子体中的磁场分布研究.  , doi: 10.7498/aps.75.20251243
    [3] 张帆, 张恒, 李卓越, 文俊, 胡海豹. 基于深度学习方法的圆柱绕流实验缺失数据重构.  , doi: 10.7498/aps.74.20241689
    [4] 陈鹏博, 王少义, 张文博, 温家星, 吴玉迟, 赵宗清, 王度. 基于深度学习的长波红外介电光栅加速器结构设计.  , doi: 10.7498/aps.74.20250130
    [5] 李京泽, 赵明亮, 张钰如, 高飞, 王友年. 双频容性耦合Ar/CF4等离子体源的多物理场三维仿真.  , doi: 10.7498/aps.74.20251121
    [6] 杨振宇, 张元哲, 范威, 杨广杰, 韩先伟, 谭畅. 可变比冲磁等离子体发动机电离与离子加热过程数值模拟.  , doi: 10.7498/aps.74.20251170
    [7] 史寒旭, 李欣阳, 张钰如, 王友年. 超低频/射频联合驱动容性耦合等离子体中二次电子效应的模拟.  , doi: 10.7498/aps.74.20250341
    [8] 刘鸿江, 刘逸飞, 谷付星. 基于深度学习的微纳光纤自动制备系统.  , doi: 10.7498/aps.73.20240171
    [9] 杨振宇, 张元哲, 范威, 杨广杰, 韩先伟. 磁等离子体发动机中磁喷管分离过程的流体模拟.  , doi: 10.7498/aps.73.20231862
    [10] 欧秀娟, 肖奕. RNA扭转角预测的深度学习方法.  , doi: 10.7498/aps.72.20231069
    [11] 孙涛, 袁健美. 基于深度学习原子特征表示方法的Janus过渡金属硫化物带隙预测.  , doi: 10.7498/aps.72.20221374
    [12] 战庆亮, 葛耀君, 白春锦. 基于深度学习的流场时程特征提取模型.  , doi: 10.7498/aps.71.20211373
    [13] 赵智鹏, 周双, 王兴元. 基于深度学习的新混沌信号及其在图像加密中的应用.  , doi: 10.7498/aps.70.20210561
    [14] 徐昭, 周昕, 白星, 李聪, 陈洁, 倪洋. 基于深度学习的相位截断傅里叶变换非对称加密系统攻击方法.  , doi: 10.7498/aps.70.20202075
    [15] 张瑶, 张云波, 陈立. 基于深度学习的光学表面杂质检测.  , doi: 10.7498/aps.70.20210403
    [16] 王丽, 温德奇, 田崇彪, 宋远红, 王友年. 容性耦合等离子体中电子加热过程及放电参数控制.  , doi: 10.7498/aps.70.20210473
    [17] 高书涵, 王绪成, 张远涛. 脉冲调制条件下介质阻挡特高频放电特性的数值模拟.  , doi: 10.7498/aps.69.20191853
    [18] 郎利影, 陆佳磊, 于娜娜, 席思星, 王雪光, 张雷, 焦小雪. 基于深度学习的联合变换相关器光学图像加密系统去噪方法.  , doi: 10.7498/aps.69.20200805
    [19] 陈炜, 郭媛, 敬世伟. 基于深度学习压缩感知与复合混沌系统的通用图像加密算法.  , doi: 10.7498/aps.69.20201019
    [20] 胡艳婷, 张钰如, 宋远红, 王友年. 相位角对容性耦合电非对称放电特性的影响.  , doi: 10.7498/aps.67.20181400
计量
  • 文章访问数:  503
  • PDF下载量:  13
  • 被引次数: 0
出版历程
  • 收稿日期:  2025-09-19
  • 修回日期:  2025-10-09
  • 上网日期:  2025-10-14

/

返回文章
返回
Baidu
map