Draft:Physics-Enhanced Machine Learning


Physics-enhanced machine learning (PEML) is an approach that combines knowledge of physical systems with data-driven algorithms, aiming to overcome the limitations of using either approach in isolation.{{Cite journal |last1=Haywood-Alexander |first1=Marcus |last2=Liu |first2=Wei |last3=Bacsa |first3=Kiran |last4=Lai |first4=Zhilu |last5=Chatzi |first5=Eleni |date=2024 |title=Discussing the spectrum of physics-enhanced machine learning: a survey on structural mechanics applications |url=https://www.cambridge.org/core/product/identifier/S2632673624000339/type/journal_article |journal=Data-Centric Engineering |language=en |volume=5 |doi=10.1017/dce.2024.33 |issn=2632-6736}} PEML incorporates known physical laws and domain-specific knowledge into the learning process, enabling models to match observed data while remaining consistent with underlying physical principles.

The term PEML is relatively recent and is used to describe a broad class of methods that combine physics and machine learning, including earlier techniques such as physics-informed neural networks (PINNs), which were introduced in 2019.{{Cite journal |last1=Raissi |first1=M. |last2=Perdikaris |first2=P. |last3=Karniadakis |first3=G. E. |date=2019-02-01 |title=Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations |url=https://www.sciencedirect.com/science/article/pii/S0021999118307125 |journal=Journal of Computational Physics |volume=378 |pages=686–707 |doi=10.1016/j.jcp.2018.10.045 |bibcode=2019JCoPh.378..686R |issn=0021-9991}} The concept is closely related to other terms seen in literature such as scientific machine learning (SciML),{{Cite journal |last1=Noordijk |first1=Ben |last2=Garcia Gomez |first2=Monica L. |last3=ten Tusscher |first3=Kirsten H. W. J. |last4=de Ridder |first4=Dick |last5=van Dijk |first5=Aalt D. J. |last6=Smith |first6=Robert W. 
|date=2024-08-02 |title=The rise of scientific machine learning: a perspective on combining mechanistic modelling with machine learning for systems biology |journal=Frontiers in Systems Biology |language=English |volume=4 |doi=10.3389/fsysb.2024.1407994 |issn=2674-0702 |doi-access=free}} physics-informed machine learning (PIML),{{Citation |last1=Hao |first1=Zhongkai |title=Physics-Informed Machine Learning: A Survey on Problems, Methods and Applications |date=2022 |arxiv=2211.08064 |last2=Liu |first2=Songming |last3=Zhang |first3=Yichi |last4=Ying |first4=Chengyang |last5=Feng |first5=Yao |last6=Su |first6=Hang |last7=Zhu |first7=Jun}} physics-enhanced artificial intelligence (PEAI){{Cite arXiv|last1=O'Driscoll |first1=Patrick |last2=Lee |first2=Jaehoon |last3=Fu |first3=Bo |date=2019-03-11 |title=Physics Enhanced Artificial Intelligence |class=cs.AI |language=en |eprint=1903.04442}} and physics-guided machine learning,{{Cite journal |last1=Chen |first1=Shengyu |last2=Kalanat |first2=Nasrin |last3=Xie |first3=Yiqun |last4=Li |first4=Sheng |last5=Zwart |first5=Jacob A. |last6=Sadler |first6=Jeffrey M. |last7=Appling |first7=Alison P. |last8=Oliver |first8=Samantha K. |last9=Read |first9=Jordan S. 
|last10=Jia |first10=Xiaowei |date=2023-08-01 |title=Physics-guided machine learning from simulated data with different physical parameters |url=https://doi.org/10.1007/s10115-023-01864-z |journal=Knowledge and Information Systems |language=en |volume=65 |issue=8 |pages=3223–3250 |doi=10.1007/s10115-023-01864-z |issn=0219-3116}} and, while usage often overlaps, PEML has been positioned as an umbrella term for methods that improve predictive capability through the integration of physics-based components within machine learning architectures.{{Cite journal |last=Cicirello |first=Alice |date=2024-12-01 |title=Physics-Enhanced Machine Learning: a position paper for dynamical systems investigations |url=https://iopscience.iop.org/article/10.1088/1742-6596/2909/1/012034 |journal=Journal of Physics: Conference Series |volume=2909 |issue=1 |pages=012034 |doi=10.1088/1742-6596/2909/1/012034 |bibcode=2024JPhCS2909a2034C |issn=1742-6588}}

Background

PEML has emerged in response to a range of challenges commonly encountered in scientific and data-driven modelling. These include: limited volumes of high-quality data, predictions that may be statistically accurate but physically implausible, difficulty in quantifying uncertainty, and limited interpretability of machine learning models. These limitations have motivated the development of methods that integrate physical knowledge into machine learning, and, by doing so, PEML aims to improve generalisation to unseen conditions, ensure that predictions remain consistent with established physical laws, and enhance the transparency and interpretability of learned models. This concept has gained traction since the early 2020s, when researchers began systematically exploring strategies for embedding domain knowledge into learning algorithms, allowing them to leverage domain-specific structure during training.

Early interest in PEML was driven by challenges observed in fields such as structural mechanics{{Citation |last1=Cross |first1=Elizabeth J. |title=Physics-Informed Machine Learning for Structural Health Monitoring |date=2022 |work=Structural Health Monitoring Based on Data Science Techniques |pages=347–367 |editor-last=Cury |editor-first=Alexandre |url=https://doi.org/10.1007/978-3-030-81716-9_17 |access-date=2025-06-03 |place=Cham |publisher=Springer International Publishing |language=en |doi=10.1007/978-3-030-81716-9_17 |isbn=978-3-030-81716-9 |last2=Gibson |first2=S. J. |last3=Jones |first3=M. R. |last4=Pitchforth |first4=D. J. |last5=Zhang |first5=S. |last6=Rogers |first6=T. J. |arxiv=2206.15303 |editor2-last=Ribeiro |editor2-first=Diogo |editor3-last=Ubertini |editor3-first=Filippo |editor4-last=Todd |editor4-first=Michael D.}} and environmental science,{{Cite journal |last1=Zhao |first1=Ying |last2=Chadha |first2=Mayank |last3=Barthlow |first3=Dakota |last4=Yeates |first4=Elissa |last5=Mcknight |first5=Charles J. |last6=Memarsadeghi |first6=Natalie P. |last7=Gugaratshan |first7=Guga |last8=Todd |first8=Michael D. |last9=Hu |first9=Zhen |date=2024-09-20 |title=Physics-enhanced machine learning models for streamflow discharge forecasting |url=https://doi.org/10.2166/hydro.2024.061 |journal=Journal of Hydroinformatics |volume=26 |issue=10 |pages=2506–2537 |doi=10.2166/hydro.2024.061 |issn=1464-7141}} where purely data-driven methods struggled with limited data or lacked reliability. For instance, in structural engineering, physics-based simulations can be very accurate but often require costly modelling and still face uncertainty in loads or material properties. On the other hand, data-driven models may fit experimental data but fail to generalise outside the training domain. 
PEML approaches were developed to bridge this gap, effectively creating a "spectrum" between the extremes of purely physics-based (white-box modelling) and purely data-driven (black-box modelling), known as grey-box or hybrid modelling.{{Cite journal |last1=Schweidtmann |first1=Artur M. |last2=Zhang |first2=Dongda |last3=von Stosch |first3=Moritz |date=2024-03-01 |title=A review and perspective on hybrid modeling methodologies |url=https://www.sciencedirect.com/science/article/pii/S2772508123000546 |journal=Digital Chemical Engineering |volume=10 |pages=100136 |doi=10.1016/j.dche.2023.100136 |issn=2772-5081}} In practice, this means a PEML model can leverage governing equations or simulation data to inform the learning process, thus requiring less training data and yielding outputs that obey physical laws.

Methods and Techniques

PEML encompasses a range of methods that integrate domain-specific physical knowledge into the machine learning process. These techniques differ in how and where physics is incorporated, whether through loss functions, model structures, feature design, or data generation, and can be summarised into three different categories: physics-informed, physics-guided, and physics-encoded machine learning.

= Physics-Informed Learning =

{{See also|Physics-informed neural networks}}

Physics-informed learning techniques integrate physical laws, often expressed as partial differential equations (PDEs), directly into the machine learning process to simulate complex systems. At an abstract level, a physics-informed model can be described in terms of a final model M and a data-driven model D with parameters \mathbf{\theta}_D, such that:

f(\mathbf{x}) \approx M[D(\mathbf{x}_{measured},\mathbf{y}_{measured},\mathbf{\theta}_D) + biases]

This is typically done by embedding physical constraints into the training process, for example by adding PDE residuals to the loss function. A common example of physics-informed learning is the physics-informed neural network (PINN), which uses a composite loss function that balances data-misfit errors against PDE residuals, effectively blending sparse observations with physical constraints.{{Cite journal |last1=Karniadakis |first1=George Em |last2=Kevrekidis |first2=Ioannis G. |last3=Lu |first3=Lu |last4=Perdikaris |first4=Paris |last5=Wang |first5=Sifan |last6=Yang |first6=Liu |date=2021-05-24 |title=Physics-informed machine learning |url=https://www.nature.com/articles/s42254-021-00314-5 |journal=Nature Reviews Physics |language=en |volume=3 |issue=6 |pages=422–440 |doi=10.1038/s42254-021-00314-5 |bibcode=2021NatRP...3..422K |osti=2282016 |issn=2522-5820}} PINNs are particularly suited for problems involving irregular geometries or sparse measurements due to their ability to operate in a meshless paradigm by sampling random collocation points, and are often used when a larger volume of data is available. Physics-informed learning has been successfully applied in multi-physics systems such as electroconvection,{{Cite journal |last1=Cai |first1=Shengze |last2=Wang |first2=Zhicheng |last3=Lu |first3=Lu |last4=Zaki |first4=Tamer A. 
|last5=Karniadakis |first5=George Em |date=2021-07-01 |title=DeepM&Mnet: Inferring the electroconvection multiphysics fields based on operator approximation by neural networks |url=https://linkinghub.elsevier.com/retrieve/pii/S0021999121001911 |journal=Journal of Computational Physics |language=en |volume=436 |pages=110296 |doi=10.1016/j.jcp.2021.110296|arxiv=2009.12935 |bibcode=2021JCoPh.43610296C }} molecular dynamics,{{Cite book |last1=Jia |first1=Weile |last2=Wang |first2=Han |last3=Chen |first3=Mohan |last4=Lu |first4=Denghui |last5=Lin |first5=Lin |last6=Car |first6=Roberto |last7=Weinan |first7=E |last8=Zhang |first8=Linfeng |chapter=Pushing the Limit of Molecular Dynamics with Ab Initio Accuracy to 100 Million Atoms with Machine Learning |date=2020-11-09 |title=SC20: International Conference for High Performance Computing, Networking, Storage and Analysis |chapter-url=https://doi.org/10.1109/sc41405.2020.00009 |publisher=IEEE |pages=1–14 |doi=10.1109/sc41405.2020.00009|isbn=978-1-7281-9998-6 }} and real-time 4D flow reconstruction from MRI observations.
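
The composite loss described above can be illustrated with a deliberately small sketch. It is not a PINN: a polynomial stands in for the neural network so that the loss can be minimised in closed form with the standard library alone. The governing equation (du/dt + k u = 0), the two measurements, the collocation grid, and all function names are assumptions chosen for illustration.

```python
import math


def solve_linear(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def fit_physics_informed(t_data, u_data, t_colloc, k=1.0, lam=1.0, degree=5):
    """Minimise  sum_i (u(t_i) - u_i)^2  +  lam * sum_j (u'(t_j) + k u(t_j))^2
    over the coefficients of a polynomial u(t) = sum_p c_p t^p."""
    rows, targets = [], []
    # Data-misfit rows: one per measurement.
    for t, u in zip(t_data, u_data):
        rows.append([t ** p for p in range(degree + 1)])
        targets.append(u)
    # Physics-residual rows: the ODE residual is linear in the coefficients.
    w = math.sqrt(lam)
    for t in t_colloc:
        rows.append([
            w * ((p * t ** (p - 1) if p > 0 else 0.0) + k * t ** p)
            for p in range(degree + 1)
        ])
        targets.append(0.0)
    # Normal equations of the stacked least-squares problem.
    n = degree + 1
    A = [[sum(r[i] * r[j] for r in rows) for j in range(n)] for i in range(n)]
    b = [sum(r[i] * y for r, y in zip(rows, targets)) for i in range(n)]
    return solve_linear(A, b)


def predict(coeffs, t):
    return sum(c * t ** p for p, c in enumerate(coeffs))


# Two sparse "measurements" near t = 0 (synthetic, for illustration only).
t_data = [0.0, 0.1]
u_data = [1.0, math.exp(-0.1)]
# Collocation points where only the physics residual is enforced.
t_colloc = [i / 20 for i in range(21)]

coeffs = fit_physics_informed(t_data, u_data, t_colloc)
u_at_1 = predict(coeffs, 1.0)  # extrapolates well beyond the measured range
```

With only two measurements, the physics residual is what lets the fit extrapolate to t = 1. A real PINN would replace the polynomial with a neural network and obtain derivatives by automatic differentiation.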

= Physics-Guided Learning =

In physics-guided learning, physical knowledge is introduced into the learning process not by altering the model itself, but through data preprocessing and feature engineering. This strategy enables conventional learning algorithms to work with inputs that already encode important physical structure, enhancing both accuracy and interpretability. Common techniques include:

  1. Physics-based feature extraction. Raw data is transformed into features with physical meaning, such as dimensionless numbers (e.g. Reynolds number or Mach number), wavelet coefficients, or energy spectra. For example, Mohan et al.{{Cite web |last1=Mohan |first1=Arvind T. |last2=Livescu |first2=Daniel |last3=Chertkov |first3=Michael |date=April 2020 |title=Wavelet-powered neural networks for turbulence |url=https://openreview.net/pdf?id=MLXkfw5y79 }} used wavelet transforms to extract turbulence-related features from velocity fields, embedding known physics of turbulent cascades into the model inputs.
  2. Simulation or theory-driven feature augmentation. Outputs of simplified physical models (or residuals between observed and predicted behaviour) are used as additional features, reducing the learning burden of the machine learning model by letting it focus only on discrepancies. This technique has been used in chemical kinetics applications, where delta learning was applied to graph neural networks (GNNs) to enhance activation energy predictions in chemical reactions,{{Cite journal |last1=Chang |first1=Han-Chung |last2=Tsai |first2=Ming-Hsuan |last3=Li |first3=Yi-Pei |date=2025-02-10 |title=Enhancing Activation Energy Predictions under Data Constraints Using Graph Neural Networks |journal=Journal of Chemical Information and Modeling |volume=65 |issue=3 |pages=1367–1377 |doi=10.1021/acs.jcim.4c02319 |issn=1549-960X |pmc=11815826 |pmid=39862160}} or in quantitative structure-activity relationships (QSARs), where molecular descriptors derived from quantum chemistry calculations or physical models are used as features to predict chemical properties.{{Cite journal |last1=Cherkasov |first1=Artem |last2=Muratov |first2=Eugene N. |last3=Fourches |first3=Denis |last4=Varnek |first4=Alexandre |last5=Baskin |first5=Igor I. |last6=Cronin |first6=Mark |last7=Dearden |first7=John |last8=Gramatica |first8=Paola |last9=Martin |first9=Yvonne C. |last10=Todeschini |first10=Roberto |last11=Consonni |first11=Viviana |last12=Kuz'min |first12=Victor E. |last13=Cramer |first13=Richard |last14=Benigni |first14=Romualdo |last15=Yang |first15=Chihae |date=2014-06-26 |title=QSAR modeling: where have you been? Where are you going to? |journal=Journal of Medicinal Chemistry |volume=57 |issue=12 |pages=4977–5010 |doi=10.1021/jm4004285 |issn=1520-4804 |pmc=4074254 |pmid=24351051}}
  3. Physical domain transformations. Data is mapped into domains where physics-relevant patterns become more visible. For example, signal processing often employs Fourier transforms to expose the frequency content of oscillatory signals. Standard vision models such as convolutional neural networks (CNNs) can then learn from spectrograms instead of raw waveforms, yielding better generalisation and efficiency.{{Cite journal |last1=Rudolph |first1=Maja |last2=Kurz |first2=Stefan |last3=Rakitsch |first3=Barbara |date=2024-03-19 |title=Hybrid modeling design patterns |journal=Journal of Mathematics in Industry |volume=14 |issue=1 |pages=3 |doi=10.1186/s13362-024-00141-0 |issn=2190-5983 |doi-access=free}}

At an abstract level, physics-guided learning can be represented with a physics-based model G and latent physics-based model parameters \mathbf{\theta}_p as:

f(\mathbf{x})\approx M[G(\mathbf{x}_{model},\mathbf{\theta}_p)+biases]

These preprocessing methods are especially useful when physical insight is available, but the system is too complex for fully mechanistic modelling. By encoding physics into the data, standard machine learning architectures such as multi-layer perceptrons (MLPs) can be trained without needing architecture-specific changes. As a result, the learned function is implicitly constrained by the structured inputs, reducing the need to learn fundamental physical relationships from scratch.{{Cite journal |last1=Faroughi |first1=Salah A. |last2=Pawar |first2=Nikhil M. |last3=Fernandes |first3=Célio |last4=Raissi |first4=Maziar |last5=Das |first5=Subasish |last6=Kalantari |first6=Nima K. |last7=Kourosh Mahjour |first7=Seyed |date=2024-01-29 |title=Physics-Guided, Physics-Informed, and Physics-Encoded Neural Networks and Operators in Scientific Computing: Fluid and Solid Mechanics |url=https://doi.org/10.1115/1.4064449 |journal=Journal of Computing and Information Science in Engineering |volume=24 |issue=40802 |doi=10.1115/1.4064449 |issn=1530-9827}}
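
As a rough illustration of the feature-engineering techniques listed above, the sketch below computes two physically meaningful features before any learning takes place: a Reynolds number and the dominant frequency of a signal, obtained with a naive discrete Fourier transform. The fluid properties, the sampled signal, and the function names are hypothetical values chosen for the example.

```python
import cmath
import math


def reynolds_number(density, velocity, length, viscosity):
    """Dimensionless ratio of inertial to viscous forces: Re = rho * v * L / mu."""
    return density * velocity * length / viscosity


def dominant_frequency(signal, sample_rate):
    """Return the frequency (Hz) of the largest non-DC DFT component."""
    n = len(signal)
    best_k, best_mag = 0, 0.0
    for k in range(1, n // 2 + 1):
        coeff = sum(
            x * cmath.exp(-2j * math.pi * k * i / n) for i, x in enumerate(signal)
        )
        if abs(coeff) > best_mag:
            best_k, best_mag = k, abs(coeff)
    return best_k * sample_rate / n


# Hypothetical raw measurements: a 5 Hz oscillation sampled at 100 Hz.
sample_rate = 100.0
signal = [math.sin(2 * math.pi * 5.0 * i / sample_rate) for i in range(200)]

# Physics-based features that a conventional learner could take as inputs.
features = {
    "reynolds": reynolds_number(
        density=1000.0, velocity=2.0, length=0.05, viscosity=1.0e-3
    ),
    "dominant_hz": dominant_frequency(signal, sample_rate),
}
```

A downstream model would be trained on `features` rather than on the raw waveform, so the learner itself needs no physics-specific changes.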

= Physics-Encoded Learning =

Physics-encoded learning, sometimes referred to as hybrid modelling, combines physics-based components with data-driven components in a single unified framework. This approach is useful when the underlying physical laws are partially understood but insufficient to describe the full system behaviour, or are computationally expensive to simulate. In such methods, the final model M integrates the physics-based model G and the data-driven correction term D, along with additional biases that narrow the solution space to physically plausible outputs, such that the system takes the form:

f(\mathbf{x})\approx M[G(\mathbf{x}_{model},\mathbf{\theta}_p) \cup D(\mathbf{x}_{measured},\mathbf{y}_{measured}, \mathbf{\theta}_D) + biases]

This hybrid setup allows the machine learning component to compensate for missing or poorly understood physics, while ensuring the model respects key physical constraints embedded in G. Common examples of physics-encoded learning include Gaussian process (GP) latent force models{{Cite journal |last1=Zou |first1=Joanna |last2=Lourens |first2=Eliz-Mari |last3=Cicirello |first3=Alice |date=2023-10-01 |title=Virtual sensing of subsoil strain response in monopile-based offshore wind turbines via Gaussian process latent force models |url=https://doi.org/10.1016/j.ymssp.2023.110488 |journal=Mechanical Systems and Signal Processing |volume=200 |pages=110488 |doi=10.1016/j.ymssp.2023.110488 |arxiv=2207.05901 |bibcode=2023MSSP..20010488Z |issn=0888-3270}}{{Cite journal |last1=Marino |first1=Luca |last2=Cicirello |first2=Alice |date=2023-07-24 |title=A switching Gaussian process latent force model for the identification of mechanical systems with a discontinuous nonlinearity |url=https://www.cambridge.org/core/journals/data-centric-engineering/article/switching-gaussian-process-latent-force-model-for-the-identification-of-mechanical-systems-with-a-discontinuous-nonlinearity/29AB5151DDD352DE4BCA57CE9C73562B |journal=Data-Centric Engineering |language=en |volume=4 |pages=e18 |doi=10.1017/dce.2023.12 |issn=2632-6736}} and physics-informed sparse identification of nonlinear dynamics (PhI-SINDy),{{Citation |last=Lathourakis |first=Christos |title=xristosl0610/PhI-SINDy |date=2025-03-20 |url=https://github.com/xristosl0610/PhI-SINDy |access-date=2025-06-03}} which have been used to model multiple degree-of-freedom (MDOF) oscillators with multiple Coulomb friction contacts under harmonic load, using noisy synthetic and experimental data with multiple sources of discontinuous nonlinearities.{{Cite journal |last1=Lathourakis |first1=Christos |last2=Cicirello |first2=Alice |date=2024-07-01 |title=Physics enhanced sparse identification of dynamical systems with discontinuous nonlinearities |url=https://doi.org/10.1007/s11071-024-09652-2 |journal=Nonlinear Dynamics |language=en |volume=112 |issue=13 |pages=11237–11264 |doi=10.1007/s11071-024-09652-2 |bibcode=2024NonDy.11211237L |issn=1573-269X}}
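
One simple way to read the combination of G and D above is as an additive discrepancy model, in which D is fitted to whatever the physics model fails to explain. The sketch below assumes that additive form. The linear-spring physics model, the cubic term it omits, and the piecewise-linear interpolator standing in for the data-driven component are all hypothetical choices made for illustration, and are far simpler than latent force models or PhI-SINDy.

```python
from bisect import bisect_left


def physics_model(x, stiffness=2.0):
    """G: an idealised linear spring, F = k x (incomplete physics)."""
    return stiffness * x


def true_system(x):
    """Stand-in for reality: the spring also has an unmodelled cubic term."""
    return 2.0 * x + 0.5 * x ** 3


def fit_residual_model(xs, ys):
    """D: piecewise-linear interpolation of the residual y - G(x)."""
    pts = sorted((x, y - physics_model(x)) for x, y in zip(xs, ys))
    grid = [p[0] for p in pts]
    res = [p[1] for p in pts]

    def correction(x):
        # Hold the end values constant outside the training range.
        if x <= grid[0]:
            return res[0]
        if x >= grid[-1]:
            return res[-1]
        j = bisect_left(grid, x)
        x0, x1 = grid[j - 1], grid[j]
        w = (x - x0) / (x1 - x0)
        return (1.0 - w) * res[j - 1] + w * res[j]

    return correction


# Synthetic "measurements" on a coarse grid from -2 to 2.
train_x = [-2.0 + 0.25 * i for i in range(17)]
train_y = [true_system(x) for x in train_x]
correction = fit_residual_model(train_x, train_y)


def hybrid_model(x):
    """M: physics prediction plus learned discrepancy."""
    return physics_model(x) + correction(x)


x_test = 1.1
physics_error = abs(physics_model(x_test) - true_system(x_test))
hybrid_error = abs(hybrid_model(x_test) - true_system(x_test))
```

In this toy setting the hybrid error at `x_test` is much smaller than the physics-only error, because the data-driven term only has to learn the missing cubic contribution.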

Applications of Physics-Enhanced Machine Learning

PEML methods have moved beyond theoretical development and are now actively deployed in real-world systems across engineering, biology, chemistry,{{Cite journal |last1=Zivar |first1=Davood |last2=Pourafshary |first2=Peyman |date=2019-09-01 |title=A new approach for predicting oil recovery factor during immiscible CO2 flooding in sandstones using dimensionless numbers |url=https://doi.org/10.1007/s13202-019-0630-0 |journal=Journal of Petroleum Exploration and Production Technology |language=en |volume=9 |issue=3 |pages=2325–2332 |doi=10.1007/s13202-019-0630-0 |issn=2190-0566}} physics, scientific discovery, and computer science,{{cite conference |last1=Zhang |first1=Lvmin |last2=Rao |first2=Anyi |last3=Agrawala |first3=Maneesh |date=2025-01-22 |title=Scaling In‑the‑Wild Training for Diffusion‑based Illumination Harmonization and Editing by Imposing Consistent Light Transport |url=https://openreview.net/forum?id=u1cQYxRI1H |access-date=2025-06-12 |book-title=Proceedings of the International Conference on Learning Representations}} to name a few applications. These applications are especially valuable in high-stakes or data-scarce environments where traditional machine learning or purely physics-based models may fall short.

= Wind Turbine Structural Monitoring =

PEML has been applied to predict fatigue loads in wind turbine blades under wake steering control (WSC), a strategy that improves wind farm efficiency by intentionally misaligning turbine yaw angles to reduce wake interference.{{Cite journal |last1=Miao |first1=Yizhi |last2=Soltani |first2=Mohsen N. |last3=Hajizadeh |first3=Amin |date=2022-07-22 |title=A Machine Learning Method for Modeling Wind Farm Fatigue Load |journal=Applied Sciences |language=en |volume=12 |issue=15 |pages=7392 |doi=10.3390/app12157392 |doi-access=free |issn=2076-3417 }} While WSC can enhance power output, it also introduces additional fatigue loads on downstream turbines, complicating structural health monitoring. Traditional methods, such as look-up tables (LUTs), approximate turbine loads based on precomputed simulations, but may struggle to capture complex wake-induced loading effects under high turbulence or non-standard conditions.{{Cite journal |last1=Mendez Reyes |first1=Hector |last2=Kanev |first2=Stoyan |last3=Doekemeijer |first3=Bart |last4=van Wingerden |first4=Jan-Willem |date=2019-10-11 |title=Validation of a lookup-table approach to modeling turbine fatigue loads in wind farms under active wake control |url=https://wes.copernicus.org/articles/4/549/2019/ |journal=Wind Energy Science |language=English |volume=4 |issue=4 |pages=549–561 |doi=10.5194/wes-4-549-2019 |issn=2366-7443 |doi-access=free|bibcode=2019WiEnS...4..549M }} A recent approach addressed this by using Gaussian process (GP) models trained on physics-informed features, including damage-equivalent loads (DELs) derived from rainflow counting and the Palmgren-Miner rule. These GPs provided probabilistic fatigue predictions with improved accuracy. Compared to LUTs, the PEML model reduced the root mean square error (RMSE) by 13.99% for edgewise moments and by 51.87% for flapwise moments, highlighting the value of incorporating fatigue physics into machine learning-based predictive maintenance.
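
The fatigue features mentioned above follow from two standard relations: the Palmgren-Miner linear damage sum and the damage-equivalent load for a given Wöhler (S-N) exponent. The sketch below assumes the load cycles have already been extracted, for example by rainflow counting, which is not implemented here. The function names and the numbers used are illustrative and are not taken from the cited study.

```python
def miner_damage(cycles):
    """Palmgren-Miner rule: D = sum(n_i / N_i).

    cycles: iterable of (applied_cycles, cycles_to_failure) pairs.
    Failure is conventionally predicted when D reaches 1.
    """
    return sum(n / n_fail for n, n_fail in cycles)


def damage_equivalent_load(ranges_counts, wohler_exponent, n_equivalent):
    """DEL = (sum(n_i * S_i^m) / N_eq)^(1/m).

    ranges_counts: iterable of (load_range, cycle_count) pairs,
    assumed to come from a cycle-counting step such as rainflow counting.
    """
    total = sum(
        count * load_range ** wohler_exponent for load_range, count in ranges_counts
    )
    return (total / n_equivalent) ** (1.0 / wohler_exponent)


# Illustrative values only.
damage = miner_damage([(500, 1000), (250, 1000)])
del_value = damage_equivalent_load(
    [(10.0, 100)], wohler_exponent=3.0, n_equivalent=100
)
```

When a single load range is applied for exactly `n_equivalent` cycles, the DEL reduces to that load range, which is a convenient sanity check.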

= Tuned Mass Damper Optimisation =

Tuned mass dampers (TMDs) are widely used to mitigate structural vibrations in tall buildings during seismic events.{{Cite thesis |last=Kourakis |first=Ioannis |title=Structural systems and tuned mass dampers of super-tall buildings : case study of Taipei 101 |date=2007 |degree=Thesis |publisher=Massachusetts Institute of Technology |hdl=1721.1/38947 |url=https://dspace.mit.edu/handle/1721.1/38947}} Traditional physics-based design methods, such as the Den Hartog approach, assume linear structural behaviour and do not fully capture the effects of nonlinear dynamics or variable seismic loads.{{cite book |last=Den Hartog |first=Jacob P. |title=Mechanical Vibrations |publisher=Dover Publications |year=1985 |isbn=978-0-486-64785-2 |edition=Reprint |location=New York, NY |pages=93–105}} Conversely, purely data-driven optimisation techniques may lack physical constraints, resulting in unrealistic or inefficient damping configurations. To address this, researchers developed a PEML framework based on a generative adversarial network (GAN) architecture.{{Cite journal |last1=Yang |first1=Xiongjun |last2=Lei |first2=Ying |last3=Wang |first3=Junjie |last4=Zhu |first4=Hongping |last5=Shen |first5=Wenai |date=2023-10-01 |title=Physics-enhanced machine learning-based optimization of tuned mass damper parameters for seismically-excited buildings |url=https://www.sciencedirect.com/science/article/pii/S0141029623007940 |journal=Engineering Structures |volume=292 |pages=116379 |doi=10.1016/j.engstruct.2023.116379 |bibcode=2023EngSt.29216379Y |issn=0141-0296}} The system incorporates a physical evaluation network into the GAN loop to guide the generation of TMD parameters (the natural frequency and damping ratio) under realistic seismic excitations. This approach was tested on both linear shear-type structures and nonlinear moment-resisting frames. 
Compared to traditional particle swarm optimisation (PSO), the physics-enhanced GAN achieved a 24.14% reduction in displacement under seismic loading while reducing computational cost by 80%, demonstrating the effectiveness of hybrid machine learning approaches in structural vibration control.
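
For reference, the classical closed-form tuning associated with the Den Hartog approach can be sketched as follows. It assumes an undamped linear primary structure under harmonic force excitation, with the damper's damping ratio defined relative to the damper's own natural frequency. These textbook expressions are shown as a baseline only; they are not the GAN-based method of the cited study, and the function name is hypothetical.

```python
import math


def den_hartog_tmd(mass_ratio):
    """Classical tuned-mass-damper tuning for mass ratio mu = m_damper / m_structure.

    Returns (frequency_ratio, damping_ratio) where
      frequency_ratio = damper natural frequency / structure natural frequency
                      = 1 / (1 + mu)
      damping_ratio   = sqrt(3 mu / (8 (1 + mu)))
    """
    frequency_ratio = 1.0 / (1.0 + mass_ratio)
    damping_ratio = math.sqrt(3.0 * mass_ratio / (8.0 * (1.0 + mass_ratio)))
    return frequency_ratio, damping_ratio


# Example: a damper weighing 5% of the structure's modal mass.
freq_ratio, zeta = den_hartog_tmd(0.05)
```

Because these formulas depend only on the mass ratio, they cannot adapt to nonlinear structural behaviour or to a particular ground motion, which is the limitation the text describes.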

= Satellite Attitude Control =

In spacecraft missions involving dynamic payload changes, such as active debris removal, traditional attitude control systems (ACS) that rely on fixed mass and inertia properties may struggle to maintain stability. A study published in Frontiers in Robotics and AI proposed a PEML approach using deep reinforcement learning (DRL) to address this challenge.{{Cite journal |last1=Retagne |first1=Wiebke |last2=Dauer |first2=Jonas |last3=Waxenegger-Wilfing |first3=Günther |date=2024-07-23 |title=Adaptive satellite attitude control for varying masses using deep reinforcement learning |journal=Frontiers in Robotics and AI |language=English |volume=11 |doi=10.3389/frobt.2024.1402846 |doi-access=free |pmid=39109322 |pmc=11300345 |issn=2296-9144}} The method integrated physics-based simulation with DRL algorithms such as proximal policy optimisation (PPO) and soft actor-critic (SAC),{{Citation |last1=Haarnoja |first1=Tuomas |title=Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor |date=2018 |arxiv=1801.01290 |last2=Zhou |first2=Aurick |last3=Abbeel |first3=Pieter |last4=Levine |first4=Sergey}} and was trained using the Basilisk high-fidelity spacecraft simulator, which models Newtonian rotational dynamics and reaction wheel behaviour.{{Cite web |title=Welcome to Basilisk: an Astrodynamics Simulation Framework — Basilisk 2.7.9 documentation |url=https://avslab.github.io/basilisk/ |access-date=2025-06-05 |website=avslab.github.io}} The approach incorporated "stacked observations," feeding sequences of sensor readings (e.g. angular velocities and torques) into the learning model to enable inference of unknown mass properties over time. Compared to conventional proportional-integral-derivative (PID) controllers, DRL controllers with stacked observations achieved improved control performance, particularly in scenarios involving unknown or varying mass distributions. 
In simulations, the SAC controller with stacking reduced attitude error by up to 78° and settled the spacecraft 26 seconds faster than the PID controller. These results highlight the potential of PEML methods for improving control robustness under uncertain spacecraft dynamics.
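
The idea of stacked observations can be sketched with a fixed-length buffer that concatenates the most recent sensor readings into one input vector. The class name, the zero padding at start-up, and the oldest-first ordering are assumptions made for illustration; the cited study's exact observation layout may differ.

```python
from collections import deque


class ObservationStacker:
    """Keep the last `stack_size` observations and expose them as one flat vector."""

    def __init__(self, stack_size, obs_dim):
        # Assumption: pad with zeros until enough real observations arrive.
        self.frames = deque(
            [[0.0] * obs_dim for _ in range(stack_size)], maxlen=stack_size
        )

    def push(self, observation):
        """Add the newest observation and return the stacked vector (oldest first)."""
        self.frames.append(list(observation))
        return [value for frame in self.frames for value in frame]


# Hypothetical readings, e.g. (angular velocity, applied torque) at each step.
stacker = ObservationStacker(stack_size=3, obs_dim=2)
first = stacker.push([1.0, 2.0])
second = stacker.push([3.0, 4.0])
```

Feeding the policy a short history rather than a single reading lets it relate applied torque to the resulting change in angular velocity, which carries information about the unknown inertia.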

= Aircraft Flight Path Optimisation =

Accurate upper-air wind field prediction is essential for optimising aircraft trajectories to reduce fuel consumption and flight time. Traditional numerical weather prediction (NWP) methods, while physically rigorous, are computationally expensive and limited in short-term forecasting, requiring hours of supercomputing time for multi-day forecasts.{{Cite journal |last1=Bi |first1=Kaifeng |last2=Xie |first2=Lingxi |last3=Zhang |first3=Hengheng |last4=Chen |first4=Xin |last5=Gu |first5=Xiaotao |last6=Tian |first6=Qi |date=2023-07-05 |title=Accurate medium-range global weather forecasting with 3D neural networks |journal=Nature |language=en |volume=619 |issue=7970 |pages=533–538 |doi=10.1038/s41586-023-06185-3 |pmid=37407823 |pmc=10356604 |bibcode=2023Natur.619..533B |issn=1476-4687}} A recent study proposed a method that integrates a predictive recurrent neural network (PredRNN) with an improved A* pathfinding algorithm to generate efficient flight routes in dynamic wind conditions.{{Cite journal |last1=Ma |first1=Jieying |last2=Xiang |first2=Pengyu |last3=Yao |first3=Qinghe |last4=Jiang |first4=Zichao |last5=Huang |first5=Jiayao |last6=Li |first6=Hejie |date=2025-01-23 |title=Optimizing Aircraft Route Planning Based on Data-Driven and Physics-Informed Wind Field Predictions |journal=Mathematics |language=en |volume=13 |issue=3 |pages=367 |doi=10.3390/math13030367 |doi-access=free |issn=2227-7390 }} PredRNN was trained on ERA5 wind data{{Cite web |title=ERA5 hourly data on single levels from 1940 to present |url=https://cds.climate.copernicus.eu/datasets/reanalysis-era5-single-levels?tab=overview |access-date=2025-06-07 |website=cds.climate.copernicus.eu |language=en}} at cruising altitudes of 5,500 m along major Chinese airline routes, using a loss function informed by the Navier-Stokes equations. The resulting wind field forecasts enable the A* algorithm to avoid zones of high turbulence and optimise routes in real time up to 10 hours in advance. 
Compared to standard neural network and physics-based approaches, this framework improved forecasting accuracy and produced safer, more fuel-efficient trajectories.
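
The routing step can be illustrated with a plain A* search on a small grid in which each cell's cost grows with a forecast turbulence value. This is textbook A*, not the improved variant used in the cited study, and the grid, the cost rule, and the function name are hypothetical.

```python
import heapq


def a_star(cost_grid, start, goal):
    """Plain A* on a 4-connected grid.

    cost_grid[r][c] is the cost of entering cell (r, c); every cost is >= 1,
    so the Manhattan distance is an admissible heuristic.
    Returns (path, total_cost), or (None, inf) if the goal is unreachable.
    """
    rows, cols = len(cost_grid), len(cost_grid[0])

    def heuristic(cell):
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    open_heap = [(heuristic(start), 0.0, start)]
    best = {start: 0.0}
    parent = {}
    while open_heap:
        _, g, cell = heapq.heappop(open_heap)
        if cell == goal:
            path = [cell]
            while cell in parent:
                cell = parent[cell]
                path.append(cell)
            return path[::-1], g
        if g > best.get(cell, float("inf")):
            continue  # stale heap entry
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if 0 <= nr < rows and 0 <= nc < cols:
                ng = g + cost_grid[nr][nc]
                if ng < best.get((nr, nc), float("inf")):
                    best[(nr, nc)] = ng
                    parent[(nr, nc)] = cell
                    heapq.heappush(
                        open_heap, (ng + heuristic((nr, nc)), ng, (nr, nc))
                    )
    return None, float("inf")


# Hypothetical forecast: a patch of strong turbulence in the middle of the grid.
turbulence = [
    [0, 0, 0, 0],
    [0, 9, 9, 0],
    [0, 0, 0, 0],
]
cost_grid = [[1.0 + t for t in row] for row in turbulence]
path, cost = a_star(cost_grid, start=(1, 0), goal=(1, 3))
```

Here the direct route through the turbulent cells would cost 21, so the search detours around them at a cost of 5.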

= Streamflow Discharge Forecasting =

Accurate river discharge forecasting is critical for flood mitigation, as well as waterway management and infrastructure planning. Physics-based hydrological models such as RAPID (Routing Application for Parallel Computation of Discharge) simulate river flow based on the Muskingum algorithm but often make assumptions such as linear process modelling and reliance on adjacent inflows, simplifying the problem.{{Cite journal |last=Cunge |first=J. A. |date=1969-01-01 |title=On The Subject Of A Flood Propagation Computation Method (Musklngum Method) |url=https://doi.org/10.1080/00221686909500264 |journal=Journal of Hydraulic Research |volume=7 |issue=2 |pages=205–230 |doi=10.1080/00221686909500264 |bibcode=1969JHydR...7..205C |issn=0022-1686}}{{Cite journal |last1=Tavakoly |first1=Ahmad A. |last2=David |first2=Cédric H. |last3=Gutenson |first3=Joseph L. |last4=Wahl |first4=Mark W. |last5=Follum |first5=Mike |date=2023-03-01 |title=Development of non-data driven reservoir routing in the routing application for parallel computatIon of discharge (RAPID) model |url=https://www.sciencedirect.com/science/article/pii/S1364815223000178 |journal=Environmental Modelling & Software |volume=161 |pages=105631 |doi=10.1016/j.envsoft.2023.105631 |bibcode=2023EnvMS.16105631T |issn=1364-8152}} These limitations can lead to deviations from observed discharge values, especially in complex or ungauged river networks. To address this, a PEML approach was proposed that integrates RAPID with data-driven models using delta learning and data augmentation techniques.{{Cite journal |last1=Zhao |first1=Ying |last2=Chadha |first2=Mayank |last3=Barthlow |first3=Dakota |last4=Yeates |first4=Elissa |last5=Mcknight |first5=Charles J. |last6=Memarsadeghi |first6=Natalie P. |last7=Gugaratshan |first7=Guga |last8=Todd |first8=Michael D. 
|last9=Hu |first9=Zhen |date=2024-09-20 |title=Physics-enhanced machine learning models for streamflow discharge forecasting |url=https://doi.org/10.2166/hydro.2024.061 |journal=Journal of Hydroinformatics |volume=26 |issue=10 |pages=2506–2537 |doi=10.2166/hydro.2024.061 |issn=1464-7141}} These hybrid models combine physical runoff simulations with machine learning algorithms, including Gaussian process nonlinear autoregressive with exogenous inputs (GP-NARX),{{Citation |last=Särkkä |first=Simo |title=The Use of Gaussian Processes in System Identification |date=2019 |encyclopedia=Encyclopedia of Systems and Control |pages=1–10 |url=https://link.springer.com/rwe/10.1007/978-1-4471-5102-9_100087-1 |access-date=2025-06-07 |publisher=Springer, London |language=en |doi=10.1007/978-1-4471-5102-9_100087-1 |isbn=978-1-4471-5102-9}} neural networks, and bidirectional long short-term memory (LSTM) networks. The goal is to compensate for uncertainties in the RAPID model by learning the discrepancies between predicted and gauged discharge values and by using additional basin-wide runoff data to inform forecasts. The study demonstrated that the hybrid PEML models significantly outperformed RAPID alone, improving discharge prediction accuracy by a factor of four to seven across various river systems in the United States. By leveraging both physical principles and basin-wide hydrological data, the approach enables robust, long-range forecasting in data-limited conditions and enhances the reliability of streamflow predictions for gauged rivers.
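
The physics-based component referred to above rests on Muskingum routing, which for a single river reach can be sketched as below. RAPID itself solves a matrix form of this scheme over a whole river network, so this single-reach version, its function names, and the parameter values are simplifications chosen for illustration.

```python
def muskingum_coefficients(K, X, dt):
    """Muskingum coefficients for storage constant K, weighting factor X, time step dt.

    The three coefficients always sum to 1. They are all non-negative when
    2 K X <= dt <= 2 K (1 - X).
    """
    denom = K * (1.0 - X) + 0.5 * dt
    c0 = (0.5 * dt - K * X) / denom
    c1 = (0.5 * dt + K * X) / denom
    c2 = (K * (1.0 - X) - 0.5 * dt) / denom
    return c0, c1, c2


def route(inflow, K, X, dt, initial_outflow=None):
    """Route an inflow hydrograph through one reach:
    O[t] = c0 * I[t] + c1 * I[t-1] + c2 * O[t-1]."""
    c0, c1, c2 = muskingum_coefficients(K, X, dt)
    outflow = [inflow[0] if initial_outflow is None else initial_outflow]
    for i in range(1, len(inflow)):
        outflow.append(c0 * inflow[i] + c1 * inflow[i - 1] + c2 * outflow[-1])
    return outflow


# Hypothetical parameters (K and dt in the same time unit).
coeffs = muskingum_coefficients(K=2.0, X=0.2, dt=1.0)
steady = route([10.0] * 5, K=2.0, X=0.2, dt=1.0)
pulse = route([10.0, 30.0, 10.0, 10.0, 10.0, 10.0], K=2.0, X=0.2, dt=1.0)
```

In a delta-learning setup, a data-driven model would then be trained on the difference between routed outflow of this kind and gauged discharge.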

= CO<sub>2</sub> Flooding Recovery Prediction =

PEML has been applied to predict oil recovery during immiscible CO<sub>2</sub> flooding in sandstone reservoirs, a widely used enhanced oil recovery (EOR) method.{{Cite journal |last1=Zivar |first1=Davood |last2=Pourafshary |first2=Peyman |date=2019-09-01 |title=A new approach for predicting oil recovery factor during immiscible CO2 flooding in sandstones using dimensionless numbers |url=https://doi.org/10.1007/s13202-019-0630-0 |journal=Journal of Petroleum Exploration and Production Technology |language=en |volume=9 |issue=3 |pages=2325–2332 |doi=10.1007/s13202-019-0630-0 |issn=2190-0566}} Traditional core-flooding experiments and physics-based models, while informative, often rely on simplifying assumptions regarding flow dynamics which can limit their predictive accuracy.{{Cite journal |last1=Safi |first1=Razi |last2=Agarwal |first2=Ramesh K. |last3=Banerjee |first3=Subhodeep |date=2016-04-22 |title=Numerical simulation and optimization of CO2 utilization for enhanced oil recovery from depleted reservoirs |url=https://www.sciencedirect.com/science/article/pii/S000925091630001X |journal=Chemical Engineering Science |volume=144 |pages=30–38 |doi=10.1016/j.ces.2016.01.021 |issn=0009-2509}} To improve generalisation, researchers developed a PEML framework combining experimental data with physically informed features expressed through dimensionless numbers, which include the capillary number, relative radius (based on porosity and permeability), injection pressure ratio, and oil composition number. The model was trained on core-flooding datasets spanning a wide range of reservoir conditions: porosity (10.8–37.2%), permeability (1–18,000 mD), injection pressures (2.73–11.44 MPa), flow rates, and various crude oil types. Rather than relying on individual parameters, the PEML model used a grouped dimensionless formulation to represent the combined physical forces governing displacement efficiency. 
A logarithmic correlation was found between these grouped parameters and the oil recovery factor, achieving strong agreement with experimental results (81% confidence). This approach demonstrated improved accuracy over traditional methods and highlighted the benefits of embedding domain knowledge into machine learning for more robust EOR performance prediction.
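
A logarithmic correlation of the kind described above can be sketched as a least-squares fit of the recovery factor against the natural logarithm of a grouped dimensionless parameter. The sketch below is illustrative only: the data are synthetic, the coefficients are arbitrary, and the way the cited study combines its dimensionless numbers into a single group is not reproduced here. The capillary number is computed with its standard definition (viscosity times velocity divided by interfacial tension); all names are hypothetical.

```python
import numpy as np

def capillary_number(viscosity, velocity, interfacial_tension):
    """Standard capillary number: ratio of viscous to interfacial forces."""
    return viscosity * velocity / interfacial_tension

def fit_log_correlation(group, recovery):
    """Fit recovery = a * ln(group) + b by linear least squares."""
    a, b = np.polyfit(np.log(group), recovery, deg=1)
    return a, b

# Example capillary number in SI units (Pa.s, m/s, N/m); values are illustrative.
ca = capillary_number(1.0e-3, 1.0e-5, 0.03)

# Synthetic grouped parameter and recovery factor (assumption, not study data).
rng = np.random.default_rng(1)
group = np.logspace(-3, 1, 50)
recovery = 0.05 * np.log(group) + 0.5 + rng.normal(0.0, 0.01, size=50)

a, b = fit_log_correlation(group, recovery)
print(f"Ca = {ca:.2e}; fitted recovery = {a:.3f} * ln(G) + {b:.3f}")
```

Grouping the inputs into dimensionless numbers before fitting means the regression operates on ratios of physical forces rather than on raw, unit-dependent measurements.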

= Illumination Harmonisation and Editing =

{{See also|Physically based rendering}}

In image processing and computer vision, PEML has been used to improve illumination harmonisation and editing tasks. Traditional graphics models are often computationally expensive and may struggle to generalise to diverse real-world lighting conditions.{{Cite journal |last1=Schmidt |first1=Thorsten-Walther |last2=Pellacini |first2=Fabio |last3=Nowrouzezahrai |first3=Derek |last4=Jarosz |first4=Wojciech |last5=Dachsbacher |first5=Carsten |date=2016 |title=State of the Art in Artistic Editing of Appearance, Lighting and Material |url=https://onlinelibrary.wiley.com/doi/abs/10.1111/cgf.12721 |journal=Computer Graphics Forum |volume=35 |issue=1 |pages=216–233 |doi=10.1111/cgf.12721 |hdl=11380/1299569 |issn=1467-8659}} Conversely, standard diffusion-based models are powerful for generative tasks, but can alter intrinsic image properties such as albedo or reflectance, leading to unrealistic visual artifacts. To address these limitations, researchers proposed a PEML-based training strategy known as Imposing Consistent Light (IC-Light) transport. This method incorporates physical light transport theory into the training of diffusion-based illumination models by enforcing a consistency principle: the linear blending of different lighting conditions should reflect physically plausible results. By embedding this constraint during training, the model learns to modify illumination without distorting other visual features of the image. IC-Light was applied to a large-scale training regime involving over 10 million samples, including real photographs, rendered data, and in-the-wild synthetic augmentations. The model was benchmarked against several baselines (e.g. 
SwitchLight{{Cite book |last1=Kim |first1=Hoon |last2=Jang |first2=Minje |last3=Yoon |first3=Wonjun |last4=Lee |first4=Jisoo |last5=Na |first5=Donghyun |last6=Woo |first6=Sanghyun |chapter=SwitchLight: Co-Design of Physics-Driven Architecture and Pre-training Framework for Human Portrait Relighting |date=2024-06-16 |title=2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) |chapter-url=https://ieeexplore.ieee.org/document/10658582 |pages=25096–25106 |doi=10.1109/CVPR52733.2024.02371|arxiv=2402.18848 |isbn=979-8-3503-5300-6 }} and DiLightNet{{Citation |last1=Zeng |first1=Chong |title=Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Papers |date=2024 |url=https://github.com/iamNCJ/DiLightNet |access-date=2025-06-08 |last2=Dong |first2=Yue |last3=Peers |first3=Pieter |last4=Kong |first4=Youkang |last5=Wu |first5=Hongzhi |last6=Tong |first6=Xin|chapter=DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation |pages=1–12 |doi=10.1145/3641519.3657396 |arxiv=2402.11929 |isbn=979-8-4007-0525-0 }}), achieving state-of-the-art results in perceptual quality (LPIPS{{cite web | url=https://lightning.ai/docs/torchmetrics/stable/image/learned_perceptual_image_patch_similarity.html | title=Learned Perceptual Image Patch Similarity (LPIPS) — PyTorch-Metrics 1.7.2 documentation }} = 0.1025), while maintaining balanced performance in PSNR (23.72) and SSIM (0.8513). This PEML approach enables more stable and scalable illumination editing while being physically consistent, supporting applications in content creation and digital design.
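
The consistency principle described above follows from the linearity of light transport: an object's appearance under two light sources combined should equal the sum of its appearances under each source alone. The following sketch expresses this as a penalty term. It is a simplified illustration under stated assumptions, not the IC-Light implementation: the relighting functions are toy stand-ins for a learned model, the transport matrix is random, and all names are hypothetical.

```python
import numpy as np

def consistency_loss(relight, image, light_a, light_b):
    """Penalise deviation from linearity: relight(a + b) vs relight(a) + relight(b)."""
    merged = relight(image, light_a + light_b)
    blended = relight(image, light_a) + relight(image, light_b)
    return float(np.mean((merged - blended) ** 2))

rng = np.random.default_rng(2)
transport = rng.uniform(0.0, 1.0, size=(16, 4))  # 16 pixels, 4 light sources (toy)

def linear_relight(image, light):
    """Toy relighter that is linear in the lighting, as physical transport is."""
    return image * (transport @ light)

def nonlinear_relight(image, light):
    """Toy relighter that violates linearity in the lighting."""
    return image * np.sqrt(transport @ light)

image = rng.uniform(0.2, 1.0, size=16)   # albedo-like per-pixel values
light_a = rng.uniform(0.0, 1.0, size=4)
light_b = rng.uniform(0.0, 1.0, size=4)

loss_linear = consistency_loss(linear_relight, image, light_a, light_b)
loss_nonlinear = consistency_loss(nonlinear_relight, image, light_a, light_b)
print(f"linear model: {loss_linear:.2e}, nonlinear model: {loss_nonlinear:.2e}")
```

A model that respects linear light transport incurs a penalty close to zero, whereas one that does not is penalised, which is the behaviour a consistency term of this kind is intended to encourage during training.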

Challenges and Limitations

Despite its advantages, PEML faces several challenges that limit its scalability and general adoption. One major issue is the lack of standardised benchmarks for evaluating PEML models.{{Cite journal |last1=Meng |first1=Chuizheng |last2=Griesemer |first2=Sam |last3=Cao |first3=Defu |last4=Seo |first4=Sungyong |last5=Liu |first5=Yan |date=2025-05-07 |title=When physics meets machine learning: a survey of physics-informed machine learning |url=https://doi.org/10.1007/s44379-025-00016-0 |journal=Machine Learning for Computational Science and Engineering |language=en |volume=1 |issue=1 |pages=20 |doi=10.1007/s44379-025-00016-0 |issn=3005-1436}} Direct comparisons are often difficult, as the models integrate domain knowledge in different ways. Studies have noted that models with similar statistical accuracy may generalise differently when applied to new conditions. Recent literature has called for evaluation metrics that account for physical consistency and domain-specific performance, beyond traditional error metrics such as RMSE or MAE.{{Cite journal |last1=Hao |first1=Zhongkai |last2=Yao |first2=Jiachen |last3=Su |first3=Chang |last4=Su |first4=Hang |last5=Wang |first5=Ziao |last6=Lu |first6=Fanzhi |last7=Xia |first7=Zeyu |last8=Zhang |first8=Yichi |last9=Liu |first9=Songming |last10=Lu |first10=Lu |last11=Zhu |first11=Jun |date=2024-12-16 |title=PINNacle: A Comprehensive Benchmark of Physics-Informed Neural Networks for Solving PDEs |url=https://proceedings.neurips.cc/paper_files/paper/2024/hash/8c63299fb2820ef41cb05e2ff11836f5-Abstract-Datasets_and_Benchmarks_Track.html |journal=Advances in Neural Information Processing Systems |language=en |volume=37 |pages=76721–76774}}{{cite conference |last1=Takamoto |first1=Makoto |last2=Praditia |first2=Timothy |last3=Leiteritz |first3=Raphael |last4=MacKinlay |first4=Dan |last5=Alesiani |first5=Francesco |last6=Pflüger |first6=Dirk |last7=Niepert |first7=Mathias |date=2022-11-28 |title=PDEBENCH: an extensive benchmark for 
scientific machine learning |url=https://papers.nips.cc/paper_files/paper/2022/file/8fc731d94e060b5f1d5f1d56be0852d7-Paper-Conference.pdf |location=Red Hook, NY, USA |publisher=Curran Associates, Inc. |pages=1596–1611 |isbn=978-1-7138-7108-8 |book-title=Proceedings of the 36th International Conference on Neural Information Processing Systems (NeurIPS 2022)}}

Another key challenge is balancing the role of physics and data in PEML models. If the data-driven component is too flexible, it may overfit the training data and generate results that violate known physical principles. Conversely, if the physical constraints are applied too rigidly, the model may underfit and fail to capture important patterns in the data. The two objectives therefore trade off against each other along a Pareto front.{{Cite journal |last1=Rohrhofer |first1=Franz M. |last2=Posch |first2=Stefan |last3=Gößnitzer |first3=Clemens |last4=Geiger |first4=Bernhard C. |date=2023 |title=Data vs. Physics: The Apparent Pareto Front of Physics-Informed Neural Networks |url=https://ieeexplore.ieee.org/document/10210413 |journal=IEEE Access |volume=11 |pages=86252–86261 |doi=10.1109/ACCESS.2023.3302892 |arxiv=2105.00862 |bibcode=2023IEEEA..1186252R |issn=2169-3536}} For physics-encoded and hybrid modelling, there is also a risk that the machine learning component may override the physics-based component unless appropriately regulated. To manage this trade-off, researchers have investigated strategies such as incorporating physics-based regularisation terms,{{Cite thesis |last=Raymond |first=Christian |title=Meta-Learning Loss Functions for Deep Neural Networks |date=2025 |publisher=Victoria University of Wellington Library |doi=10.26686/wgtn.28448894 |url=https://doi.org/10.26686/wgtn.28448894}} and applying adaptive weighting schemes during training.{{Cite journal |last1=Maddu |first1=Suryanarayana |last2=Sturm |first2=Dominik |last3=Müller |first3=Christian L |last4=Sbalzarini |first4=Ivo F |date=2022-02-15 |title=Inverse Dirichlet weighting enables reliable training of physics informed neural networks |url=https://dx.doi.org/10.1088/2632-2153/ac3712 |journal=Machine Learning: Science and Technology |language=en |volume=3 |issue=1 |pages=015026 |doi=10.1088/2632-2153/ac3712 |arxiv=2107.00940 |bibcode=2022MLS&T...3a5026M |issn=2632-2153}}
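
The trade-off between fitting data and satisfying physics can be illustrated with a small, hedged example. The sketch below fits a polynomial to noisy data while penalising the residual of an assumed governing equation, u′ + u = 0, with a fixed weight λ. The data are deliberately generated from a different decay rate so that the two objectives conflict. This is a fixed-weight sweep for illustration only; it is not an adaptive scheme such as inverse Dirichlet weighting, and all names and values are hypothetical.

```python
import numpy as np

def fit_weighted(x_data, y_data, x_col, lam, degree=4):
    """Minimise mean data misfit + lam * mean physics residual of u' + u = 0."""
    powers = np.arange(degree + 1)
    A = x_data[:, None] ** powers            # polynomial basis at data points
    B = x_col[:, None] ** powers             # basis at collocation points
    dB = np.zeros_like(B)
    dB[:, 1:] = powers[1:] * x_col[:, None] ** (powers[1:] - 1)  # basis derivative
    R = dB + B                               # residual operator for u' + u
    # Both terms are quadratic in the coefficients, so solve one stacked system.
    lhs = np.vstack([A / np.sqrt(len(x_data)), np.sqrt(lam / len(x_col)) * R])
    rhs = np.concatenate([y_data / np.sqrt(len(x_data)), np.zeros(len(x_col))])
    coef, *_ = np.linalg.lstsq(lhs, rhs, rcond=None)
    data_loss = float(np.mean((A @ coef - y_data) ** 2))
    phys_loss = float(np.mean((R @ coef) ** 2))
    return coef, data_loss, phys_loss

rng = np.random.default_rng(4)
x_data = np.linspace(0.0, 1.0, 30)
# Data decay faster than the assumed physics predicts (deliberate mismatch).
y_data = np.exp(-2.0 * x_data) + rng.normal(0.0, 0.01, size=30)
x_col = np.linspace(0.0, 1.0, 50)

weights = [0.01, 1.0, 100.0]
results = [fit_weighted(x_data, y_data, x_col, lam) for lam in weights]
for lam, (_, d, p) in zip(weights, results):
    print(f"lambda={lam:g}: data loss={d:.2e}, physics loss={p:.2e}")
```

Increasing λ lowers the physics residual at the cost of a larger data misfit, so each value of λ selects a different point on the trade-off curve between the two objectives.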

Error sources in both the data and the physical models present significant challenges as well. These include incorrect modelling assumptions (for example, wrong constitutive laws), noisy or non-informative data, and model architecture choices that allow the data-driven and physics-based components to become imbalanced. To mitigate these risks, recent studies have proposed methods for the automatic detection and correction of such errors, along with uncertainty quantification techniques to flag unreliable or extrapolated predictions.
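
One generic way to flag extrapolated predictions is to train an ensemble of models on resampled data and treat the spread of their predictions as a rough uncertainty estimate. The sketch below uses a bootstrap ensemble of cubic polynomial fits on synthetic data. It is a simple illustration of the general idea rather than a method taken from any cited study, and the flagging threshold of 0.5 is an arbitrary assumption.

```python
import numpy as np

def bootstrap_predictions(x_train, y_train, x_query, n_models=200, degree=3, seed=0):
    """Return the mean and standard deviation of predictions across a bootstrap ensemble."""
    rng = np.random.default_rng(seed)
    preds = np.empty((n_models, len(x_query)))
    for i in range(n_models):
        # Resample the training set with replacement and refit.
        idx = rng.integers(0, len(x_train), size=len(x_train))
        coeffs = np.polyfit(x_train[idx], y_train[idx], deg=degree)
        preds[i] = np.polyval(coeffs, x_query)
    return preds.mean(axis=0), preds.std(axis=0)

rng = np.random.default_rng(3)
x_train = np.linspace(0.0, 1.0, 40)
y_train = np.sin(2.0 * np.pi * x_train) + rng.normal(0.0, 0.1, size=40)

# One query inside the training range and one far outside it.
x_query = np.array([0.5, 2.0])
mean, spread = bootstrap_predictions(x_train, y_train, x_query)

threshold = 0.5  # arbitrary illustrative cut-off
flagged = spread > threshold
print(f"ensemble spread: {spread}, flagged as unreliable: {flagged}")
```

The ensemble members agree closely inside the training range but diverge outside it, so the spread provides a simple signal that a prediction is being extrapolated.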

Scalability is another limitation. Many PEML techniques have been demonstrated on idealised or low-dimensional problems. Applying them to large-scale systems, such as multi-physics simulations or real-time control scenarios, remains computationally demanding. Techniques like domain decomposition, surrogate modelling, and reduced-order physics are used to mitigate this, though they often introduce additional approximation errors.{{Cite journal |last1=Chen |first1=Wenqian |last2=Wang |first2=Qian |last3=Hesthaven |first3=Jan S. |last4=Zhang |first4=Chuhua |date=2021-12-01 |title=Physics-informed machine learning for reduced-order modeling of nonlinear problems |url=https://ui.adsabs.harvard.edu/abs/2021JCoPh.44610666C/abstract |journal=Journal of Computational Physics |language=en |volume=446 |pages=110666 |doi=10.1016/j.jcp.2021.110666 |bibcode=2021JCoPh.44610666C |issn=0021-9991}}

Finally, interpretability and uncertainty quantification are still under active development. While PEML models are often more transparent than purely black-box approaches, interpreting the learned components (e.g. correction terms) is not always straightforward. Similarly, quantifying uncertainty in both the data and model parameters is critical for high-stakes applications, but current methods are still evolving.

See also

References