As the most important solvent, water has been at the center of interest since the advent of computer simulations. While early molecular dynamics and Monte Carlo simulations had to make use of simple model potentials to describe the atomic interactions, accurate ab initio molecular dynamics simulations relying on the first-principles calculation of the energies and forces have opened the way to predictive simulations of aqueous systems. Still, these simulations are very demanding, which prevents the study of complex systems and their properties. Modern machine learning potentials (MLPs) have now reached a mature state, allowing us to overcome these limitations by combining the high accuracy of electronic structure calculations with the efficiency of empirical force fields. In this Perspective, we give a concise overview about the progress made in the simulation of water and aqueous systems employing MLPs, starting from early work on free molecules and clusters via bulk liquid water to electrolyte solutions and solid–liquid interfaces.
INTRODUCTION
A large fraction of the surface of the Earth is covered by water and, still, some ice, giving our planet its distinctive blue color when viewed from space. Water is carried down deep into the Earth’s crust at subduction zones, influencing volcanism and plate tectonics, and in the atmosphere, in the form of vapor, liquid, or ice, water is a key climate factor from the troposphere up to the stratosphere and mesosphere. Down at the Earth’s surface, water shapes landscapes, provides the basis for life, and is central to many technologies that sustain humanity. Given its significance and abundance, it is no surprise that over the centuries, much research has been undertaken to understand the properties of water and their physical origin.
One of the central scientific questions addressed in water research is how the complex behavior of water, exhibiting many anomalies and a rich phase diagram, arises from the interactions of the chemically rather simple H2O molecules. Due to the limited temporal and spatial resolution of many experimental probes, much of what we know about water has been learned from computer simulations. Specifically, atomistic simulations have provided detailed insights into the directed network of hydrogen bonds between molecules that governs the structure and dynamics of water and its interaction with solutes and surfaces.1–4 Moreover, computer simulations have made it possible to investigate water under extreme conditions that are not accessible in experiments. For instance, simulations have been used to study water and ice under pressure and temperature conditions prevailing in the deep Earth5 and in the interiors of the giant planets Uranus and Neptune,6 as well as in the deeply supercooled state, the so-called no-man’s land, where crystallization occurs extremely quickly.7,8
Following the pioneering Monte Carlo (MC) simulations of Barker and Watts9 and molecular dynamics (MD) studies of Rahman and Stillinger1 in the late 1960s and early 1970s, respectively, many computer simulations of water and aqueous systems were carried out. Initially, these simulations were based on empirical potentials,10 but, later, they relied increasingly on forces and energies obtained from electronic structure calculations.11 In the empirical potential—or force field—approach, the functional form of the interaction potential is constructed to capture the main physical interactions between molecules, with parameters adapted to reproduce some experimentally known quantities and/or quantum mechanical reference data.
Since the first water models for MD simulations were proposed more than half a century ago,12–14 a vast number of empirical potentials were developed for water and ice, ranging from simple forms based on pair interactions to sophisticated many-body potentials, including polarization and charge transfer.10,15–21 Despite their often simple functional form, these models have been remarkably successful in capturing the key properties of water across the phase diagram.22,23 Modeling chemical reactivity, however, has proven difficult using empirical potentials, and with a few exceptions,24–27 empirical water models are usually non-reactive. That is, they lack the capability to represent the dissociation and recombination of water molecules, which is, for example, essential for describing the famous Grotthuss mechanism of proton transport.28 The inclusion of proton transfer events is not only central for simulations of acids and bases but is also of utmost importance in countless chemical reactions with water involved as a reactant, from the hydrolysis of biomolecules to electrochemical water splitting. Nevertheless, to this day, empirical force fields have been used for the vast majority of simulations involving water, particularly in studies of biological macromolecules, where aqueous solvation effects are of crucial importance.18
A more fundamental route to the computer simulation of water relies on determining interactions from first principles, i.e., ab initio by solving the electronic Schrödinger equation. The first ab initio molecular dynamics (AIMD) simulations of liquid water were carried out 30 years ago based on density functional theory (DFT). Since then, ab initio methods have been used extensively to study water11,29 and, more generally, aqueous systems.30 While, in principle, ab initio approaches have the potential for truly predictive simulations and also provide access to electronic properties, currently available approximate methods are still somewhat limited.31 For instance, predicting the equilibrium density of liquid water has been proven difficult within DFT based on standard exchange correlation functionals, and only the inclusion of dispersion forces produces satisfactory results.30,32 Moreover, some attempts to study water using methods beyond DFT have been made, e.g., using MP2-based simulations33 or employing the random phase approximation (RPA).34
Compared to empirical force fields, ab initio approaches are computationally more expensive by many orders of magnitude, severely limiting the accessible system size and simulation times. Hence, many processes of interest occurring in aqueous systems, for instance freezing, glassy dynamics, the solvation of complex interfaces and biopolymers, as well as complex chemical reactions, are far beyond the capabilities of current ab initio simulations. Therefore, in order to transfer the reliability of ab initio methods to more complex systems, in the past two decades, considerable effort has been made to develop efficient but accurate potential energy surfaces (PESs) based on systematic and flexible functional forms.35–38 In particular, the growing use of machine learning techniques, such as artificial neural networks (ANNs), and Gaussian processes for the construction of highly accurate and efficient machine learning potentials (MLPs) has revolutionized—and continues to revolutionize—the field of atomistic computer simulations.39–53 These methodological advances in modern MLPs make it now possible to predict even complex properties of condensed systems from first principles, opening up exciting new possibilities in chemistry, materials science, and related disciplines.
Consequently, in recent years, atomistic modeling based on MLPs has also been increasingly applied to study water and, more generally, aqueous systems. The field has been evolving rapidly, in terms of both the underlying methodology and the complexity of systems that can be addressed, from small water clusters in vacuum via bulk water and its phase diagram to solution chemistry and processes at solid–liquid interfaces (cf. Fig. 1).
Schematic overview of some important applications of MLPs to aqueous systems with increasing complexity (from bottom to top): neutral and protonated water clusters, liquid water and its interface with ice, liquid water–vapor interface, electrolyte solutions, and solid–water interfaces.
Schematic overview of some important applications of MLPs to aqueous systems with increasing complexity (from bottom to top): neutral and protonated water clusters, liquid water and its interface with ice, liquid water–vapor interface, electrolyte solutions, and solid–water interfaces.
In this Perspective, we first provide a brief overview of the methodological basis and the current status of MLPs. In doing so, we focus on those types of MLPs that have been most frequently employed in simulations of systems involving water. In the subsequent sections, we discuss different applications of MLPs to aqueous systems. While it is not the goal of this Perspective to provide a comprehensive review covering all MLP-based studies of water and related systems exhaustively, we discuss a broad range of representative applications and point out future research directions to demonstrate the power and versatility of MLPs for the study of water and aqueous systems.
METHODOLOGY
In recent years, machine learning potentials have become an increasingly important tool for atomistic simulations of complex systems in chemistry, physics, and materials science. As a consequence, the development of MLPs is a very active topic of research, and here, we will restrict our discussion to a concise overview about the current status of the field. Readers interested in further details are referred to a very large number of reviews covering all aspects of the methodology of MLPs.39–54
MLPs offer many advantages, like an excellent numerical agreement with the underlying electronic structure reference method, resulting in typical energy errors of only about 1 meV/atom and force errors in the order of 100 meV/Å. These errors are significantly smaller than the uncertainty, e.g., due to the choice of the exchange correlation functional in DFT, and thus, replacing electronic structure calculations by MLPs does only marginally affect the accuracy of the simulations. Moreover, MLPs can describe the making and breaking of chemical bonds and provide a rather high computational efficiency enabling simulations of systems containing many thousands of atoms. Still, they are usually about one to two orders of magnitude slower to compute than simple classical force fields. A main advantage of MLPs is their ability to transfer the predictive capabilities of ab initio approaches to large systems, which are required for aqueous systems to (1) achieve proper sampling of the liquid, (2) ensure bulk-like water properties at greater distances from solid surfaces, and (3) prevent artificially high electrolyte concentrations and/or artificial periodicity that would be present in relatively small simulation cells directly accessible by electronic structure calculations.
First, MLPs have been introduced more than a quarter of a century ago by Blank et al.,55 who suggested to use a feed-forward neural network (NN) to represent the interactions between diatomic molecules and solid surfaces. A general limitation of this first generation of MLPs, which has been further explored by numerous groups for about a decade for different types of systems, has been the limitation to a few atomic degrees of freedom only, restricting the applicability to small molecules in vacuum or small molecules interacting with frozen surfaces, as summarized in some early reviews.39,40 The major challenge for extending this method to high-dimensional condensed systems, such as liquid water, at that time has been the lack of suitable structural input descriptors for the machine learning algorithms, which ensure the imperative translational, rotational, and permutational invariances of the potential energy surface (PES). Only for some applications, system-specific approximate solutions could be derived when neglecting less important degrees of freedom of the system.56–58 In parallel to these efforts in the development of early neural network-based MLPs, in pioneering work, Braams and Bowman35,59 introduced permutation invariant polynomials (PIPs), which are closely related and enable the construction of very accurate PESs by linear regression based on symmetrized polynomials as basis functions. While PIPs do not employ traditional machine learning algorithms, such as neural networks, they show a similar flexibility and include all invariances exactly, which has been a frustrating challenge for early MLPs employing non-linear models. Still, PIPs share with early MLPs the restriction to small systems with a very limited number of degrees of freedom. Over the years, PIPs have enabled the construction of very accurate potentials and have been applied successfully, for example, to vibrations and reaction dynamics of small molecules in vacuum as well as a variety of water clusters.35
The energetically relevant local environment determining the Ei is defined by a cutoff radius Rc such that all interactions beyond this radius, which is typically chosen between 5 and 10 Å, are not explicitly included. While the ansatz of Eq. (1) has been used in many empirical potentials for a long time, the introduction of second-generation MLPs has only become possible by the development of many-body descriptors with full translational, rotational, and permutational invariance. In the case of HDNNPs, atom-centered symmetry functions (ACSFs)64 are most frequently used for this purpose, but at present, a wide range of alternative descriptors is available and employed in different types of MLPs.65–68 All these descriptors represent structural fingerprints of the local atomic environments and serve as an input for the machine learning algorithms, which then construct the functional relation between the atomic environments and the atomic energies.
In the case of HDNNPs, which are often used for simulations of water and aqueous systems, there is one feed-forward NN to be parameterized per element, which is then evaluated as many times as atoms of the respective element are present in the system. Closely related is ANI (Accurate neural network engine for molecular energies ANAKIN-ME, ANI for short), which is a HDNNP with modified angular ACSFs69 aiming for transferability across a wide range of organic molecules. Another NN-based MLP that is very frequently used for simulations of water is Deep Potential Molecular Dynamics (DeePMD),70–72 which employs a local atomic coordinate system and descriptors in this reference frame as an input for the atomic NNs. Many other second-generation MLPs containing NNs have been proposed,73,74 and in recent years also NN potentials learning descriptors as part of the training process employing message passing75 have been put forward.76–82 Beyond neural networks, Gaussian approximation potentials (GAPs)83 combined with the SOAP (Smooth Overlap of Atomic Positions) descriptor65 are among the most frequently used MLPs, with a few applications to aqueous systems reported to date. Many other second-generation MLPs are available in the literature, which can be expected to be used for systems containing water in the future.68,84,85
An obvious limitation of second-generation MLPs is the truncation of atomic interactions at the cutoff radius. However, in many aqueous systems, long-range electrostatic interactions play an important role.86 These are explicitly considered in third-generation MLPs, which include electrostatic interactions employing environment-dependent charges represented by machine learning models. Already in 2007, Popelier and co-workers showed that it is possible to construct electrostatic multipoles using neural networks87 and Gaussian processes88 to improve the description of electrostatics in classical force fields, and, in addition, applications to water clusters have been reported.89
In 2011, HDNNPs of the third generation were proposed by introducing a second set of atomic neural networks providing atomic partial charges trained to DFT reference data.90,91 From these charges, the electrostatic energy can be computed and combined with the short-range expression of Eq. (1) to yield the total energy of the system. By training the short-range part to represent only the energy component not covered by electrostatics, double counting of energy contributions can be avoided. Further MLPs, including long-range electrostatics, are, for example, the HDNNP TensorMol,92 the message passing network PhysNet,93 and many others.94–98 At present, the machine learning representation of atomic partial charges and electrostatic multipoles is a very active field of research, opening many routes to the construction of third-generation MLPs.
A remaining limitation of third-generation MLPs is the locality of the atomic charges, which does not allow us to describe systems exhibiting long-range charge transfer and other non-local dependencies between the geometric and electronic structures.52 These phenomena can be considered in fourth-generation MLPs. The first MLP of this generation has been the charge equilibration neural network (CENT) technique proposed by Ghasemi et al. in 2015.99 Since the introduction of CENT, which employs a global charge equilibration step100 and is intended for applications to ionic materials, several other fourth-generation MLPs have emerged, such as Becke population neural networks (BpopNNs),101 fourth-generation HDNNPs (4G-HDNNPs),102 and charge recursive NNs (QRNNs).103 To date, fourth-generation MLPs have not been extensively applied to water but offer new interesting possibilities for studies of complex systems.
A key aspect in the training of MLPs54 is the construction of suitable datasets covering the structures that are visited in the intended simulations, as MLPs often show a strongly reduced accuracy when extrapolating beyond the known part of configuration space. The size and composition of these datasets depend on the systems of interest, but due to the high flexibility of MLPs, often energies and forces of 10 000 or more electronic structure calculations are required for training reliable potentials. For a systematic and unbiased determination of these structures, often various forms of active learning are employed.104–107
APPLICATIONS TO AQUEOUS SYSTEMS
Neutral and protonated water clusters
Water clusters have received considerable attention already during the advent of MLPs as important benchmark systems. Even for small water clusters, there is a large structural variety with many energetically close local minima, posing a significant challenge for potential development. At the same time, their moderate size allows us to perform accurate high-level electronic structure reference calculations.
Early first-generation MLPs for water clusters include an MP2-based six-dimensional PES for the water dimer with frozen monomer geometries reported in 1997 that made use of a single feed-forward NN.108 In 2006, a very accurate three-dimensional NN potential for the water monomer with a root mean squared error (RMSE) of only 1 cm−1 (about 0.1 meV for the molecular potential energy) was published,109 and, in addition, a NN potential for the same systems focusing on the permutation invariance of the PES was reported in 2012.110 From 2005 onward, water clusters have also been investigated in great detail using permutation invariant polynomials reaching MP2 and coupled cluster accuracies.111–122
To address larger water clusters, second-generation HDNNPs have been developed based on DFT data for a series of neutral clusters up to the decamer123,124 as well as for several protonated clusters.125 HDNNPs have also been applied to very large clusters containing hundreds of molecules.126 Moreover, nuclear quantum effects (NQEs) in neutral and protonated water clusters have been studied in recent years using very accurate HDNNPs trained to coupled cluster data.104,127–131
Starting in 2009, NNs were employed using water clusters as a test bed to find ways of improving the description of Coulomb interactions in classical force fields by learning environment-dependent electrostatic multipole moments.132 A comparison of NNs and Gaussian process regression (GPR) for the representation of multipoles found these methods to be similar in terms of accuracy and costs,89 resulting in an extension of the work to a water molecule embedded in a water decamer.133 Finally, this approach has been developed further to the FFLUX water model and applied to larger water clusters.134 In related work, GPR has been employed in a similar way to express charges in water molecules.95
A first MLP of the third generation expressing the full energy, i.e., electrostatics and short-range bonding, of the system by machine learning has been a third-generation HDNNP for the water dimer reported in 2012,91 which includes electrostatic interactions based on environment-dependent atomic partial charges expressed by a second set of atomic NNs. Another example in which electrostatics, as well as van der Waals interactions, were implemented on top of a short-range HDNNP for water clusters is TensorMol published in 2018.92 Moreover, small clusters have been used as a benchmark for the inclusion of long-range interactions in DeePMD.97
Finally, machine learning was used to learn tensorial properties using water clusters as a benchmark,135,136 and, in addition, the effect of noise on training HDNNPs for possible future applications on quantum computers has been tested for this system.137 Beyond NNs and GPR, also Support Vector Regression (SVR) has been explored for water clusters.138 Although not directly used in the development of high-dimensional PESs, the SVR method as well as random forest and Gaussian regression has also been used in specific applications, such as the prediction of the electron correlation energy in the water monomer and dimer.139 Beyond atomistic potentials, machine learning methods have also been used in various forms in combination with electronic structure concepts and methods, such as coupled cluster theory with single, double and perturbative triple excitations [CCSD(T)] and variational quantum Monte Carlo calculations with benchmarks for monomers and small clusters.140–146
Bulk water and ice
Of all aqueous systems, the construction of MLPs for bulk liquid water has received most attention not only due to its crucial importance as a solvent for a wide range of (bio)chemical reactions, but also because of its remarkable properties making water one of the most challenging benchmarks for the construction of interatomic potentials for molecular systems.
Due to the limited dimensionality that could be dealt with at that time, the first application of machine learning for the simulation of pure liquid water in 2002147 aimed for the inclusion of polarization in the TIP4P water model.148 For this purpose, a feed-forward neural network was used to represent the many-body interactions in dimers of rigid water molecules. This water model, named T4NN, was trained with MP2 reference data and was then used in Monte Carlo simulations to determine a range of properties of water, such as its density, heat capacity, and radial distribution functions. Overall, the agreement of many properties with experiment under standard conditions could be significantly improved with respect to the underlying TIP4P model, while the transferability to different temperatures posed a challenge.
In 2013, GAPs were used to enhance the accuracy of DFT-based ab initio MD simulations by introducing one- and two-body corrections trained on data of water monomers and dimers at the CCSD(T) level.149 Their application to water clusters and MD simulations of liquid water showed an improved overall potential energy surface, but additional corrections beyond two-body terms were necessary for further improvements, as also demonstrated by comparison with quantum Monte Carlo data.150
The first full-dimensional MLP for bulk liquid water and ice, which did not rely on a force-field or a DFT baseline potential, was reported in 2016.151 In this work, a series of HDNNPs were trained using DFT reference data obtained for different generalized gradient approximation (GGA) functionals, with and without dispersion corrections. This allowed us, for the first time, to benchmark the quality of common DFT functionals in the description of computationally demanding properties of water, such as the density anomaly, the melting temperature, the viscosity, and the dielectric constant. The calculation of such quantities requires extensive simulations of large systems, which are prohibitively expensive using ab initio MD directly, but become affordable with HDNNPs. In this work, particular attention was devoted to study the effect of van der Waals interactions, which were shown to govern the flexibility of the hydrogen bond network and, hence, play a crucial role in determining the properties of water and ice. In fact, if van der Waals forces are neglected, the density maximum of water disappears and ice becomes denser than liquid water. In follow-up work, HDNNPs based on BLYP-D3 and RPBE-D3 data were used to study the density anomaly of water at negative pressures (for both functionals)152 and the kinetics of the ice–water interface (only for RPBE-D3).153
Shortly after the first full-dimensional MLP for water, a HDNNP trained with B3LYP + D3 data was used in conjunction with path integral MD simulations to study nuclear quantum effects (NQEs) of liquid water close to the triple point.154 Since then, several other studies of nuclear quantum fluctuations in bulk liquid water and ice have followed,155–158 including a thermodynamic stability analysis of liquid water as well as hexagonal and cubic ice employing hybrid DFT data,159 and, more recently, a study of NQEs of liquid water based on the random phase approximation.160
The past few years have witnessed a significant expansion in the use of MLPs for bulk water and ice. This growth encompasses many applications but also the development of methods, tools, and extensive benchmarking. For instance, the vibrational spectroscopy features of liquid water were extensively studied over the full frequency spectrum taking into account the effect of temperature and overcoordinated hydrogen-bond environments employing a HDNNP based on revPBE-D3.161,162 Embedded atom neural networks (EANNs) have been used to represent tensorial properties in water describing vibrational features with the revPBE0-D3 functional.163 Moreover, using water as a test example, DeePMD was introduced in 2018,71 which, alongside HDNNPs, emerged as one of the principal methods for modeling water using MLPs. DeePMD has also been coupled with empirical force fields164 and employed to develop coarse-grained water models.165 Some applications of DeePMD include the analysis of hydrogen bond dynamics in supercritical water,166 the comparison of light and heavy water to assess isotope effects,167,168 and the calculation of vibrational densities of states.169
Crucial for the computation of vibrational features is the ability to determine the electronic polarizability tensor. In recent work, DeePMD was combined with an additional deep neural network to learn the environmental dependence of the polarizability tensor.170 As demonstrated using the SCAN (Strongly-Constrained and Appropriately-Normed) functional as a reference, this approach yields accurate Raman spectra of liquid water. Furthermore, DeePMD has been integrated with a deep neural network trained to predict Wannier centers based on local environments. This approach allowed us to compute infrared spectra171,172 and to determine the temperature dependence of the dielectric constant.173 The methods can also be extended to account for quadrupole moments.174 Recently, the description of tensorial properties, such as the polarizability tensor, has been fitted to MD simulations a posteriori using equivariant neural networks to describe infrared spectra.175 Moreover, a combination of HDNNP and GPR has allowed us to model the hyper-Raman spectra of water, which helped us understand the differences in the OH stretch mode between infrared and Raman spectra.176
The accuracy and computational efficiency of MLPs have made it possible to accurately determine the thermodynamic properties of water, including its phase diagram. For instance, the thermodynamic properties of water have been investigated with DeePMD based on SCAN177,178 and with a HDNNP based on revPBE0-D3.159,179 The phase behavior of water under extreme conditions expected in a planetary environment was also studied employing a HDNNP.180 Other aspects addressed with MLPs include the study of heat transport181,182 and the viscosity.183 More recently, the thermodynamics of water has been investigated with a neuroevolution potential.184
Studying the mechanism and kinetics of phase transitions is computationally very demanding and thus completely out of reach for ab initio simulations. MLPs, however, can be used to simulate systems of millions of water molecules with ab initio accuracy185 such that the simulation of phase transitions is now possible. One example is a recent investigation of the homogeneous nucleation of ice in supercooled water studied using DeePMD, in combination with the seeding methods,186,187 in a system of hundreds of thousands of water molecules.188 These calculations yielded nucleation rates consistent with experimental measurements. Very recently, advanced sampling techniques covering 36 µs of total simulation time have been used to probe the atomic structure of the critical nucleus.189 In addition, the liquid–liquid transition in supercooled water has been investigated. An initial attempt using DeePMD based on the SCAN functional found indications of this transition through anomalies in thermodynamic response functions.190 Two years later, the existence of this transition was conclusively demonstrated191 and its relation with the melting curves of ice polymorphs has been investigated.192 Finally, building on previous studies, the transition between ice Ih and its proton-ordered counterpart ice XI, mediated by ionic defects, has been studied based on the DeePMD model.193
Since modern MLPs can capture reactions, important processes such as proton transfer and autoionization are accessible. In this regard, HDNNPs have allowed us to describe the transport of hydronium and hydroxide ions, including nuclear quantum effects194 and the free energetics and mechanics of water dissociation,195 allowing us to compute the equilibrium pKw of water.196
The application of MLPs has been facilitated by careful benchmarking and transferability studies and the development of new ML-based methods and workflows. Some of these ML methods have been used beyond the fitting of potential energy surfaces as is the case for ML classifiers of phases197,198 and dynamical processes.199 One particular interesting finding regarding the transferability of MLPs is that liquid structures already contain the relevant information required to reproduce ice phases,200 including even ice–water interfaces. In particular, this was observed in studies of homogeneous nucleation, in which an empirical potential was benchmarked against its MLP representation containing only liquid structures.201 The performance of different density functionals [Perdew–Burke–Ernzerhof (PBE), SCAN, vdW-cx, and optB88-vdW] for modeling water and ice has been compared.202 Moreover, HDNNP and GPR trained on the same dataset have been shown to be equivalent when compared over different thermodynamic properties of liquid water although HDNNPs seem to be more demanding in terms of the required training data203 but are computationally more efficient. In fact, the role of the training data has been the focus in other studies.204 Furthermore, graph neural networks (GNNs), which do not require predefined structural descriptors, have been applied to accelerate molecular dynamics simulations.205 The selection of descriptors has also been automatized for HDNNPs,206,207 whereas GPR-based potentials have been employed in on-the-fly learning workflows.204,208
Empirical potentials have also benefited from the development of MLPs. Coarse-grained MLPs emerged,165,209–213 including an approach based on equivariant neural networks,214 and empirical force fields were parameterized using ML algorithms.215–218 Moreover, the addition of polarization to empirical force fields has been revisited, including charge transfer.219 DeePMD has been used to fit an accurate but costly many-body potential,220 reducing its computational cost by one order of magnitude.221 Moreover, a GNN has been applied to estimate Bayesian uncertainty in molecular dynamics simulations based on an empirical potential.222
Further progress has come again from water clusters. HDNNPs, PIP-based potentials, and GAPs have been shown to be equivalent in representing many-body interactions in water clusters,223 which can be employed in the construction of improved water potentials for bulk water. In fact, recent advances suggest that reference data obtained exclusively for water clusters could be sufficient to train accurate MLPs even for the bulk liquid phase,224–228 including results from Gaussian-moment neural networks (GMNNs).228 Recently, even gold-standard CCSD(T)-level accuracy for bulk water potentials has been reached by training to large clusters or periodic structures.121,229,230
An important current topic of research is the inclusion of long-range interactions,86 which are not explicitly considered in many MLPs. This problem has been addressed by introducing non-local representations of the system remapped as local and equivariant feature vectors, capturing non-local and non-additive effects.231 Another approach to treat this issue is to learn the long-range response with a self-consistent field neural network, which has been shown to produce correct long-range polarization correlations in liquid water, as well as the correct response of liquid water to external electrostatic fields.232,234
Architectures like equivariant neural networks have been combined with empirical electrostatics and dispersion.235 Such models are highly accurate in learning reference datasets,80,81,235–238 and their adoption is growing rapidly.
Liquid/vapor interface
Neural network potentials have also been used to investigate the structure, thermodynamics, and spectroscopic properties of the liquid/vapor interface. As the local environments close to the interface are highly anisotropic and thus very different from the bulk, it is important that the training set explicitly includes data for interface configurations.239 The structure of such configurations has been analyzed in detail using SOAP descriptors and local order parameters.240 Investigating the structure of the interface reveals the prevalence of orientations with the dipole moment roughly parallel to the surface with one OH bond pointing out of it,239 corroborating insights gained from sum frequency generation (SFG) measurements.241 By using the surface-sensitive velocity autocorrelation function,242 such SFG spectra of the liquid/vapor interface were calculated from path integral molecular dynamics based on a HDNNP trained at the revPBE0+D3 level.243 Recently, SFG spectra have been computed fully from first principles using a HDNNP combined with GPR.244
In another study, it was found that a DeePMD potential relying only on local atomic energies can be applied to the liquid/vapor interface.245 However, the explicit inclusion of long-range interactions was shown to be beneficial, confirming the results of previous studies carried out for empirical potentials.246 The effect of long-range interactions was tested for a water molecule moving away from the liquid/vapor interface using an extension of DeePMD, including long-range electrostatics.97 The case of curved liquid/vapor interfaces has been addressed as well. For instance, DeePMD has been employed to investigate the formation of bubbles in metastable water.247 Furthermore, it was shown that the free energy of water dissociation at the liquid/vapor interface of droplets and films deviates from the bulk, leading to an enrichment of hydronium cations at the interface and a depletion of hydroxide anions.248
Electrolyte solutions
Beyond pure water, MLPs have been used in numerous simulations of electrolyte solutions.249 Already in 1998, a feed-forward neural network was employed to represent the three-body interaction energies in H2O––H2O clusters with the aim to improve the force field description of ions dissolved in bulk water.56 This work represents an important milestone in the incorporation of permutation symmetry in structural descriptors. Later, HDNNPs have enabled the construction of full-dimensional DFT-quality PESs for aqueous NaOH solutions over the entire solubility range.250 In this work, it has been found that as the NaOH concentration increases, the primary mechanism for proton transfer shifts from being acceptor-driven, influenced by the pre-solvation of hydroxide ions, to donor-driven, controlled by the pre-solvation of water molecules. In addition, with increasing concentration, octahedral coordination geometries become less favored, in contrast to trigonal prism geometries.251 A novel water exchange mechanism has been identified around Na+(aq) ions in basic (high pH) solutions.252 Studies comparing classical and ring-polymer molecular dynamics based on the HDNNP revealed that nuclear quantum effects significantly reduce proton transfer barriers, thus increasing proton transfer rates. This leads to an enhanced diffusion coefficient, especially for OH−, and a shorter mean residence time of molecules in the first hydration shell around Na+ at high NaOH concentrations.253 Moreover, elevated temperatures in concentrated NaOH solutions amplify both the contributions of proton transfer to ionic conductivity and deviations from the Nernst–Einstein relation.254 Further applications of HDNNPs include investigations of fluoride and sulfate ions in the solution.107 Employing similar methodologies and training on revPBE + D3 data, the dissolution mechanisms of NaCl in water have also been addressed.255 Another example is the use of HDNNPs to study zinc ion hydration in water,256 with molecular dynamics simulations matching both the experimentally observed zinc–water radial distribution function and the x-ray absorption near edge structure spectrum. Moreover, HDNNP-based studies reveal the impact of surface stratification on the interfacial water structure in electrolyte solutions.257 Equivariant neural network potentials have also been employed to study various electrolyte solutions,258 including aqueous lithium chloride259 and aqueous sodium chloride.260 A genetic algorithm has also been utilized to study hydrated zinc(II) ion clusters.261 In addition, microhydrated sodium ions with a few water molecules have been studied for both the potential energy and the dipole moment employing PIPs.262
DeePMD potentials have been used to study sodium chloride, potassium chloride, and sodium bromide at various concentrations.263 These studies revealed that the structural changes due to the ions are confined to the immediate vicinity of the ions, where they disrupt the network of hydrogen bonds. Beyond these regions, the distribution of oxygen atoms relative to one another remains largely unchanged compared to pure water. In a related study, the dielectric permittivity of sodium chloride solutions has also been investigated.264 Using DeePMD potentials, the uptake of N2O5 into aqueous aerosols has been examined, a process that is challenging to study experimentally due to the fast reaction kinetics of N2O5.265 Furthermore, the diffusivity of water in aqueous cesium iodide and sodium chloride solutions has been examined using a DeePMD framework trained on DFT data using the revPBE-D3 functional.266 Such simulations addressing the characteristic behavior of different ions are not readily accessible through traditional force field-based molecular dynamics simulations due to less ion-specific description of ion–water interactions.
Water–solid interfaces
Solid–liquid interfaces are of high interest for catalysis and electrochemistry. Due to the very different bonding in liquid water and in crystalline surfaces, such as metals or oxides, constructing unified atomistic potentials that can describe all subsystems of solid–liquid interfaces with balanced high accuracy presents a substantial challenge for empirical potentials. Moreover, in many cases, water is not only in contact with the surface but can also dissociate and recombine at a much higher rate than in the bulk liquid. Consequently, the use of reactive potentials, which can describe the making and breaking of bonds, is mandatory. MLPs are ideally suited for this purpose.
In 2014, a HDNNP for a thin water layer on top of 55-atom CuAu alloy clusters with varying stoichiometries and a slab model were reported to study the effect of water on the stability of different interface compositions by Monte Carlo simulations.267 While this work exhibited a still rather large error of about 12 meV/atom, the accuracy of HDNNPs for solid–liquid interfaces has significantly improved in the following years. For instance, in studies of water at various surfaces of copper268,269 and zinc oxide270,271 energy, RMSEs of less than 1 meV/atom could be reached. Detailed convergence tests with respect to the required system size were carried out,268 showing that the diameter of the liquid phase between the slab surfaces needed to decouple the two surfaces by bulk-like water is at least about 35–40 Å.268 Such a size is beyond reach in ab initio molecular dynamics but necessary to ensure that the central water molecules have bulk-like environments in their local vicinity. This consideration is crucial to ensure that these molecules do not experience any significant influence from the altered water structure near the interfaces or from the surfaces directly.
While it has been found using HDNNPs that water does not spontaneously react with defect-free surfaces of certain metals, such as copper, on nanosecond time scales,268 fast dissociation and recombination processes leading to the formation of surface hydroxides have been observed on zinc oxide surfaces.270,271 These processes are often governed by the surrounding hydrogen bond networks, significantly influencing the free energy barriers of proton transfer processes.270 This phenomenon has been confirmed for water at TiO2 surfaces using DeePMD potentials,272,273 which have also been employed to explore the impact of slab thickness.274 Furthermore, the use of HDNNPs for computing anharmonic frequencies has been suggested as a method to elucidate the role of hydrogen bonds in surface processes.275
Depending on the specific surface geometry, proton transfer events can lead to various topologies of proton transport networks along the surface, which can be either one-dimensional or two-dimensional. This has been demonstrated for several surfaces of zinc oxide276 and the lithium intercalation compound LiMn2O4277 using HDNNPs. Surface defects, often stabilized by solvation compared to the vacuum interface, have also been a subject of study. The mobility of adatoms has been found to significantly vary across different low-index surfaces of copper.269 Investigations into the defective Zr7O8N4/H2O and pristine ZrO2/H2O interfaces using neural network potentials278 revealed a bilayer water structure for Zr7O8N4 and a monolayer structure for ZrO2. Oxygen vacancies on the Zr7O8N4 surface have been suggested as active sites for the oxygen reduction reaction. Furthermore, neural networks have been used to identify different oxidation states of transition metal ions at oxide-water–interfaces,277 which enables the characterization of electronic structures relevant for catalytic applications.
Due to its importance in catalysis, the TiO2–water interface has so far been the most intensely studied interface using MLPs.107,272–274,279–282 Investigations into the water coverage on the anatase (101) TiO2 surface using the DeePMD potential279 have shown that higher water coverage prompts significant reorganization of the water monolayer at O2c sites, leading to the formation of a two-dimensional hydrogen bond network with closely linked pairs of water molecules on neighboring TiO5c and O2c sites. Other DeePMD-based studies have examined the impact of water dissociation on thermal transport at the TiO2–water interface.280 While previous research on TiO2–water interfaces mainly focused on the anatase (101) and rutile (110) TiO2 surfaces, recent MLP-based studies281 have explored seven different TiO2 surfaces using three distinct functionals: SCAN, PBE, and optB88-vdW. These studies found that water dissociation is more likely on the anatase (100), anatase (110), rutile (001), and rutile (011) surfaces, while molecular adsorption is the primary process on the anatase (101) and rutile (100) surfaces. Moreover, simulations for rutile (110) showed that the slab thickness significantly influences the results, with thicker slabs favoring molecular adsorption. DeePMD has also been used to study amorphous TiO2 (a-TiO2) to compare its behavior with well-studied crystalline TiO2 at aqueous interfaces.282 These studies demonstrated that water molecules on the a-TiO2 surface do not exhibit the distinct layering typical of the aqueous interface of crystalline TiO2. This difference results in an approximately tenfold increase in water diffusion speed at the interface.
Other cases of employing MLPs to study solid–water interfaces include the use of HDNNPs for the Pt(111)–water interface to investigate the interaction between water and hydroxylated metal surfaces283,284 and for the hematite–water interface,285 revealing solvation dynamics at various time scales. DeePMD has been utilized for studying the TiS2/water interface286 to examine the influence of TiS2 surface termination on the structure of interfacial water. Moreover, DeePMD has been applied to the IrO2–water interface, exploring the hydration structure, proton transfer mechanisms, and acid–base characteristics,287 as well as to the GaP(110)–water interface,288 which has been shown to require about 12 ns to reach equilibrium, a duration not achievable with traditional AIMD simulations. DeePMD has also been used for the construction of a potential aimed at studying ice nucleation at the microcline feldspar surface using the SCAN functional289 and for investigating the impact of water dissociation on thermal transport at the Cu–water interface.290 Apart from HDNNPs and DeePMD, an equivariant graph neural network has been employed to study the oxygen reduction reaction at the Au(100)–water interface,291 and on-the-fly learning kernel-based regression has been applied to investigate water adsorption on MgO and Fe3O4 surfaces, including surface reconstructions.292
Confined water, which exhibits properties notably differing from bulk water, has also been studied using MLPs. Some examples include HDNNPs for water confined between two-dimensional boron nitride sheets293 and MoS2107 and water between graphite layers using DeePMD294,295 and committees of HDNNPs;296 the latter method was also applied to confinement within a graphene-like material.297 Furthermore, MD simulations have been used to investigate water in single-walled carbon and boron nitride nanotubes,107,298 finding a fivefold reduction in friction in carbon tubes compared to boron nitride, attributed to strong hydrogen–nitrogen interactions.298 Ion concentration profiles under nanoconfinement299 have also been studied using neural network potentials, focusing on the effects of channel widths, ion molarity, and ion types.
Other systems
Apart from studies of pure water, electrolytes, and solid–liquid interfaces, which have been in the focus of MLP-based atomistic simulations for several years, the use of MLPs for aqueous systems has increasingly diversified and now covers essentially all fields of simulations involving water. An exhaustive coverage of the related literature is beyond the scope of this Perspective, and in this section, we just point the interested readers to several typical applications of MLPs in these rapidly growing fields.
A prominent use of atomistic simulations is to study chemical reactions of organic molecules in solution. Examples for the application of MLPs are corrections to quantum mechanical/molecular mechanical (QM/MM) simulations of SN2 reactions in water,300 the solvation of protein fragments,93 the decomposition of urea in water,301 the computation of free energy profiles of reactions of organic molecules,302 and enzyme reactions.303 Further studies include the quantum dynamics of an electron solvated in water304 and the excited state of CH3NNCH3 surrounded by several water molecules.305
Still, the all-atom description of chemical processes in solution can be demanding, and, consequently, simplified MLPs have been proposed as well. For example, the solvated alanine dipeptide306,307 and the folding/unfolding of chignolin306,308 have been studied using the coarse-grained CGNet potential. Moreover, a DeepPot-SE model has been used to describe molecules under the influence of an implicit solvent.309
Additional applications of MLPs for aqueous systems include the study of diffusion in hydrogen hydrates,310 the determination of vibrational frequency shifts in formic acid C=O stretching and C=N stretching of MeCN in water,311 retinoic acid in water,312 graph-convolutional neural networks for benchmarking sets of solutes and chemical reactions in water,313 hydration dynamics and IR spectroscopy of 4-fluorophenol,314 zinc protein studies,315 conformational shifts of stacked heteroaromatics,316 and solvation free energy prediction of organic molecules in redox flow batteries.317
DISCUSSION AND OUTLOOK
In recent years, MLPs have reached a high level of maturity, and, currently, a transition from proof-of-concept and benchmark studies to practical simulations of a wide range of complex systems is taking place. Therefore, it can be anticipated that MLPs will allow us to overcome the limitations of conventional methods, such as empirical potentials in terms of accuracy and ab initio molecular dynamics in terms of efficiency, paving the way for simulations of extended systems with unprecedented accuracy. MLPs have demonstrated this capability already for a broad spectrum of aqueous systems, ranging from neutral and protonated water clusters to bulk liquid water and ice, liquid/vapor interfaces and from electrolyte solutions to complex solid-water interfaces (see Fig. 1). In all these studies, MLPs have enabled simulations with first-principles accuracy that previously have been prohibitively demanding, as evidenced by a rapidly growing number of publications in the field shown in Fig. 2.
Overview of the number of articles published per year for different types of MLPs applied to water and aqueous systems as discussed in this Perspective. These are, in order of first use for these systems, neural network potentials (NNPs) based on simple feed-forward neural networks, Gaussian process regression (GPR), high-dimensional neural network potentials (HDNNPs), deep potential molecular dynamics (DeePMD), support vector regression (SVR), random forest (RF), embedded atom neural networks (EANNs), graph neural networks (GNNs), Gaussian-moment neural networks (GMNNs), comprehensive genetic algorithm (CGA), and neuroevolution potentials (NEPs).
Overview of the number of articles published per year for different types of MLPs applied to water and aqueous systems as discussed in this Perspective. These are, in order of first use for these systems, neural network potentials (NNPs) based on simple feed-forward neural networks, Gaussian process regression (GPR), high-dimensional neural network potentials (HDNNPs), deep potential molecular dynamics (DeePMD), support vector regression (SVR), random forest (RF), embedded atom neural networks (EANNs), graph neural networks (GNNs), Gaussian-moment neural networks (GMNNs), comprehensive genetic algorithm (CGA), and neuroevolution potentials (NEPs).
MLPs bridge the gap between two traditional approaches in atomistic simulation, ab initio and force-field-based MD simulations, offering advantages over both. MLPs excel by enabling significantly longer length and time scales compared to AIMD simulations, thanks to their computational efficiency. This extension is crucial for accurately computing properties, such as equations of state across a broad range of parameters, kinetics of phase transitions, and interface mobility, and to study supercooled water as well as solvation dynamics on long time scales. In comparison with classical force fields, MLPs can potentially match the accuracy of the underlying electronic structure calculations, which usually surpass the accuracy of traditional force fields. Furthermore, MLPs are reactive since they do not rely on predefined chemical bonds, unlike most traditional force fields. This reactivity is crucial for correctly modeling systems with complex interactions, such as solid–liquid interfaces and aqueous solutions, where water dissociation, recombination, and proton transfer occur. It is also essential for accurately representing the entire range of acid/base chemistry, which fundamentally relies on proton transfer processes.
Over the past two decades, progress in MLPs for aqueous systems has focused on different frontiers. While right from the start the highly flexible functional form of ML algorithms has enabled a numerically very accurate representation of the electronic structure reference data, a severe challenge in the early years has been the very limited number of degrees of freedom that could be considered. Only the development of modern descriptors for the atomic environments allowed us to extend MLPs to condensed systems such as liquid water with all their associated degrees of freedom. Recently, message passing neural networks75 have become a promising alternative to the use of predefined descriptors, opening many new exciting possibilities for the construction of MLPs with higher accuracy, based on less data, in particular if equivariant features are used. Another frontier that has increasingly received attention in recent years is the incorporation of physical concepts into hitherto purely mathematical machine learning potentials, with many developments specifically aiming for improved descriptions of long-range electrostatic interactions, van der Waals forces, long-range charge transfer,52 and electron densities.318 This inclusion of physically meaningful terms, not only in the total energy expression but also in the form of novel descriptors,231,319 will further increase the accuracy and transferability of the potentials.
To effectively simulate chemical processes in the aqueous phase, the underlying potential function must rely on accurate electronic structure calculations, since the chosen reference method represents a natural limit for the accuracy of MLPs. While coupled cluster accuracy has already been achieved in MLPs for liquid water,121,229,230 which would have been unthinkable with conventional empirical potentials, reaching this gold-standard for more complex systems, such as solid–liquid interfaces, is very challenging. Therefore, DFT will likely remain the dominant method for the reference electronic structure calculations of many systems in the foreseeable future, in particular for those systems involving solids. Although the generalized gradient approximation (GGA) has been the most commonly employed functional in the study of aqueous systems as they offer a good compromise between computational cost and accuracy, more and more powerful computing resources increasingly enable the use of more advanced and computationally expensive functionals, such as meta-GGA (particularly SCAN) and hybrid functionals, in the investigation of several aqueous systems, and even this level of accuracy has remained essentially inaccessible by on-the-fly ab initio MD to date. Consequently, complex aqueous systems can now be investigated with a previously unattainable level of accuracy enabling predictive simulations. Moreover, MLPs can be employed to evaluate the accuracy of the underlying level of theory with respect to experimental values in a broad range of scenarios. This extends beyond the traditional comparison at standard temperature and pressure, allowing for a comprehensive evaluation that may provide insightful guidance for the development of theoretical methods.
The functional flexibility is a key property of MLPs, but it is a double-edged sword: on the positive side, this flexibility enables the accurate approximation of the PES based on the reference data. On the negative side, however, such flexibility severely limits the extrapolation capabilities of MLPs to chemical environments not adequately sampled during the training process. In fact, extrapolation to unfamiliar environments can lead to unphysical structures and completely wrong simulation results. Therefore, the construction and validation of MLPs have to be done with great care to ensure that all relevant local environments are included in the training set. Moreover, it is much more challenging than in case of simpler empirical potential to provide “boxed” MLPs for general usage, since not only the underlying parameters but also information about the range of validity is crucial information for successful applications. For instance, when studying solid–water interfaces, it becomes necessary to train the MLP not only on the bulk material and bulk water separately but also on systems that include all relevant interface configurations. Thus, increasing the complexity of the system also increases the number of local environments that must be included in the training data. Although AIMD simulations are sometimes used to generate initial training sets, they are usually insufficient since they often fail to capture the less frequently visited structures. Fortunately, this challenge has been largely overcome in recent years by incorporating active learning for the generation of the reference data. An alternative approach, recently emerging, focuses on establishing foundation models to be used as a starting point for MLP development.237 This might improve the transferability and contribute to the development of more accurate MLPs.
While, in this Perspective, we have focused on the construction and application of accurate and efficient interatomic potentials, machine learning approaches can be useful in several other ways for the atomistic simulation of aqueous systems, and, more generally, of materials and biomolecular systems.47 For instance, neural networks have been employed to classify local structures in liquid water and various forms of ice.197,198 The accurate identification of molecular structures with high spatial resolution is important, for instance, in the study of crystallization, melting, crystal growth, and the formation and migration of defect. Another challenge in molecular simulations lies in reducing or even completely removing correlations in the statistical sampling of configuration space as it occurs in sequential sampling methods, such as molecular dynamics and Markov chain Monte Carlo simulations. Here, normalizing flows320,321 or other generative methods can play an important role for sampling the equilibrium distribution and also for creating new stable crystal structures.322 Moreover, machine learning approaches have been suggested as a way to discover reaction coordinates and enhance the sampling of rare transitions occurring in complex molecular systems.323,324 There is little doubt that in the years to come further, new ML and AI tools will be applied to the computational investigation of matter at the atomistic level, creating new opportunities for studying complex aqueous systems.
CONCLUSIONS
Machine learning and, more generally, artificial intelligence are currently revolutionizing our way to do science and are providing new opportunities not achievable with traditional approaches. In the field of computational materials science, the advent of accurate, flexible, and efficient MLPs has dramatically increased the time and length scales accessible by atomistic simulations of materials with ab initio accuracy. In this Perspective, we have provided an overview of the key concepts of such machine learned potentials with a focus on their application to water and aqueous systems. The broad spectrum of systems successfully explored with these techniques, including water clusters, bulk liquid water and ice, the liquid/vapor interface, electrolyte solutions, and solid–liquid interfaces, underscores the flexibility, efficiency, and the high level of maturity they have reached since recent years.
Since machined learned potentials accurately reproduce the underlying reference data obtained with electronic structure methods but in applications require a much lower computational effort, they now make it possible to compute complex materials properties, such as phase diagrams. In this way, they do not only allow us to gain new insights into a variety of systems, but they also provide a way to truly test the theoretical description underlying the reference data and to reveal their possible limitations. This facilitates the generation of high quality reference data in the future as a basis for truly predictive ML-based computer simulations of complex materials.
In summary, modern MLPs have created new opportunities for the investigation of aqueous systems that would have been unimaginable with conventional methods for the foreseeable future. Carefully trained and validated MLPs can be employed to study complex reactive aqueous systems accurately across large time and length scales, without imposing ad hoc empirical constraints. While predicting the future of this rapidly evolving field is challenging, the remarkable progress made to date suggests that we can expect exciting new developments and some surprising breakthroughs in the years to come.
ACKNOWLEDGMENTS
J.B. and A.O. are grateful for the funding by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Grant No. TRR/CRC 247 (A10, Project No. 388390466) and under Germany’s Excellence Strategy—Grant No. EXC 2033 RESOLV (Project No. 390677874). C.D. and P.M.D.H. acknowledge the funding from the Austrian Science Foundation FWF through the Projects Doctoral College Advanced Functional Materials (Grant No. DOC 85-N) and the SFB TACO (Grant No. F-81).
AUTHOR DECLARATIONS
Conflict of Interest
The authors have no conflicts to disclose.
Author Contributions
All authors contributed equally to this work.
Amir Omranpour: Visualization (equal); Writing – original draft (equal); Writing – review & editing (equal). Pablo Montero de Hijes: Visualization (equal); Writing – original draft (equal); Writing – review & editing (equal). Jörg Behler: Conceptualization (equal); Funding acquisition (equal); Supervision (equal); Writing – original draft (equal); Writing – review & editing (equal). Christoph Dellago: Conceptualization (equal); Funding acquisition (equal); Supervision (equal); Writing – original draft (equal); Writing – review & editing (equal).
DATA AVAILABILITY
Data sharing is not applicable to this article as no new data were created or analyzed in this study.