Exploring the configuration space of elemental carbon with empirical and machine learned interatomic potentials

Marchant, George A.; Caro, Miguel A.; Karasulu, Bora; Pártay, Livia B.

doi:10.1038/s41524-023-01081-w

Published: 27 July 2023

Volume 9, article number 131, (2023)

npj Computational Materials

Abstract

We demonstrate how the many-body potential energy landscape of carbon can be explored with the nested sampling algorithm, allowing for the calculation of its pressure-temperature phase diagram. We compare four interatomic potential models: Tersoff, EDIP, GAP-20 and its recently updated version, GAP-20U. Our evaluation is focused on their macroscopic properties, melting transitions, and identifying thermodynamically stable solid structures up to at least 100 GPa. The phase diagrams of the GAP models show good agreement with experimental results. However, we find that the models’ description of graphite includes thermodynamically stable phases with incorrect layer spacing. By adding a suitable selection of structures to the database and re-training the potential, we have derived an improved model — GAP-20U+gr — that suppresses erroneous local minima in the graphitic energy landscape. At extreme high pressure nested sampling identifies two novel stable structures in the GAP-20 model, however, the stability of these is not confirmed by electronic structure calculations, highlighting routes to further extend the applicability of the GAP models.

Introduction

Carbon is the fourth most abundant element in the universe and, while it readily forms a much wider range of compounds than any other element (including the bio-polymers crucial for life), its behaviour is just as rich in elemental form. Carbon atoms can bond to each other in fascinatingly diverse ways, forming a wide range of two- and three-dimensional allotropes, amorphous phases, clusters, fullerenes and multi-layered particles that give carbon one of the most diverse ranges of chemical and physical properties among materials^1,2,3,4,5,6,7,8. The Samara carbon database, which catalogues simulation data for these proposed structures of carbon, consists of more than five hundred periodic configurations⁹ (as of December 2021). Furthermore, the properties of these structures are often unique, such as the hardness of diamond, the electronic properties of graphene, or the high ductile strength of carbon-fibres, resulting in extensive use of carbon across a wide range of industries, from battery design to advanced optical technologies^10,11.

One of carbon’s best known features is its phase transition from graphite to cubic diamond at pressures above 2 GPa. Diamond and graphite’s vastly different density and structural properties are reflected in carbon’s melting curve, which exhibits a dramatic change at the corresponding triple point; shifting from a subtly non-monotonic curve at lower pressures where graphite is formed, to diamond’s melting curve that quickly increases in temperature as greater pressures are applied. Diamond remains stable up to at least 300 GPa, but due to the extreme pressure little is known experimentally of carbon’s atomic structure beyond this. Ab initio calculations suggest a maximum in diamond’s melting line at around 450 GPa, as well as a transition to bc8 between 890–1000 GPa, and shock-wave experiments provide evidence for the accuracy of these predictions^12,13. The bc8 structure is also predicted to have a maximum in the melting temperature at around 1450 GPa, due to changes in the coordination number in the liquid phase¹⁴. In the terapascal regime further phase transitions are predicted, such as bc8-simple cubic and simple cubic-simple hexagonal³.

Atomistic simulations have thus played a major role in discovering novel phases of carbon; furthering our understanding of its phase diagram; and driving the development of new applications by providing useful insight into their structure and properties. However, the diverse properties of carbon mean that capturing its various characteristics within interatomic potential models is particularly difficult, especially when creating models that aim to be transferable among different allotropes and reproduce carbon’s macroscopic properties reliably under a wide range of conditions.

Several empirical interatomic potential models have been developed for carbon in the past 35 years. The bond-order potential introduced by Tersoff¹⁵ in 1988 is still considered to be the fastest and simplest carbon potential. Its elegant functional form, in which the strengths of chemical bonds are modified according to the number of nearest neighbours, allows for rapid calculation of chemical properties without a significant sacrifice in accuracy when compared to other, more expensive potentials¹⁶. Despite its shortcomings, the primary one being its lack of consideration for long-range interactions, it is still an ideal choice for testing the performance of new computational methods and more complex chemical potentials. Other early carbon models include the Stillinger-Weber potentials parameterised for diamond and graphitic carbon^17,18, although, due to their fixed coordination, these models are limited in their transferability across structures. Developed from the Tersoff model to include a wider range of parameters (conjugation and torsional terms), the “reactive” bond-order potentials were introduced: REBO (also referred to as the Brenner potential)¹⁹ and REBO-II²⁰. They were further improved by the inclusion of a long-range term to create a potential that accounts for the effects of dispersion, providing the adaptive intermolecular REBO (AIREBO)²¹. The environment-dependent interaction potential (EDIP) consists of a two-body pair energy, a three-body angular penalty, and a generalised description of coordination²². EDIP is known to successfully predict topological properties of carbonaceous films as well as clusters^23,24. One of the first empirical models capable of providing an accurate description of low-to-medium pressure phases of carbon is the long-range carbon bond-order potential (LCBOP).
The LCBOP model is partially based on ab initio data, closely matches the ab initio MD results for the liquid structure, and accounts for interplanar interactions in graphite²⁵. Ghiringhelli et al. have calculated the pressure-temperature phase diagram of the LCBOP potential, including the melting line up to 60 GPa and the graphite–diamond transition, showing good agreement with experimental findings²⁶. Another family of potentials was developed to accurately describe carbon’s bond formation and dissociation: the reactive force field (ReaxFF) potentials^27,28.

The emergence of machine-learning (ML) techniques enables the construction of potential models which are comparable in cost to classical interatomic potentials and, at the same time, comparable in accuracy to ab initio-level calculations. Using the Gaussian approximation potential (GAP) formalism^29,30, an ML potential was developed to describe the behaviour of liquid and amorphous carbon accurately³¹. This was later extended to include properties of crystalline bulk phases, defects and surfaces, resulting in the GAP-20 model^32,33. The C60 GAP force field includes van der Waals corrections and is especially suited for the simulation of C₆₀ fullerene structures³⁴, with another recent version specifically trained for nano-porous carbon³⁵. Recently, two other ML carbon potentials were also developed, using neural-networks³⁶ and the ACE formalism³⁷.

The performance and reliability of these potentials have been compared from different perspectives. Their (in)ability to describe amorphous structures^16,23 has underscored transferability issues and highlighted the need for thorough investigation of models in order to trust the interpretation of simulation results. The accuracy in predicting microscopic properties (e.g., surface energy, formation energy of common defects) has also been compared³². The performance of seven models in predicting the properties of carbon nano-clusters has recently been investigated, with a focus on their accuracy in structure search and global optimisations²⁴. The GAP-20 model emerged as the best performing model.

While these studies provide a detailed picture of the microscopic properties of carbon potentials, our knowledge of their macroscopic properties is limited. In order to understand the reliability and predictive power of computational results, it is important to examine the potential models’ macroscopic behaviour and evaluate their phase stability, unbiased by our chemical intuition. Ultimately, this also informs the development of new generations of potentials, such as ML-based models, highlighting strengths as well as areas for improvement.

In the current work we aim to evaluate the performance of carbon potentials and calculate their pressure-temperature phase diagram, by performing an exhaustive and predictive sampling of the potential energy surface, using the nested sampling technique^38,39. Nested sampling (NS) was first introduced by John Skilling in the area of Bayesian statistics^40,41, later taken up by various research fields³⁹ and adapted to sample the potential energy surface of atomistic systems^38,42. The main advantages of NS are that it automatically generates thermodynamically relevant structures without any prior knowledge of, e.g., (meta)stable crystalline structures; moreover it provides unique and easy access to the notoriously elusive partition function. Thermodynamic properties that are otherwise difficult to determine, such as the heat capacity or free energy, thus become straightforwardly calculable. The added usefulness of NS resides in the fact that a broad picture of the phase diagram can be gained by a single technique, overcoming the typical procedural barriers one faces when working with multiple simulation methods and/or packages.
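
The route from an NS run to the partition function and heat capacity can be illustrated with a minimal sketch. It assumes the standard constant-pressure NS bookkeeping, in which the phase-space volume after iteration i is approximated by (K/(K+1))^i for K walkers, and it returns only the configurational part of the heat capacity. The function names `ns_probabilities` and `ns_heat_capacity` are hypothetical and are not taken from any NS package or from the authors' workflow.

```python
import numpy as np

K_B = 8.617333262e-5  # Boltzmann constant in eV/K


def ns_probabilities(enthalpies, n_walkers, temperature):
    """Normalised Boltzmann-weighted NS probabilities p_i at one temperature.

    `enthalpies` is the sequence of enthalpies (eV) recorded at successive
    NS iterations; `n_walkers` is the number of live points K (assumption:
    one sample is removed per iteration).
    """
    h = np.asarray(enthalpies, dtype=float)
    i = np.arange(1, h.size + 1)
    ratio = n_walkers / (n_walkers + 1.0)
    # log of w_i = Gamma_{i-1} - Gamma_i, with Gamma_i = ratio**i
    log_w = (i - 1) * np.log(ratio) - np.log(n_walkers + 1.0)
    log_p = log_w - h / (K_B * temperature)
    log_p -= log_p.max()  # shift to avoid overflow in exp
    p = np.exp(log_p)
    return p / p.sum()


def ns_heat_capacity(enthalpies, n_walkers, temperatures):
    """Configurational heat capacity (eV/K) from enthalpy fluctuations."""
    h = np.asarray(enthalpies, dtype=float)
    cp = np.empty(len(temperatures))
    for k, temperature in enumerate(temperatures):
        p = ns_probabilities(h, n_walkers, temperature)
        mean_h = np.sum(p * h)
        var_h = np.sum(p * (h - mean_h) ** 2)
        cp[k] = var_h / (K_B * temperature ** 2)
    return cp
```

The same probabilities can be reused to average any per-sample observable A, for instance `np.sum(ns_probabilities(h, K, T) * A)`, and a peak in the returned heat capacity as a function of temperature marks a candidate phase transition.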

The power and usefulness of NS have been thoroughly demonstrated in studies of various systems, as well as in comparisons with widely used computational techniques. Examples of its application include cluster formation^42,43,44; calculation of the quantum partition function⁴⁵; sampling transition paths⁴⁶; and the calculation of the pressure-temperature phase diagram for various metals^47,48,49, alloys^50,51, and model potentials⁵², identifying previously unknown stable solid phases.

In the current work we compare the behaviour of three widely used interatomic potential models for carbon using NS, which span a suitable range in terms of complexity, accuracy and computational cost. We first use the ML potential, GAP-20³², considered to be the state-of-the-art model for carbon^3,16,23, to examine its reliability outside its original training conditions and hence understand better the extent of the model’s transferability and predictive power. The majority of our GAP-20 calculations were performed using the original model detailed in Ref. ³², and we also provide supplemental results generated with the updated version of the model, GAP-20U, released recently³³. As the fastest and simplest model, we evaluate the phase diagram of the Tersoff model in the original parameterisation form, as available in LAMMPS (although valuable modifications to the Tersoff carbon potential also exist^53,54). We also selected EDIP²² for modelling, providing a mid-point in accuracy and computation cost between the Tersoff and GAP-20 potentials.

Results

GAP-20 and GAP-20U

Nested sampling runs with the GAP-20 potential were carried out with a system size of 16 atoms at ten different pressures between 0.1 and 1000 GPa. Due to the large computational cost of the GAP potential, fewer calculations were carried out with 32 atoms - at pressures of 1, 10, 50, 500 and 800 GPa - to assess the finite size effects at pressures where different solid phases are expected. The configuration space of the GAP-20U potential was also sampled using NS, at pressures of 0.1, 1, 10 and 50 GPa, to assess the extent to which the melting line may deviate from the original GAP-20 model in the graphite and cubic diamond phases. The resulting pressure-temperature phase diagram is illustrated in Fig. 1. The experimentally determined phase boundaries^55,56,57 are shown by solid black lines, highlighting that the graphite melting line has a slight maximum, as above 0.4 GPa the density of graphite becomes lower than that of the liquid, causing the melting line to have a negative gradient. This change however is very subtle, driven by the relatively weak interaction between graphite’s neighbouring hexagonal layers. Above 20 GPa the liquid carbon freezes into the high-density cubic diamond structure, resulting in melting temperatures increasing rapidly with pressure in comparison to the graphite phase.

**Fig. 1: Pressure-temperature phase diagram of GAP-20 ML potentials.**

The melting curve predicted by the GAP-20 model follows these experimental features with reasonable accuracy, though at pressures below 10 GPa there is a clear positive gradient in the graphite melting line where the experimental melting line is non-monotonic. It is in the graphite region of the phase diagram that we also observe the only notable deviation between the melting lines of the GAP-20 and GAP-20U models, with a difference of around 10% in melting temperatures at 0.1 GPa such that the GAP-20U model’s phase boundary has a steeper gradient and deviates further from experimental trends compared to the GAP-20. Figure 2 shows the heat capacity curves calculated by NS using GAP-20, showing how the points on the melting line were determined based on the location of the peaks. The peaks corresponding to 32-atom runs are sharper than those of the 16-atom runs, reflecting how in the thermodynamic limit the heat capacity diverges at first-order phase transitions. The difference between the transition temperature predicted by 16 and 32-atom runs is approximately 8% at lower pressures, with the difference diminishing at pressures above 100 GPa, suggesting that finite size effects become negligible at higher pressures.

**Fig. 2: Heat capacity and densities of GAP-20 ML potentials.**

At 0.1 GPa, the liquid phase generated by NS is dominated by chain-like structures. This is in agreement with the known low-coordinated liquid phase formed at low pressures, dominated by branch-like structures³¹. To demonstrate the change in the typical coordination of carbon atoms at different temperatures and pressures, we calculated the NS weighted average of the coordination number over a range of temperatures using Eq. (1), as shown in Fig. 3. Here we see that at 0.1 GPa the average number of neighbours in the liquid phase reaches a maximum of two before rapidly increasing to three at the freezing transition. As pressure increases, the liquid can no longer sustain the chain-like structures, and we observe an increase in the average coordination number. The average coordination number also reflects the structure of the solid phases, with three nearest neighbours in the case of graphite and four in the case of diamond, with higher values for the extreme high pressure phases.
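
As an illustration of how a coordination number can be extracted from a single configuration, the sketch below counts neighbours within a cutoff under periodic boundary conditions. It is a simplified example, not the authors' analysis code: it assumes a cubic cell (NS cells are generally not cubic) and a cutoff shorter than half the cell length, and the name `coordination_numbers` is hypothetical.

```python
import numpy as np


def coordination_numbers(positions, cell_length, cutoff):
    """Count neighbours within `cutoff` of each atom in a cubic periodic cell.

    Assumes cutoff < cell_length / 2 so that the minimum-image convention
    is sufficient.
    """
    pos = np.asarray(positions, dtype=float)
    # All pairwise separation vectors, shape (N, N, 3)
    delta = pos[:, None, :] - pos[None, :, :]
    # Minimum-image convention for a cubic cell
    delta -= cell_length * np.round(delta / cell_length)
    dist = np.linalg.norm(delta, axis=-1)
    np.fill_diagonal(dist, np.inf)  # an atom is not its own neighbour
    return np.sum(dist < cutoff, axis=1)
```

Averaging the result over atoms gives one number per NS sample; these per-sample values can then be combined into a temperature-dependent average using the NS weights.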

**Fig. 3: Average coordination number of GAP-20 model at different pressures.**

Up to 20 GPa the liquid freezes into the graphite structure. While at lower pressures the density of the graphite is found to be higher than that of the liquid, this trend changes, and at 20 GPa we can observe a maximum on the density curve at the transition, shown in the bottom panel of Fig. 2. This is consistent with the expectation that the melting line has a negative gradient in that pressure range. We can therefore deduce that within the GAP-20 and GAP-20U models there is a compensation point around 10–20 GPa where the density of the liquid phase is equal to that of graphite at the melting transition, corresponding to a maximum in the phase boundary. This is in qualitative agreement with experiment, though the maximum is expected to occur at lower pressures, around 0.5 GPa.
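
The link between the density ratio and the gradient of the melting line follows from the Clausius–Clapeyron relation, written here per atom with a positive enthalpy of melting ΔH_m (the symbols are introduced for illustration and are not taken from the original text):

```latex
\frac{\mathrm{d}T_\mathrm{m}}{\mathrm{d}P}
  = \frac{\Delta V}{\Delta S}
  = \frac{T_\mathrm{m}\left(V_\mathrm{liq} - V_\mathrm{sol}\right)}{\Delta H_\mathrm{m}},
\qquad
V_\mathrm{liq} - V_\mathrm{sol} \propto \frac{1}{\rho_\mathrm{liq}} - \frac{1}{\rho_\mathrm{sol}} .
```

The gradient is therefore positive when the solid is denser than the liquid, negative when the liquid is denser, and zero where the two densities coincide, which is the condition for a maximum in the melting line.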

The graphite configurations explored by NS are diverse both in terms of the distance between adjacent graphite layers and in stacking pattern. Among the configurations generated by NS we can find the most energetically favourable AB and ABC stacking variants^58,59, alongside AA stacking and a variety of unique arrangements where adjacent layers are shifted only partially in relation to each other, spanning the phase space between the typical AA, AB and ABC patterns. In terms of the distance between the graphite layers, we see a significant change with respect to temperature and pressure. Figure 4 shows the distribution of carbon atoms along the normal vector of the graphite structure at different pressures and temperatures, calculated as the phase space–weighted average (using Eq. (1)) from configurations generated by NS. At 0.1 GPa the typical spacing between neighbouring layers is around 3.8 Å at temperatures below 2000 K, while the intralayer distributions become significantly broader as the temperature increases. This layer distance corresponds to a lattice parameter of c = 7.6 Å, much larger than the experimentally observed value of c = 6.71 Å⁶⁰. The underlying reason for this discrepancy becomes obvious by calculating the energy of graphite structures as a function of the lattice parameters, shown in panel (a) of Fig. 5. These calculations reveal that the graphitic energy surface has multiple minima with respect to layer spacing in the case of GAP-20, with the lowest energy distance confirmed to be at c = 7.6 Å. At higher pressures the contribution of the pressure-volume term to the enthalpy becomes significant enough that local minima corresponding to shorter layer distances become enthalpically favourable. This is reflected in the histograms of Fig. 4, which show that NS runs at 10 and 20 GPa sampled graphite configurations that are consistent with the local minimum at c = 5.5 Å. 
We even observe a phase transition at 10 GPa as temperature is reduced below 2000 K, as the average spacing rapidly decreases from c = 6.3 Å to 5.5 Å, with a double peak feature at 1000 K that reflects the simultaneous sampling of graphite basins with distinct layer separations. It is important to note that this behaviour naturally influences the average density of the sampled graphite phases as well. Specifically, it leads to a lower-than-expected density at low pressures and a higher density than expected at higher pressures. We could speculate that this behaviour, if affecting the density ratio between graphite and liquid carbon, is capable of changing the gradient of the melting curve and shifting the expected maximum in the melting temperature to higher pressures. We will address this idea further in a later section on improving the potential. The multiple minima as a function of graphite lattice parameters can be still observed, though to a lesser extent, in the case of GAP-20U (see Fig. 5, panel b). As in the case of the GAP-20, the updated model exhibits a phase transition at 10 GPa from high to low density graphite just below 2000 K, though the change in density is significantly smaller. Calculations of the graphite energy landscape using DFT (shown in panel d of Fig. 5) show that these curves should be completely smooth, with only a single minimum at 6.7 Å.
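
The pressure at which a shorter layer spacing becomes favourable can be estimated with a simple worked relation. This is a sketch under stated assumptions: the in-plane lattice parameter a is held fixed, the positions of the two minima c₁ > c₂ are taken not to shift with pressure, and E and V refer to the same hexagonal cell.

```latex
H(c) = E(c) + P\,V(c), \qquad V(c) = \frac{\sqrt{3}}{2}\,a^{2}c ,
\\[4pt]
H(c_2) < H(c_1)
\;\Longleftrightarrow\;
P > P^{*} = \frac{E(c_2) - E(c_1)}{V(c_1) - V(c_2)}
        = \frac{E(c_2) - E(c_1)}{\tfrac{\sqrt{3}}{2}\,a^{2}\,(c_1 - c_2)} .
```

A higher-energy minimum at smaller c thus becomes the enthalpic ground state once the pressure exceeds P*, because the pressure-volume term rewards the smaller cell volume.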

**Fig. 4: Pressure and temperature dependence of graphite layer spacing.**

**Fig. 5: Minimum energy layer spacing of graphite using different models.**

Although the liquid freezes to the graphite structure at 20 GPa, in the case of the GAP-20U potential we observe the solid-solid transition to diamond at 2800 K. This is marked by a sudden and significant jump in density, which can be seen in the bottom panel of Fig. 2. Using a combination of density and the Steinhardt bond-order parameters⁶¹ Q₄ and W₄, we are able to distinguish between diamond and graphite configurations generated by NS and calculate their contributions to the Gibbs free energy separately, as a function of temperature. Comparing these free energy contributions allows us to locate the phase transition between the different crystalline structures and the liquid, as shown in Fig. 6. As expected, the temperatures at which graphite and diamond become the most stable phases correspond exactly with peaks in the heat capacity, as well as the sudden step in density that was previously noted.
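
To illustrate why Q₄ separates graphite-like from diamond-like environments, the sketch below evaluates a per-atom Q₄ from the bond vectors of a single atom. It uses the spherical-harmonic addition theorem, by which Q_l² equals the average of the Legendre polynomial P_l(cos γ) over all pairs of bonds, so no spherical-harmonic routines are needed. This is an illustrative local variant; the exact definition used in the paper (neighbour selection, local versus global averaging) may differ, W₄ is not covered, and the name `local_q4` is hypothetical.

```python
import numpy as np


def local_q4(bond_vectors):
    """Steinhardt Q4 of one atom from the vectors pointing to its neighbours.

    Uses Q_l^2 = (1/N^2) * sum_ij P_l(cos gamma_ij), where gamma_ij is the
    angle between bonds i and j (the i = j terms are included).
    """
    b = np.asarray(bond_vectors, dtype=float)
    unit = b / np.linalg.norm(b, axis=1, keepdims=True)
    cos_gamma = np.clip(unit @ unit.T, -1.0, 1.0)
    # Legendre polynomial P4(x) = (35 x^4 - 30 x^2 + 3) / 8
    p4 = (35.0 * cos_gamma**4 - 30.0 * cos_gamma**2 + 3.0) / 8.0
    # The sum is non-negative analytically; guard against round-off
    return np.sqrt(max(p4.sum(), 0.0)) / len(b)


# Ideal four-fold (diamond-like) and three-fold planar (graphite-like) bonding
tetrahedral = [(1, 1, 1), (1, -1, -1), (-1, 1, -1), (-1, -1, 1)]
trigonal = [(1, 0, 0), (-0.5, np.sqrt(3) / 2, 0), (-0.5, -np.sqrt(3) / 2, 0)]
print(local_q4(tetrahedral), local_q4(trigonal))  # approx. 0.509 and 0.375
```

In this idealised picture the two environments give clearly separated values, so a threshold on Q₄ combined with density is enough to assign most solid samples to one basin or the other.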

**Fig. 6: Graphite–diamond phase transition of GAP-20U.**

While NS simulations at 40, 50 and 100 GPa also explored graphite and hexagonal diamond structures to some extent, these phases remain metastable at all temperatures, as the cubic diamond structure becomes the dominant phase. Crucially, the change in the stable solid phase, from graphite to diamond, also corresponds to the change in melting line from a roughly vertical curve to one with a large positive gradient, as also observed experimentally^55,56,57. These agreements are particularly notable, as high-pressure behaviour was not explicitly considered in the potential development process, and there is no indication that structure optimisation was performed at non-zero pressures in the training data. It must be noted however that the training data contains configurations where the stress tensor has non-zero diagonal elements, corresponding to pressures ranging between −100 and 100 GPa, isotropic or otherwise.

To evaluate the accuracy of the GAP-20U model more generally – across the liquid, graphite and cubic diamond phases – we take configurations generated with NS at three different pressures (0.1, 10 and 50 GPa) over a suitable range of temperatures (500–11000 K) and for each sample calculate the difference in potential energy predicted by the GAP-20U and DFT models. The results of these calculations are shown in Fig. 7, with a maximum energy difference of ~0.35 eV/atom in the liquid phase at 0.1 GPa. At each pressure, the energies of the liquid configurations are typically underestimated by the GAP-20U, and unsurprisingly the overall distribution of energy differences in the liquid phase is considerably broader than those of the solid phases, with a sharp narrowing of the distributions at the freezing transitions. In the graphite phase at 0.1 GPa and 10 GPa we see that the agreement between the GAP-20U and DFT energies improves as the temperature decreases and crystal order increases. However, at 10 GPa there is a sudden deviation in energies just below 1000 K, corresponding to the graphite spacing transition that can be seen in Fig. 4. In comparison, the energy differences in the cubic diamond phase at 50 GPa are far smaller than in the graphite phases at low temperatures, suggesting that diamond’s higher degree of crystal symmetry and stronger, isotropic bonding make its energy landscape less difficult to approximate via machine learning.

**Fig. 7: Comparison between GAP-20U and DFT energies.**

Before continuing on to discuss the GAP-20 model’s extreme high pressure behaviour, we acknowledge the erroneous stability of a very low density bcc phase in the PES of the GAP-20, which would later be addressed in the updated GAP-20U model³³. Due to its large nearest neighbour bonds (~80% larger than typical carbon bonds) and a coordination structure that is very different from that of the corresponding liquid phase, this phase is not explored by the NS, nor by the structure searches performed in the original work. Hence, we speculate that the phase space volume of the low density bcc phase is likely to be negligible compared to the graphite structure, and separated from the liquid by extremely high free energy barriers. Further evidence of the bcc structure possessing a relatively small phase space volume can be found in the Discussion section of the Supplementary Material.

The predictive power of the GAP-20 and GAP-20U models is reasonably good, even up to 100 GPa. Further increasing the pressure will certainly break down the reliability of the model, but exploring to what extent and under what conditions this will occur can still provide us critical information about the ability of the machine learning to extrapolate, as well as areas for future improvement. NS simulations above 100 GPa suggest that the melting line closely follows the trend expected from DFT calculations (see Fig. 1), but at extreme high pressures two new phases emerge as ground state structures of the GAP-20, both in the 16-atom and 32-atom simulations. At 500 and 800 GPa the stable structure predicted by NS is that of a strained variant of cubic diamond, where the strain is positive, in the direction of an arbitrary cubic axis and coupled with a compression along the perpendicular axes. Between 800 and 1000 GPa the system transitions to a highly compressed hexagonal close packed structure. This belongs to the P6₃/mmc spacegroup, having two atoms in the unit cell, each with eight nearest neighbours. We will refer to this structure as strained hexagonal close-packed (strained hcp). Figure 8 shows snapshots of these two new structures along with cubic diamond, as well as the corresponding radial distribution functions, with all three structures optimised at 300 GPa. The enthalpy differences between the different optimised structures at 0 K are shown in Fig. 9, calculated by the GAP-20 and GAP-20U potential up to 1 TPa, as well as with DFT for comparison up to 10 TPa. While both GAP-20 and GAP-20U predict the stabilisation of strained cubic diamond structure at very high pressures, cubic diamond becomes the ground state again above 380 GPa in the case of GAP-20U. This demonstrates that changes to an ML potential from refitting may influence the behavior of the model in data-sparse regions of phase space, far from its fitting conditions. 
It is notable that, as the bc8 structure was not included in the training data, neither version of the GAP model predicts it to be a low-enthalpy state at pressures above 1 TPa. Geometry optimisations carried out with the same DFT parameters as those used in the training show good agreement with previous ab initio random structure search results³, showing a ground state transition from cubic diamond to bc8, simple cubic, then to simple hexagonal as pressure increases. While neither of the high-pressure configurations predicted by the GAP models has proven to be a ground state structure according to DFT, they are nevertheless low-enthalpy metastable states that may be worth further consideration. Finally, the considerable agreement between the extreme high pressure melting lines predicted by GAP and DFT, in spite of the GAP’s erroneous phase stability, implies that the model maintains an accurate description of carbon’s macroscopic density.

**Fig. 8: Extreme high pressure structures using GAP-20.**

**Fig. 9: Ground state structures of different models as a function of pressure.**

GAP-20U+gr

The exhaustive and unbiased sampling of carbon’s phase space afforded by NS allows us to identify regions where each model’s description could be improved. Moreover, it helps to identify structural features that are captured inaccurately by the model. An obvious area for improvement is the extreme high pressure behaviour, i.e., the relative stability of crystal structures at pressures above 200 GPa - most notably the lack of a stable bc8 phase. Making these improvements will require the inclusion of configurations of multiple crystalline phases in the training set, with repeated exhaustive sampling to confirm the finite temperature stability of the solid phases. Given the associated computational cost of this particular flavour of GAP modelling, which focuses on providing an accurate description of the long range van der Waals interactions, we will address these improvements in a future project by concentrating on shorter range interactions which dominate at high pressures.

However, using NS we were able to identify another shortfall of the GAP-20 models: the erroneous local minima with respect to inter-layer spacing in the graphite phase. In this section we aim to improve the accuracy of the GAP-20U model’s description of the graphite phase – the primary goal being the elimination of local minima that we have previously shown to result in unphysical solid-solid graphite phase transitions. We have therefore expanded the DFT dataset on which the potential is trained by including an additional 165 ordered graphite configurations with AA, AB and ABC stacking patterns (the entire training dataset, including these new configurations, is available at DOI:10.5281/zenodo.7463706). The lattice parameter c spans a range of ±40% of the equilibrium value for each stacking pattern (determined from DFT), where c_aa = 7.02 Å, c_ab = 6.64 Å, and c_abc = 6.70 Å, while a is varied by ±2% about an equilibrium value of 2.47 Å. We otherwise used the same GAP fitting parameters as in the original GAP-20U, in order to preserve the work that was done in optimising the potential’s transferability³². The additional data points from the AB-ordered set are illustrated in panel (c) of Fig. 5 alongside the energy landscape of the updated potential, which we refer to as the GAP-20U+gr. These results resemble DFT calculations much more closely than those of both the GAP-20 and GAP-20U models. The minimum-energy layer separation remains the same, with the c lattice parameter being 6.7 Å. Performing structure optimisations with the new potential reveals that the 0 K graphite–diamond transition has been shifted to 7.2 GPa, much closer to the ab initio prediction of 5.8 GPa as compared to 9.0 GPa in the case of the GAP-20U.
Given that the GAP-20U+gr remains practically unchanged from the GAP-20U with respect to energies of non-graphite configurations and the pressure-dependent stability of different crystalline phases (see the Discussion section of the Supplementary Material), we do not expect significant deviation from the GAP-20U in other respects, though of course this is difficult to fully evaluate without considerable time and resources. Though the error with respect to the DFT graphite energy landscape is reduced considerably, there remains a shallow local minimum around c = 7.4 Å. Additional tests show that this artifact persists even when additional data points are included in this region, suggesting it is the result of influence from other configurations in the dataset. It should also be noted that long range interactions such as those between adjacent graphite layers are difficult to accurately capture with ML methods, due to the inherent increase in configurational complexity as the potential’s cut-off radius is increased. This is why recent ML potentials aiming to model the graphite phase have opted to tabulate the long range interactions³⁵.

In order to evaluate the performance of the enhanced potential in the case of unbiased PES sampling, we have performed single NS runs — using the same NS parameters as those used for the GAP-20U — at four different pressures: 0.1 GPa, 1 GPa, 10 GPa and 20 GPa. The resulting phase transitions are included in Fig. 1 and the corresponding densities are shown in the bottom panel of Fig. 2. We find that the enhanced potential, GAP-20U+gr, predicts graphite densities that are closer to experimental values than the GAP-20U, which can most clearly be seen at 10 GPa, where the GAP-20U+gr model does not undergo a phase transition to a lower density graphite phase as temperature increases, as the GAP-20U does just below 2000 K. Performing the same thermal averaging analysis as shown in Fig. 4 on the new potential, the local minimum at c = 7.4 Å does appear to affect the average spacing at 0.1 GPa by broadening the distribution, but the resulting decrease in average density is minimal.

However, despite these improvements, we do not observe a significant change in the melting behaviour, with melting temperatures matching the GAP-20U results almost perfectly. This suggests that the inaccuracy of the gradient of the melting line, closely tied to the density ratio between graphite and liquid carbon, may in fact originate from problems not with the graphite density, as we originally suspected, but from the liquid being less dense than expected.
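The link between the melting-line gradient and the two densities can be made explicit with the Clausius–Clapeyron relation, which is a standard thermodynamic result and not specific to this work. The symbols are introduced here for illustration only: T_m is the melting temperature, ΔH_fus the (positive) molar enthalpy of fusion, M the molar mass, and V and ρ the molar volume and density of the liquid (liq) and graphite (gr) phases.

$$\frac{\mathrm{d}T_{\mathrm{m}}}{\mathrm{d}P} = \frac{T_{\mathrm{m}}\,(V_{\mathrm{liq}}-V_{\mathrm{gr}})}{\Delta H_{\mathrm{fus}}} = \frac{T_{\mathrm{m}}\,M}{\Delta H_{\mathrm{fus}}}\left(\frac{1}{\rho_{\mathrm{liq}}}-\frac{1}{\rho_{\mathrm{gr}}}\right)$$

Because ΔH_fus is positive, the gradient is negative only where the liquid is denser than graphite. A liquid that is not dense enough therefore keeps the gradient too positive even when the graphite density itself is accurate.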

Tersoff potential

The phase diagram calculated with the Tersoff potential is shown in Fig. 10. Compared to the GAP-20 models, the Tersoff potential shows a significantly larger finite-size effect, consistent with the finite-size effects seen in other empirical potentials^47,49. The size of the effect, quantified by the difference in temperature between 16- and 64-atom runs, is non-monotonic with respect to pressure, peaking around the graphite–diamond transition at 50 GPa. Overall, the melting line reflects the experimental trend reasonably well at 64 atoms; however, the pressure-dependent phase stability is less accurate. Though graphite is formed below 50 GPa, the melting line reflects neither the expected negative gradient at lower pressures nor the significant change in the melting line gradient above the graphite–diamond–liquid triple point, which is overestimated by around 400% compared to experimental results. The origin of Tersoff’s monotonic melting curve in the graphite phase is its small cutoff of 2.1 Å, which leads to a dramatic underestimation of the equilibrium lattice spacing compared to DFT, by around 40%. This corresponds to a graphite phase that is more dense than the liquid phase at all pressures, hence the lack of a maximum in the melting curve.

Once again we use the Steinhardt bond-order parameters to sort solid configurations into diamond and graphite basins, allowing for the calculation of each phase’s contribution to the Gibbs free energy and the determination of solid-solid phase transitions. In Fig. 11 we demonstrate this at two different pressures. At 30 GPa, the large majority of the solid configurations fall into the basin of the graphite structure; however, the metastable diamond phase is also sampled to a lesser extent. A third and smaller basin (appearing to have Q₄ = 0.45) can be observed between these, representing a structure where small graphite-like motifs are interconnected by four-coordinated carbon atoms. As the pressure increases, the diamond structure becomes more dominant, until it becomes the ground state structure at 80 GPa. Due to the Tersoff potential’s short range, the potential energies of the perfect cubic and hexagonal diamond structures are the same, and at pressures where the diamond phases are stable their sampling is about equal, suggesting that their free energies are comparable as well.
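As an illustration of the order parameter used for this sorting, the following is a minimal sketch of a global Steinhardt Q₄ computed from a set of bond vectors. It is not the analysis code used in this work: the function names are hypothetical, the neighbour search and cutoff are left to the caller, and the spherical-harmonic sum is replaced by the equivalent Legendre-polynomial form that follows from the addition theorem.

```python
import numpy as np


def legendre_p4(x):
    """Legendre polynomial P_4(x)."""
    return (35.0 * x**4 - 30.0 * x**2 + 3.0) / 8.0


def global_q4(bond_vectors):
    """Global Steinhardt Q_4 from an (N_b, 3) array of bond vectors.

    By the spherical-harmonic addition theorem,
    Q_4^2 = (1 / N_b^2) * sum_{a,b} P_4(cos(angle between bonds a and b)).
    """
    bonds = np.asarray(bond_vectors, dtype=float)
    unit = bonds / np.linalg.norm(bonds, axis=1, keepdims=True)
    cosines = np.clip(unit @ unit.T, -1.0, 1.0)
    # The mean is non-negative in exact arithmetic; guard against rounding.
    return float(np.sqrt(max(legendre_p4(cosines).mean(), 0.0)))


# Illustrative inputs (assumed, not taken from the paper):
# the six nearest-neighbour bonds of a simple cubic site ...
simple_cubic = np.array(
    [[1, 0, 0], [-1, 0, 0], [0, 1, 0], [0, -1, 0], [0, 0, 1], [0, 0, -1]]
)
# ... and the four bonds of an ideal tetrahedral (diamond-like) site.
tetrahedral = np.array([[1, 1, 1], [1, -1, -1], [-1, 1, -1], [-1, -1, 1]])

q4_simple_cubic = global_q4(simple_cubic)
q4_tetrahedral = global_q4(tetrahedral)
```

Configurations could then be assigned to a basin by comparing their Q₄ value (together with, for example, the density) against thresholds chosen from the sampled distribution.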

**Fig. 11: Simultaneous sampling of graphite and diamond phases using Tersoff potential.**

EDIP

Nested sampling calculations using the EDIP potential were performed with 16 and 32 atoms, at pressures ranging from 1 GPa to 1500 GPa. The resulting phase diagram is shown in Fig. 12, showing overall excellent agreement with experimental phase behaviour up to 100 GPa. The melting line follows the experimental trends well, with a considerably smaller finite-size effect compared to the Tersoff potential. At lower pressures graphite is formed upon freezing, as expected, with typical layer spacings at low temperature corresponding to a lattice parameter of c = 6.4 Å, only a 5% underestimation of experimental data. To explore the EDIP’s graphite phase further, we plot its energy as a function of lattice parameters in Fig. 13. One of the potential’s shortcomings is its lack of dispersive, long-range interactions, and that is reflected in its graphitic energy landscape, as we see no change in energy beyond c = 6.4 Å. While this is not consistent with the clearly defined minimum separation predicted by DFT, the influence of the PV term in the enthalpy effectively prevents larger separations from being energetically relevant at finite pressures and zero temperature. To evaluate the finite temperature effect of this short interplanar cutoff, we plot thermally averaged distributions of carbon atoms perpendicular to the graphite planes in Fig. 14, for pressures of 1 and 10 GPa. These show an expected broadening of carbon atom dispersion in the reference layer at higher temperatures, due to thermal disorder, but for the nearest-layer distributions this broadening becomes more biased towards larger spacings as temperature increases, suggesting that the lack of a long-range energy barrier allows unphysically large layer separations to overcome the PV term and become thermodynamically relevant.

**Fig. 13: Minimum energy layer spacing of graphite using EDIP.**

**Fig. 14: Pressure and temperature dependence of graphite layer spacing using EDIP.**

This large separation-bias persists at higher pressures, however the effect is diminished, which can be intuitively understood as the increased pressure (and PV energy) encouraging smaller volumes, and preventing thermal fluctuations from stabilising larger-separation structures. As the temperature decreases, there is less kinetic energy available to smear the energies of the optimised structure, and thus fewer large separation configurations can be energetically viable. In spite of EDIP’s short interplanar cutoff and its effects, its description of the graphite melting line is remarkably accurate at only 32 atoms. Given the importance of the ratio between liquid and solid densities in shaping the melting line, and that graphite’s volume is particularly sensitive to interplanar separation, these results show that an accurate description of graphite’s phase space is essential for determining its melting behaviour.
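The competition between the PV term and thermal energy described above can be put into rough numbers. The sketch below is an order-of-magnitude estimate only, not a calculation from this work. It assumes a flat potential energy beyond the cutoff, a fixed in-plane lattice constant of 2.46 Å (a textbook value, not taken from the paper), and an AB-stacked cell holding four atoms. The function name is hypothetical.

```python
import math

GPA_PER_EV_A3 = 160.21766  # 1 eV/Å^3 expressed in GPa
K_B = 8.617333e-5          # Boltzmann constant in eV/K
A_LATTICE = 2.46           # Å, assumed in-plane graphite lattice constant


def pv_cost_per_atom(pressure_gpa, delta_c):
    """PV enthalpy cost (eV/atom) of increasing c by delta_c (Å) at fixed a."""
    cell_area = math.sqrt(3.0) / 2.0 * A_LATTICE**2  # Å^2, hexagonal cell base
    delta_volume_per_atom = cell_area * delta_c / 4.0  # 4 atoms per AB cell
    return pressure_gpa / GPA_PER_EV_A3 * delta_volume_per_atom


# Cost of a 1 Å increase in c at the two pressures discussed in the text,
# alongside k_B*T at an illustrative temperature of 1000 K.
cost_1gpa = pv_cost_per_atom(1.0, 1.0)
cost_10gpa = pv_cost_per_atom(10.0, 1.0)
thermal_energy_1000k = K_B * 1000.0
```

Under these assumptions the PV penalty at 1 GPa is roughly a tenth of k_BT at 1000 K, while at 10 GPa the two are comparable, which is consistent with the weaker large-separation bias at higher pressure.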

At a pressure of 10 GPa we begin to observe a small number of cubic and hexagonal diamond structures among the sampled configurations, around the freezing transition. However, these structures quickly lose thermodynamic relevance in comparison to the graphite phase. When pressure is increased to 20 GPa, the diamond configurations become enthalpically viable enough that the NS algorithm simultaneously samples them along with the graphite phase, such that we can identify the solid-solid transition. Supplementary Fig. 15 shows the densities and Q₄ parameters of configurations sampled by NS, which we sort into different structural basins as before. The resulting free energies of the diamond and graphite structures are shown in the middle panel of Fig. 15, demonstrating that below the melting point graphite is more stable than diamond; however, their free energy difference is smaller than in the GAP-20U model. The most notable difference between the two potentials is that, in the temperature region where the liquid phase is the most stable, EDIP’s graphite phase is less stable than the diamond, whereas the GAP-20U shows graphite to be more stable than diamond up to the solid-solid transition. This is likely a result of the EDIP’s short cutoff providing an unphysically broad distribution of layer spacings at higher temperatures, which necessarily incurs an entropic energy penalty. As in the case of the Tersoff potential, the potential energies of the perfect cubic and hexagonal diamond structures are the same using EDIP, and above 20 GPa NS runs sampled both structures equally.
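The basin free energies discussed above can be sketched as restricted partition sums over the nested sampling output. The code below is a minimal illustration under stated assumptions, not the analysis code of this work. Samples are assumed to be ordered by NS iteration, basin labels are supplied by the caller, and factors common to all basins (such as the kinetic contribution) are dropped because they cancel in free energy differences. All function and variable names are hypothetical.

```python
import numpy as np

K_B = 8.617333e-5  # Boltzmann constant in eV/K


def ns_log_weights(n_samples, n_walkers):
    """log(Gamma_{i-1} - Gamma_i) for i = 1..n, with Gamma_i = (K/(K+1))^i."""
    i = np.arange(1, n_samples + 1)
    log_ratio = np.log(n_walkers / (n_walkers + 1.0))
    # Gamma_{i-1} - Gamma_i = Gamma_{i-1} / (K + 1)
    return (i - 1) * log_ratio - np.log(n_walkers + 1.0)


def basin_free_energies(enthalpies, labels, n_walkers, temperature):
    """Return {label: G} with G = -k_B T ln(sum over basin of w_i e^{-beta H_i})."""
    enthalpies = np.asarray(enthalpies, dtype=float)
    labels = np.asarray(labels)
    beta = 1.0 / (K_B * temperature)
    log_terms = ns_log_weights(len(enthalpies), n_walkers) - beta * enthalpies
    shift = log_terms.max()  # subtract the maximum to avoid overflow in exp
    result = {}
    for label in np.unique(labels):
        z_basin = np.exp(log_terms[labels == label] - shift).sum()
        result[str(label)] = -(np.log(z_basin) + shift) / beta
    return result


# Toy input (assumed): two samples of equal enthalpy in different basins.
toy = basin_free_energies(
    [0.0, 0.0], ["graphite", "diamond"], n_walkers=1, temperature=1000.0
)
```

In the toy input the two samples differ only in their phase space weight, so the free energy difference is purely entropic and equals k_BT ln 2.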

**Fig. 15: Graphite–diamond phase transition of EDIP.**

The top panel of Fig. 12 shows that at extreme high pressures, above 100 GPa, the EDIP’s diamond melting line rapidly increases in temperature before reaching a maximum of 24000 K at 1000 GPa. This turning point is at a much larger temperature than those predicted by the GAP-20 and DFT¹⁴, 2.5 and 3 times larger respectively. The pressure at which it occurs is also larger, though by only 10%. At 1500 GPa NS calculations explore the six-coordinated P2₁2₁2₁ structure below the freezing transition, while zero temperature structure optimisations confirm that this structure is stable for EDIP above 2650 GPa.

Discussion

In the current work we reviewed the performance of three interatomic potential models of carbon, ranging from fast but less transferable empirical force fields to slower ML potentials with state-of-the-art accuracy. Our study focused on assessing their ability to reproduce experimentally observed macroscopic properties. We used the nested sampling technique to sample the potential energy surface of these models over a wide pressure range, calculating their pressure-temperature phase diagram and predicting crystalline phases. We emphasise that nested sampling is a unique tool that allows exhaustive exploration of the phase space and makes the calculation of the entire phase diagram a relatively straightforward process, while also being predictive and not restricted by known or considered crystalline structures. All three models, GAP-20, Tersoff and EDIP, predicted the graphite structure to be more stable at low pressures and the diamond structure at higher pressures. However, the transition between these as well as the location of the melting line differed considerably. Empirical potentials are often fitted to specific microscopic properties, for example to typical coordination of graphite and diamond structures, hence their high-temperature and high-pressure behaviour cannot be expected to accurately reflect the diverse structural properties of carbon. Nevertheless, while the macroscopic properties of the Tersoff potential differ from the experimental phase diagram considerably, we found the phase diagram of the EDIP potential to be very close to experimentally observed behaviour, accurately reflecting both the predicted graphite–diamond transition and the melting line up to relatively high pressures.

Machine learning (ML) potentials provide the state-of-the-art in descriptions of atomic interactions, opening up routes to materials discovery that are otherwise out of our reach and, to some degree, offering ab initio level accuracy at an affordable computational cost. However, the main criticism of ML potentials is that they are inherently best suited to interpolation problems, and perform reliably only in the regime of configuration space where the potential was trained. This means that their behaviour in unexplored territory, in configurational regions where the potential is forced to extrapolate from the training data, can be unrealistic or unphysical, inhibiting their use in scientific discovery. Therefore, our result showing that the GAP-20 potential performs well and predicts the expected phase transitions reliably up to 200 GPa, well outside the original training conditions, is remarkable, emphasising the power of including a diverse range of local atomic environments in the training process. Moreover, the exhaustive exploration provided by NS also highlighted local weaknesses of the model, such as the stabilisation of unexpected graphite-layer distances or the prediction of erroneous phases at very high pressures, offering areas for potential improvement and extensions of the GAP-20 model. Using these observations we have presented an enhanced version of the GAP-20U potential called the GAP-20U+gr, which includes additional ordered graphite configurations in the training set to successfully avoid graphite phases with unphysical layer spacings becoming stable under certain thermodynamic conditions. However, these improvements to the model’s description of the graphite phase did not provide a more accurate melting line, leading us to conclude that the density of the liquid phase at low pressures must also be addressed in further updates to the machine-learned potential.

Methods

Nested sampling

The NS calculations were performed as presented in ref. ⁴⁷. After the sampling has finished, we calculate the partition function and derive thermodynamic response functions to determine the phase behaviour. We use the position of peaks in the heat capacity to locate phase transitions, and calculate the phase space-weighted averages of observables (e.g., coordination number) to evaluate their finite temperature values using the following equation:

$$\langle A\rangle \approx \frac{1}{\Delta}\sum_{i} A_{i}\,(\Gamma_{i-1}-\Gamma_{i})\,e^{-\beta H_{i}}, \tag{1}$$

where Δ is the isobaric partition function; β is the inverse temperature; and A_i, H_i and Γ_i are the observable value, enthalpy and phase space volume of the i-th configuration respectively, where Γ_i = (K/(K + 1))ⁱ and K is the number of walkers in the simulation.
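Equation (1) can be evaluated directly from the ordered list of sampled enthalpies. The following is a minimal sketch, not the pymatnest analysis code. It assumes the samples are ordered by NS iteration, takes Δ as the sum of the same weights (so that any prefactor common to numerator and denominator cancels), and works with logarithms because Γ_i underflows for long runs. The function name is hypothetical.

```python
import numpy as np

K_B = 8.617333e-5  # Boltzmann constant in eV/K


def ns_average(observable, enthalpies, n_walkers, temperature):
    """Phase-space-weighted average <A> of Eq. (1) at one temperature.

    observable, enthalpies: sequences ordered by NS iteration (i = 1..n).
    n_walkers: K, so that Gamma_i = (K / (K + 1))**i.
    """
    a = np.asarray(observable, dtype=float)
    h = np.asarray(enthalpies, dtype=float)
    beta = 1.0 / (K_B * temperature)
    i = np.arange(1, len(h) + 1)
    # log(Gamma_{i-1} - Gamma_i) = (i - 1) * log(K / (K + 1)) - log(K + 1)
    log_gamma_diff = (i - 1) * np.log(n_walkers / (n_walkers + 1.0)) - np.log(
        n_walkers + 1.0
    )
    log_w = log_gamma_diff - beta * h
    w = np.exp(log_w - log_w.max())  # rescaling cancels in the ratio below
    return float(np.sum(a * w) / np.sum(w))


# Toy data (assumed): four samples with decreasing enthalpy (eV), K = 2.
toy_h = [3.0, 2.0, 1.0, 0.0]
toy_a = [10.0, 20.0, 30.0, 40.0]
low_t_average = ns_average(toy_a, toy_h, n_walkers=2, temperature=1.0)
high_t_average = ns_average(toy_a, toy_h, n_walkers=2, temperature=1.0e12)
```

At low temperature the Boltzmann factor selects the lowest-enthalpy sample, while at very high temperature the average is set by the phase space volumes alone.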

In an infinite system, the heat capacity peaks would be divergent due to a first order discontinuity in the corresponding enthalpy vs. temperature curves, but the finite size of these systems causes a broadening of the peaks. The temperature of a given transition and its error are ascertained from the combination of data from each of the three independent runs we performed at every pressure. In order to test the convergence of the simulations we fit Gaussian functions to the heat capacity peaks, and the lower and upper bounds of the error are taken to be the minimum and maximum temperature values of the peaks’ half-maximums. The simulations were run at constant pressure, and the bounding cell of variable shape and size contained 16, 32 or 64 particles (depending on the potential), in order to estimate the finite size effect. Previous calculations show that the small system size usually causes the melting temperature to be overestimated; the solid-solid transitions, however, are less affected, with sampled crystalline phases usually remaining consistent across different system sizes^38,47. We note that these results may be augmented by further calculations using standard simulation techniques (e.g. parallel tempering^62,63, coexistence simulations⁶⁴, thermodynamic integration⁶⁵) with larger system sizes, using the NS-predicted phases as a guide. For each calculation, the number of walkers, K, was chosen such that the resulting heat capacity peaks were sufficiently converged, so that predicted transition temperatures were generally within a range of 200 K (exceptions are noted). Using a larger number of walkers means a sampling of higher resolution, with the computational cost increasing linearly with K. The number of walkers used for each potential and system size is recorded in Table 1.

Initial sample configurations were generated randomly to simulate the gas phase, while subsequent samples were acquired by performing a sufficiently large number of randomly selected “moves”, referred to as the number of model calls in Table 1. These include Hamiltonian Monte Carlo (all-atom) moves; isotropic volume changes; and perturbations to the shape of the simulation cell via stretch and shear transformations, where the probability that each move occurs is given by the ratio 5:3:2:2 (atom:volume:stretch:shear)³⁸.
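The half-maximum error bounds on the heat capacity peaks described above can be illustrated with a short sketch. This is a simplification rather than the procedure used in this work: the text fits Gaussian functions to the peaks, whereas the code below locates the half-maximum temperatures by linear interpolation on a tabulated curve. It assumes a single peak on a flat baseline, and the function name is hypothetical.

```python
import numpy as np


def peak_half_maximum_bounds(temperatures, heat_capacity, baseline=0.0):
    """Return (T_peak, T_low, T_high) for the tallest peak of a C_p(T) curve.

    T_low and T_high are the temperatures at which the curve crosses half of
    its peak height above the baseline, found by linear interpolation.
    """
    t = np.asarray(temperatures, dtype=float)
    c = np.asarray(heat_capacity, dtype=float) - baseline
    k = int(np.argmax(c))
    half = 0.5 * c[k]

    lo = k  # walk left until the curve drops to or below half maximum
    while lo > 0 and c[lo] > half:
        lo -= 1
    hi = k  # walk right in the same way
    while hi < len(c) - 1 and c[hi] > half:
        hi += 1

    def crossing(j0, j1):
        # Temperature between grid points j0 and j1 where c equals half.
        return t[j0] + (half - c[j0]) * (t[j1] - t[j0]) / (c[j1] - c[j0])

    # If the curve never drops below half maximum, fall back to the grid edge.
    t_low = crossing(lo, lo + 1) if (lo < k and c[lo] <= half) else t[lo]
    t_high = crossing(hi - 1, hi) if (hi > k and c[hi] <= half) else t[hi]
    return float(t[k]), float(t_low), float(t_high)


# Synthetic test curve (assumed): a Gaussian peak centred at 4000 K.
grid = np.arange(3000.0, 5001.0, 1.0)
curve = 5.0 * np.exp(-0.5 * ((grid - 4000.0) / 100.0) ** 2)
t_peak, t_low, t_high = peak_half_maximum_bounds(grid, curve)
```

For a Gaussian of standard deviation σ the two bounds lie at ±σ√(2 ln 2) from the peak, which gives a simple check of the interpolation.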

Table 1 Summary of NS parameters used for each carbon model.

DFT calculations

In order to compare the energies of configurations and phase stability predicted by the GAP-20 and GAP-20U models, we employ density functional theory (DFT) with the same input parameters as those used to generate the data on which the GAP-20U model was trained³³. DFT calculations are therefore carried out using the Vienna ab initio Simulation Package (VASP), with the dispersion-inclusive optB88-vdW exchange-correlation functional^66,67,68,69, and the projector augmented wave (PAW) pseudopotential method (PAW_PBE C 08Apr2002)^70,71,72 with a plane-wave cutoff of 600 eV. In each case, reciprocal space is sampled using an automatically generated, Γ-centred Monkhorst-Pack mesh such that the smallest spacing between k-points is no greater than 0.2 Å⁻¹, and energy levels are smeared by Gaussian distributions with widths of 0.1 eV.
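To make the k-point criterion concrete, the sketch below derives a mesh from a maximum spacing. It illustrates the general idea only and does not reproduce VASP's internals. It assumes that reciprocal lattice vectors include the factor of 2π and that each subdivision is the smallest integer that keeps the spacing at or below the target. The function name and the graphite cell parameters (a = 2.46 Å, c = 6.7 Å) are illustrative assumptions.

```python
import math

import numpy as np


def kpoint_mesh(cell, max_spacing):
    """Subdivisions (N1, N2, N3) such that |b_i| / N_i <= max_spacing.

    cell: 3x3 array whose rows are the real-space lattice vectors (Å).
    max_spacing: largest allowed k-point spacing (1/Å), 2*pi included in b_i.
    """
    cell = np.asarray(cell, dtype=float)
    reciprocal = 2.0 * np.pi * np.linalg.inv(cell).T  # rows are b_1, b_2, b_3
    lengths = np.linalg.norm(reciprocal, axis=1)
    return tuple(max(1, math.ceil(length / max_spacing)) for length in lengths)


# Hexagonal graphite cell with assumed illustrative lattice parameters.
a, c = 2.46, 6.7
graphite_cell = [
    [a, 0.0, 0.0],
    [-0.5 * a, 0.5 * math.sqrt(3.0) * a, 0.0],
    [0.0, 0.0, c],
]
graphite_mesh = kpoint_mesh(graphite_cell, max_spacing=0.2)
```

Because the subdivisions follow the reciprocal lattice vector lengths, the long c axis of graphite needs far fewer k-points than the in-plane directions.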

Data availability

A vertical slice of the data used to generate the results found in the current work, as well as the extended ML potential, GAP-20U+gr, and its corresponding dataset are available at https://doi.org/10.5281/zenodo.7463706.

Code availability

A parallel implementation of the NS algorithm is available in the pymatnest Python software package⁷³, using the LAMMPS package⁷⁴ for the dynamics (the pymatnest input files that were used to perform the NS calculations in this work are available at https://doi.org/10.5281/zenodo.7463706).

References

Kroto, H. W., Heath, J. R., O’Brien, S. C., Curl, R. F. & Smalley, R. E. C60: Buckminsterfullerene. Nature 318, 162–163 (1985).
Novoselov, K. S. et al. Electric field effect in atomically thin carbon films. Science 306, 666–669 (2004).
Martinez-Canales, M. & Pickard, C. J. Thermodynamically stable phases of carbon at multiterapascal pressures. Phys. Rev. Lett. 108, 045704 (2012).
Powles, R. C., Marks, N. A. & Lau, D. W. M. Self-assembly of sp2-bonded carbon nanostructures from amorphous precursors. Phys. Rev. B 79, 075430 (2009).
Tománek, D. Guide through the Nanocarbon Jungle: Buckyballs, nanotubes, graphene and beyond (Morgan & Claypool Publishers, 2014).
Shang, Y. et al. Ultrahard bulk amorphous carbon from collapsed fullerene. Nature 599, 599–604 (2021).
Ugarte, D. Curling and closure of graphitic networks under electron-beam irradiation. Nature 359, 707–709 (1992).
Takagi, M. & Maeda, S. Global search for crystal structures of carbon under high pressure. ACS Omega 5, 18142–18147 (2020).
Hoffmann, R., Kabanov, A. A., Golov, A. A. & Proserpio, D. M. Homo citans and carbon allotropes: for an ethics of citation. Angew. Chem. Int. Ed. 55, 10962–10976 (2016).
Zhang, W. et al. Recent development of carbon electrode materials and their bioanalytical and environmental applications. Chem. Soc. Rev. 45, 715–752 (2016).
Bonaccorso, F., Sun, Z., Hasan, T. & Ferrari, A. C. Graphene photonics and optoelectronics. Nat. Photonics 4, 611–622 (2010).
Knudson, M., Desjarlais, M. & Dolan, D. Shock-wave exploration of the high-pressure phases of carbon. Science 322, 1822–1825 (2008).
Sundqvist, B. Carbon under pressure. Phys. Rep. 909, 1–73 (2021).
Correa, A. A., Bonev, S. A. & Galli, G. Carbon under extreme conditions: phase boundaries and electronic properties from first-principles theory. Proc. Natl Acad. Sci. USA 103, 1204–1208 (2006).
Tersoff, J. Modeling solid-state chemistry: interatomic potentials for multicomponent systems. Phys. Rev. B 39, 5566–5568 (1989).
de Tomas, C., Suarez-Martinez, I. & Marks, N. A. Graphitization of amorphous carbons: a comparative study of interatomic potentials. Carbon 109, 681–693 (2016).
Mahon, P., Pailthorpe, B. & Bacskay, G. A quantum mechanical calculation of interatomic interactions in diamond. Philos. Mag. B 63, 1419–1430 (1991).
Marks, N., McKenzie, D. R. & Pailthorpe, B. A. Molecular-dynamics study of compressive stress generation. Phys. Rev. B 53, 4117 (1996).
Brenner, D. Empirical potential for hydrocarbons for use in simulating the chemical vapor deposition of diamond films. Phys. Rev. B 42, 9458 (1990).
Brenner, D. W. et al. A second-generation reactive empirical bond order (rebo) potential energy expression for hydrocarbons. J. Phys.: Cond. Mat. 14, 783 (2002).
Stuart, S. J., Tutein, A. B. & Harrison, J. A. A reactive potential for hydrocarbons with intermolecular interactions. J. Chem. Phys. 112, 6472–6486 (2000).
Marks, N. A. Generalizing the environment-dependent interaction potential for carbon. Phys. Rev. B 63, 035401 (2000).
de Tomas, C. et al. Transferability in interatomic potentials for carbon. Carbon 155, 624–634 (2019).
Karasulu, B., Leyssale, J.-M., Rowe, P., Weber, C. & de Tomas, C. Accelerating the prediction of large carbon clusters via structure search: evaluation of machine-learning and classical potentials. Carbon 191, 255–266 (2022).
Los, J. H. & Fasolino, A. Intrinsic long-range bond-order potential for carbon: performance in Monte Carlo simulations of graphitization. Phys. Rev. B 68, 024107 (2003).
Ghiringhelli, L. M., Los, J. H., Meijer, E. J., Fasolino, A. & Frenkel, D. Modeling the phase diagram of carbon. Phys. Rev. Lett. 94, 145701 (2005).
van Duin, A. C. T., Dasgupta, S., Lorant, F. & Goddard, W. A. ReaxFF: a reactive force field for hydrocarbons. J. Phys. Chem. A 105, 9396–9409 (2001).
Srinivasan, S., van Duin, A. T. & Ganesh, P. Development of a reaxff potential for carbon condensed phases and its application to the thermal fragmentation of a large fullerene. J. Phys. Chem. 119, 571–580 (2015).
Bartók, A. P., Payne, M. C., Kondor, R. & Csányi, G. Gaussian approximation potentials: the accuracy of quantum mechanics, without the electrons. Phys. Rev. Lett. 104, 136403 (2010).
Bartók, A. P. & Kondor, R. On representing chemical environments. Phys. Rev. B 87, 184115 (2013).
Deringer, V. L. & Csányi, G. Machine learning based interatomic potential for amorphous carbon. Phys. Rev. B 95, 094203 (2017).
Rowe, P., Deringer, V. L., Gasparotto, P., Csányi, G. & Michaelides, A. An accurate and transferable machine learning potential for carbon. J. Chem. Phys. 153, 034702 (2020).
Rowe, P., Deringer, V. L., Gasparotto, P., Csányi, G. & Michaelides, A. Erratum: “an accurate and transferable machine learning potential for carbon”. J. Chem. Phys. 156, 159901 (2022).
Muhli, H. et al. Machine learning force fields based on local parametrization of dispersion interactions: application to the phase diagram of C 60. Phys. Rev. B 104, 054106 (2021).
Wang, Y., Fan, Z., Qian, P., Ala-Nissila, T. & Caro, M. A. Structure and pore size distribution in nanoporous carbon. Chem. Mater. 34, 617–628 (2022).
Wang, J. et al. A deep learning interatomic potential developed for atomistic simulation of carbon materials. Carbon 186, 1–8 (2022).
Qamar, M., Mrovec, M., Lysogorskiy, Y., Bochkarev, A. & Drautz, R. Atomic cluster expansion for quantum-accurate large-scale simulations of carbon. J. Chem. Theory Comput. https://doi.org/10.1021/acs.jctc.2c01149 (2022).
Pártay, L. B., Csányi, G. & Bernstein, N. Nested sampling for materials. Eur. Phys. J. B 94, 159 (2021).
Ashton, G. et al. Nested Sampling for physical scientists. Nat. Rev. Methods Prim. 2, 39 (2022).
Skilling, J. Bayesian inference and maximum entropy methods in science and engineering. AIP Conf. Proc. 735, 395 (2004).
Skilling, J. Nested sampling for general bayesian computation. Bayesian Anal. 1, 833–859 (2006).
Pártay, L. B., Bartók, A. P. & Csányi, G. Efficient sampling of atomic configurational spaces. J. Phys. Chem. B 114, 10502–10512 (2010).
Rossi, K., Pártay, L. B., Csányi, G. & Baletto, F. Thermodynamics of cupt nanoalloys. Sci. Rep. 8, 9150 (2018).
Dorrell, J. & Pártay, L. B. Thermodynamics and the potential energy landscape: case study of small water clusters. Phys. Chem. Chem. Phys. 21, 7305–7312 (2019).
Szekeres, B., Pártay, L. B. & Mátyus, E. Direct computation of the quantum partition function by path-integral nested sampling. J. Chem. Theory Comput. 14, 4353–4359 (2018).
Bolhuis, P. G. & Csányi, G. Nested transition path sampling. Phys. Rev. Lett. 120, 250601 (2018).
Baldock, R. J. N., Pártay, L. B., Bartók, A. P., Payne, M. C. & Csányi, G. Determining pressure-temperature phase diagrams of materials. Phys. Rev. B 93, 174108 (2016).
Baldock, R. J. N., Bernstein, N., Salerno, K. M., Pártay, L. B. & Csányi, G. Constant-pressure nested sampling with atomistic dynamics. Phys. Rev. E 96, 43311–43324 (2017).
Dorrell, J. & Pártay, L. B. Pressure-temperature phase diagram of lithium, predicted by embedded atom model potentials. J. Phys. Chem. B 124, 6015–6023 (2020).
Gola, A. & Pastewka, L. Embedded atom method potential for studying mechanical properties of binary cu-au alloys. Model. Simul. Mater. Sci. Eng. 26, 055006 (2018).
Rosenbrock, C. W. et al. Machine-learned interatomic potentials for alloys and alloy phase diagrams. npj Comp. Mat. 7, 24 (2021).
Bartók, A. P., Hantal, G. & Pártay, L. B. Insight into liquid polymorphism from the complex phase behavior of a simple model. Phys. Rev. Lett. 127, 015701 (2021).
Lindsay, L. & Broido, D. Optimized tersoff and brenner empirical potential parameters for lattice dynamics and phonon thermal transport in carbon nanotubes and graphene. Phys. Rev. B 81, 205441 (2010).
Sha, Z., Branicio, P., Pei, Q., Sorkin, V. & Zhang, Y. A modified tersoff potential for pure and hydrogenated diamond-like carbon. Comp. Mat. Sci. 67, 146–150 (2013).
Bundy, F. Pressure-temperature phase diagram of elemental carbon. Phys. A 156, 169–178 (1989).
Bundy, F. et al. The pressure-temperature phase and transformation diagram for carbon; updated through 1994. Carbon 34, 141–153 (1996).
Steinbeck, J., Braunstein, G., Dresselhaus, M., Venkatesan, T. & Jacobson, D. A model for pulsed laser melting of graphite. J. Appl. Phys. 58, 4374–4382 (1985).
Cançado, L. et al. Measuring the degree of stacking order in graphite by Raman spectroscopy. Carbon 46, 272–275 (2008).
Telling, R. H., Ewels, C. P., El-Barbary, A. A. & Heggie, M. I. Wigner defects bridge the graphite gap. Nat. Mater. 2, 333–337 (2003).
Lynch, R. & Drickamer, H. Effect of high pressure on the lattice parameters of diamond, graphite, and hexagonal boron nitride. J. Chem. Phys. 44, 181–184 (1966).
Steinhardt, P. J., Nelson, D. R. & Ronchetti, M. Bond-orientational order in liquids and glasses. Phys. Rev. B 28, 784 (1983).
Frantz, D. D., Freemann, D. L. & Doll, J. D. Reducing quasi-ergodic behavior in monte carlo simulations by j-walking: Applications to atomic clusters. J. Chem. Phys. 93, 2769–2784 (1990).
Swendsen, R. H. & Wang, J. S. Replica monte carlo simulation of spin-glasses. Phys. Rev. Lett. 57, 2607–2609 (1986).
Morris, J. R., Wang, C., Ho, K. & Chan, C. T. Melting line of aluminum from simulations of coexisting phases. Phys. Rev. B 49, 3109 (1994).
Frenkel, D. & Ladd, A. J. New monte carlo method to compute the free energy of arbitrary solids. application to the fcc and hcp phases of hard spheres. J. Chem. Phys. 81, 3188–3193 (1984).
Lee, K., Murray, É. D., Kong, L., Lundqvist, B. I. & Langreth, D. C. Higher-accuracy van der waals density functional. Phys. Rev. B 82, 081101 (2010).
Klimeš, J., Bowler, D. R. & Michaelides, A. Van der waals density functionals applied to solids. Phys. Rev. B 83, 195131 (2011).
Dion, M., Rydberg, H., Schröder, E., Langreth, D. C. & Lundqvist, B. I. Van der waals density functional for general geometries. Phys. Rev. Lett. 92, 246401 (2004).
Klimeš, J., Bowler, D. R. & Michaelides, A. Chemical accuracy for the van der waals density functional. J. Phys. -Condens. Mat. 22, 022201 (2009).
Kresse, G. & Furthmüller, J. Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. Phys. Rev. B 54, 11169 (1996).
Kresse, G. & Furthmüller, J. Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set. Comp. Mat. Sci. 6, 15–50 (1996).
Kresse, G. & Joubert, D. From ultrasoft pseudopotentials to the projector augmented-wave method. Phys. Rev. B 59, 1758 (1999).
Bernstein, N. et al. pymatnest. https://github.com/libAtoms/pymatnest (2016).
Plimpton, S. Fast parallel algorithms for short-range molecular dynamics. J. Comput. Phys. 117, 1–19 (1995).
Kerley, G. I. and Chhabildas, L. Multicomponent-Multiphase Equation of State for Carbon, Technical Report (Sandia National Laboratory, 2001).

Acknowledgements

The authors thank Nigel Marks for providing access to the carbon EDIP. The authors also thank Albert P. Bartók, Gábor Csányi and Volker Deringer for useful discussions around the performance of the GAP-20 model. L.B.P. and B.K. acknowledge support from the EPSRC through the individual Early Career Fellowships (LBP: EP/T000163/1 and BK: EP/T026138/1). M.A.C. acknowledges personal funding from the Academy of Finland, under project #330488. Computing facilities were provided by the Scientific Computing Research Technology Platform of the University of Warwick. Calculations using the GAP potential were performed using the Sulis Tier 2 HPC platform hosted by the Scientific Computing Research Technology Platform at the University of Warwick. Sulis is funded by EPSRC Grant EP/T022108/1 and the HPC Midlands+ consortium.

Author information

Authors and Affiliations

Department of Chemistry, University of Warwick, Coventry, CV4 7AL, UK
George A. Marchant, Bora Karasulu & Livia B. Pártay
Department of Chemistry and Materials Science, Aalto University, 02150, Espoo, Finland
Miguel A. Caro

Authors

George A. Marchant
Miguel A. Caro
Bora Karasulu
Livia B. Pártay

Contributions

G.A.M.: Generation and post-analysis of nested sampling and DFT data; re-fitting of GAP model; draft writing and figure creation. M.A.C.: Guidance on re-fitting of GAP model, including support with generation of DFT data that was consistent with GAP-20U dataset; and draft feedback. B.K.: Guidance on DFT calculations; and draft feedback. L.B.P: Supervision of nested sampling calculations and post-analysis; draft writing and figure creation. All authors contributed to discussion of results.

Corresponding authors

Correspondence to George A. Marchant or Livia B. Pártay.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplemental Material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Marchant, G.A., Caro, M.A., Karasulu, B. et al. Exploring the configuration space of elemental carbon with empirical and machine learned interatomic potentials. npj Comput Mater 9, 131 (2023). https://doi.org/10.1038/s41524-023-01081-w

Received: 23 January 2023
Accepted: 02 July 2023
Published: 27 July 2023
DOI: https://doi.org/10.1038/s41524-023-01081-w
Springer Nature Limited

Introduction