High-throughput and data-driven machine learning techniques for discovering high-entropy alloys

Zhichao, Lu; Dong, Ma; **ongjun, Liu; Lu, Zhao**

doi:10.1038/s43246-024-00487-3

High-throughput and data-driven machine learning techniques for discovering high-entropy alloys

Review Article
Open access
Published: 17 May 2024

Volume 5, article number 76, (2024)
Cite this article

Download PDF

You have full access to this open access article

Communications Materials

High-throughput and data-driven machine learning techniques for discovering high-entropy alloys

Download PDF

1451 Accesses
Explore all metrics

Abstract

High-entropy alloys (HEAs) have attracted extensive attention in recent decades due to their unique chemical, physical, and mechanical properties. An in-depth understanding of the structure–property relationship in HEAs is the key to the discovery and design of new compositions with desirable properties. Related to this, materials genome strategy has been increasingly used for discovering new HEAs with better performance. This review paper provides an overview of key advances in this fast-growing area, along with current challenges and potential opportunities for HEAs. We also discuss related topics, such as high-throughput preparation, characterization, and computation of HEAs, and data-driven machine learning for accelerating alloy development. Finally, future research directions and perspectives for the materials genome-assisted design of HEAs are proposed and discussed.

Machine Learning for High-Entropy Alloys

High Entropy Alloys Mined From Binary Phase Diagrams

Article Open access 29 October 2019

Emergence of machine learning in the development of high entropy alloy and their prospects in advanced engineering applications

Article Open access 09 July 2021

Introduction

High-entropy alloys (HEAs), also called multi-principal element alloys^1,2,3, are chemically disordered but topologically ordered with the formation of random solid-solution (SS) structures, such as face-centered cubic (FCC), body-centered cubic (BCC), or hexagonal-close-packed (HCP). Understanding the composition–structure–properties relationship has long been a topic of great interest in HEAs. Thus, extensive studies have been carried out on various HEAs, and many attractive properties have been achieved in the last two decades. These properties include good plasticity, high strength and hardness, outstanding high-temperature-softening resistance, and unique electrical and magnetic properties. In the past few years, besides metallic systems, high entropy materials have expanded to ceramics made of carbides, borides, or nitrides of IV and V group transition metals, which have remarkable properties^4,5,6. Due to these unique properties and large composition space, high entropy materials have promising potential applications under extreme conditions, such as, in high-temperature structural components, corrosion-resistant parts, coatings, and nuclear materials⁷.

However, with regard to the property-oriented designs of HEAs, some challenges remain to be solved. (1) Owing to the chemically disordered structure, HEAs are not necessarily equimolar compositions; that is, many potential elements in the periodic table can conceivably be incorporated into HEAs via microalloying or principal element substitution. Therefore, an essentially infinite number of HEAs are available. Since the compositions of HEAs can be continuously adjustable, the properties of interest can be optimized. Conceptually, this poses a serious challenge—How can potential HEAs with properties of interest be fine-tuned efficiently in such a large composition space rather than in a conventional “trial and error” manner⁸? (2) Coupled with the fact that fully understanding the complicated interplay between constituents and properties is a prerequisite when designing new HEAs, How can the intrinsic relationship in a vast and complex database be uncovered? To date, inspired by the Materials Genome Initiative (MGI), high-throughput techniques (preparation, characterization, and calculation) and the data-driven machine learning (ML) method have been adopted by synergistically combining experiment, theory, and computation in a tightly integrated and high-throughput manner, and to predict and optimize HEAs at an unparalleled scale and in an effective way ⁹. These tools can be used to screen extensive composition space for a desired property and simultaneously pinpoint specific alloys with the desired properties. Specifically, high-throughput techniques are able to bridge the gap between experiments and ML modeling; that is, high-throughput approaches can provide valuable materials information for the following ML, and vice versa, ML can provide intelligent feedback to the experiments^10,11,12. Through continuing efforts to integrate experiment, computation, and data-driven ML, the underlying structure–property relationships to the materials genome can be revealed and thus seed a new generation of advanced HEAs¹³.

This review aims to present a brief state-of-the-art overview of the materials genome strategy (MGS) applied in HEAs and provide a timely focus on key developments, including challenges and opportunities, in this interdisciplinary area. Specifically, we will give a brief introduction to the development of HEAs and the application of MGI in this field. Additionally, some challenges will also be listed in a brief manner in “Introduction”. In section “High-throughput preparation and characterization of HEAs”, the main high-throughput preparation and characterization techniques for HEAs will be discussed in detailed and critical issues needed to be solved will also be proposed. In section “High-throughput computing for HEAs”, we will present and discuss applications of high-throughput computation method in accelerating the development of HEAs. An in-depth discussion about data-driven ML strategy for HEAs will be provided in section “Data-driven machine learning strategies “. Finally, in “Outlook” section, we will give an outlook of potential research activities to be exploited and main scientific challenges to be addressed in the future. The core purpose underlying the brief review is to provide an important opportunity to advance the understanding of MGS employed in HEAs and to offer researchers a platform to foster new ideas.

High-throughput preparation and characterization of HEAs

The design of HEAs poses a significant challenge when exploring the phase structure and desirable properties through the vast potential multicomponent compositional space available¹⁴. As such, unconventional high-throughput preparation techniques are crucially important, particularly for effectively narrowing down the alloys in a wide composition space. Among these, HEAs exploit a variety of preparation techniques, such as, combinatorial thin film deposition, laser additive manufacturing (LAM), rapid alloying prototype, diffusion multiples, and those based on welding. In what follows, we will give an overview of the different high-throughput techniques that were used to prepare multi-component HEAs and point out some critical issues that needed to be resolved.

High-throughput preparation techniques for HEAs

LAM

Combinatorial LAM endows the process with both high heating and cooling rates, and has been used as an efficient method for the synthesis of HEAs. Among various LAM methods, laser metal deposition (LMD) is the preferred technique used to make HEA combinational libraries. During the LMD process, the feedstock nozzles convey the raw material powder to a rapidly moving melt pool formed by a laser through an inert gas flow. Apparently, LMD is more suitable for high-throughput synthesis owing to the advantage of its real-time and variable feeding system, which applies two or more hoppers with different powder feeders to permit changes in the deposited powder compositions^{15,16,17,18,19,20,21}.

Combinatorial laser deposition of compositionally graded complex alloys has been regarded as an attractive approach for assessing the composition–microstructure–property relationships of HEAs. LMD is quite capable of synthesizing refractory HEAs that are difficult to make¹⁹. Melia et al. prepared a MoNbTaW alloy system by additive manufacturing with commercial refractory elemental powders, which have good spherical morphology, leveraging the additive manufacturing process and mechanical testing to enable rapid alloy exploration, as shown in Fig. 1. In the steady state, there was an evident linear spatial trend in the composition and a significantly variation of hardness, with composition dominated by solution strengthening (Fig. 1d)¹⁹. Compared to other mechanical properties (i.e., strength, plasticity, toughness, etc.), hardness is the simplest one that can be obtained effectively by mechanical testing automatically in areas with different compositions of small samples. In view of the hardness–strength relationship (\({H}_{v}\;\approx\;\frac{3{\sigma }_{y}({MPa})}{9.81}\))²², hardness allows for indirect and efficient evaluations of mechanical properties.

**Fig. 1: Analysis of the additive manufacturing processed (MoTaW)_x(Nb)_1−x compositionally graded part cross-section.**

Borkar et al. studied the compositionally graded Al_xCrCuFeNi₂ (0 <x < 1.5) HEAs produced by laser deposition from a blend of elemental powders, using a double powder feeder with two hoppers containing CrCuFeNi₂ and Al₂CrCuFeNi₂ powders, respectively. The sample of a cylindrical geometry was deposited with a smooth change of alloy composition in height¹⁵. Additionally, an identical laser deposition processing method, laser-engineered net sha** (LENS), was also applied to construct the compositional and microstructural libraries of Al_xCoCrFeNi in a high-throughput manner¹⁸. The discrepancy between LENS and the above-mentioned case was that the substrate (CoCrFeNi plate) for LENS was priorly made by an arc-melting and copper mold-casting method, while in Borkar’s work, a blend of powders of a nominal composition of CrCuFeNi₂ was used. During the LENS process, the laser power and moving speed remained unchanged, and the feeding rate of Al powder for each monolayer patch increased in certain increments. The entire deposition process includes the addition of Al and two subsequent remelting processes perpendicular to the deposition direction, to improve the mixing and compositional homogeneity of the alloyed region¹⁸.

In fact, the design and parameter adjustment of the LMD process has an important effect on sample preparation. For example, the substrate greatly influences the composition and microstructure of the deposited alloys, which can be improved by increasing the stack thickness or a reasonable experimental design. The former will not only increase the preparation cost, but will also affect the microstructure uniformity. Selecting the main component of the alloy as the substrate material, depositing the sample in the thickness direction with less affection for the substrate, and a controlled composition gradient could form a reasonable experimental design^17,23.

Combinatorial deposition of thin film materials libraries

Combinatorial thin film synthesis by sputtering using multiple deposition sources is a state-of-the-art route for constructing of materials libraries that are composed of a wide range of gradually changed alloy compositions^24,25. Continuous preparation of multiple gradients can be achieved by adjusting the processing parameters, such as the compositions of the targets, the power and angle of each gun, and the material and rotation of the substrate. For HEAs, several approaches based on sputtering have been employed for alloy design by tweaking these parameters^25,26,27,44. Shukla et al. performed micro-hardness tests along four depth levels of a sample from the top surface, and each indent point was also 0.5 mm apart along the alloying path so that the systemic hardness and moduli were obtained. It is found that an increase in moduli and hardness values can be attributed to solute–matrix interaction. The as-cast ε phase-dominant microstructure showed ∼153 GPa moduli, while the same for a completely γ microstructure with supersaturated Cu content reached up to ∼224 GPa⁵². In measuring nano hardness and modulus of multicomponent samples, the setting route of nanoindentation is usually along the gradient direction of a specific alloying element, and the discrete points and micro-hardness can be analyzed for multiscale observation⁷⁰. According to the hardness–strength relationship, one can efficiently select the potential HEA candidates with the desired mechanical properties, such as higher yield strength in a relatively large composition space. Although there may exist discrepancies between the high-throughput made and bulk samples for the absolute value of mechanical properties, the variation trend of the composition and properties of interest can still provide effective information for the design of new HEAs.

Due to the small scale of the HEAs prepared by the above-mentioned high-throughput methods (i.e., addictive manufacturing, sputtering, etc.), it is usually difficult to cut bulk specimens from the thin layers or coatings. The SPT is an evolving small specimen test technique with the potential to extract the mechanical properties (ductility, elastic modulus, yield strength, ultimate tensile strength, fracture toughness, etc.) from small-volume HEA specimens prepared by high-throughput methods^67,69. It should be noted here that a prerequisite for using this test is to establish correlations between SPT and conventional mechanical tests such as tensile testing for HEAs in priori. However, the SPT response is easily influenced by different test parameters, that is, for specimen shapes and thickness, test speed, ball diameter, and so on. It is therefore imperative to understand the effects of these parameters. This necessitates the optimization of test parameters to obtain nearly unique SPT responses, at least for a class of HEA materials. Thus, it is necessary to relate the conventional and SPT results by empirical and analytical relations.

Additionally, the cooling rates of commonly used addictive manufacturing and sputtering high-throughput methods are much higher than those of traditional casting methods used for the preparation of bulk HEAs. Notably, in some extreme cases, owing in part to the multi-principle nature of HEAs, the HEA coatings or layers via high-throughput methods can form amorphous structures, which make the mechanical properties quite different from the bulk HEAs. Thus, the optimization of preparation parameters to make the cooling rate agree with the casting method is significant for the formation of HEAs. In sum, although there are some discrepancies between thin and bulk HEA materials, the SPT methods can at least determine the mechanical properties and guide the researchers to develop better HEAs in a such large composition space.

Physical properties

As one of the typical physical properties, the magnetic properties of HEAs depend heavily on the size, microstructure, and preparation process of the sample. Many efforts have been made to measure and map magnetic properties at very high spatial resolution. Borkar et al. presented a new combinatorial approach, based on laser additive deposition of compositionally graded alloys, for rapid assessment of the composition–microstructure–magnetic relationships in Al_xCrCuFeNi₂ alloys (0<x < 1.5 at.%) HEAs. Along the same alloy gradient, the microstructures are FCC solid solution, FCC/L12, mixed FCC/L12 + BCC/B2, and finally predominantly BCC/B2 with increasing Al content. Owing to the change of microstructures, the low Al-containing FCC/L12 regions are weakly ferromagnetic, while the BCC/B2 regions with higher Al contents are strongly ferromagnetic, exhibiting lower coercivity and higher saturation magnetization¹⁵. For the FeMnCoCrAl HEA system, Marshal et al. developed thin-film libraries for the combinatorial evaluation of the phase formation and magnetic properties combined with spatially resolved atom probe tomography and DFT simulation. It was found that the addition of Al can promote the formation of BCC structure, which exhibits soft ferromagnetic behavior. A further increase in the non-ferromagnetic Al content beyond 8 wt% decreased the overall saturation magnetization due to the substitution of ferromagnetic species by paramagnetic Al and lattice distortions, which was in agreement with DFT predictions³². As can be seen in these cases, high-throughput techniques are efficient in explaining the microalloying effects on the magnetic properties of HEAs and therefore have great potential for the future designs of soft magnetic HEAs with better performance. However, it should also be noted that the size effect and magnetocrystalline anisotropy caused by thin film may lead to some artifacts, which can be eliminated by increasing the thickness of as-prepared film/layer libraries or changing the measurement direction when performing magnetic testing.

Besides saturation magnetization, studies of high-throughput techniques for other physical properties of HEAs are rather sparse. However, when expanding the scope to other materials synthesized via high-throughput techniques, there are different physical properties of interest. For example, useful combinatorial methods for examining magnetic properties include magnetic force microscopy and scanning magneto-optical Kerr effect imaging⁴⁰. In addition, the Decay microwave probe microscope, with very high micro-region resolution, can measure magnetic properties, including susceptibility and spin resonance. Combined with automatic sample table control and data acquisition, it is possible to realize a high-throughput automatic electromagnetic measurement of the composite material chips^71,89. There are different software developed based on the CALPHAD approach, one of the typical commercial software is Thermo-Calc, which includes high-throughput modules such as TC-Python. Thermo-Calc users run batch calculations for many varied parameters in a high-throughput manner. Many attempts have been made to develop thermodynamic modeling in a variety of different alloy systems using the high-throughput CALPHAD method, including phase diagrams and thermodynamic properties^{90,91,92,93,94,95}. Due to the limitations of the empirical VEC rule in different HEA systems, Zhong et al. recently proposed a data screening procedure to develop new HEAs via a high-throughput CALPHAD approach (as shown in Fig. 7)⁹⁴ and found the relationship between phase formation behavior and VEC. Additionally, Zhang et al.⁹⁰ reported a sufficiently large database of the Al–Co–Cr–Cu–Fe–Ni HEA system to calculate the primary solidification phase. Klaver et al.⁹³ used the Thermo-Calc to determine the phase evolution behavior of AlCrMnMoTi, AlCrMoNbTiV, AlCrMnNbTiV, and AlCrFeTiV alloys at different temperatures and found that AlCrMnNbTiV and AlCrMoNbTiV were better HEA formers. Gurao and Biswas⁹¹ studied 1287 equiatomic quinary alloys using the CALPHAD method to find single-phase FCC and BCC HEAs. According to their calculation results, they achieved the optimized alloy composition just by preparing two FCC alloys and seven BCC alloys, which dramatically increased the efficiency of alloy designing. In particular, CALPHAD can predict the phase diagram under extreme conditions, such as high temperatures and high pressures, which are difficult to explore for experimental studies.

**Fig. 7: The schematic of discovering HEAs with a high-throughput CALPHAD approach.**

As a newly emerging technology, HTC still faces critical challenges. First, most integrated calculation programs currently available are based on first-principles calculations; thus the material data are obtained from a few to dozens of atoms, which requires develo** the HTC further on a larger scale. In this regard, combining ML and first principles to develop high-precision potential functions for MD simulations is a significant trial⁹⁶. Second, the classification of the accumulated materials data is still vague, making it difficult to maintain a materials database in the future. It should be clearly divided according to an authoritative materials classification system. In addition, the data format should be strictly followed in the acquisition process. In terms of an in-depth understanding of HEAs, due to the multi-principal elements contained in HEAs and the metastable state in thermodynamics, there is an urgent need to develop a reliable thermodynamics database that contains a series of composition, temperature, and phase-equilibrium data for HEA systems. In this regard, the related binary and ternary systems should be gathered and assessed by implementing experiments and calculations on HEA systems.

Data-driven ML strategies

The enormous composition space for designing HEAs offers not only opportunities but also great challenges, requiring intelligent and efficient strategies for materials discovery. As a burgeoning branch of materials science, data-driven methods, such as ML, which are used to study a wealth of existing experimental and computational data, have become a very exciting area of research in materials science. ML refers to programs that automatically improve their ability to perform tasks by learning from experience in many scenarios. This automates the time-consuming knowledge acquisition process, which is essential to speed up computing and reduce the cost of develo** data-based systems. With ML, when given enough data and a rule-discovery algorithm, computers can analyze the trends in datasets and further help one to understand the relationships between properties and different parameters, which is beneficial in guiding materials modeling. ML is most useful in situations in which human learning is impossible, such as when data and interactions within the data are too complicated and intractable for human understanding and conceptualization⁹⁷.

Datasets for HEAs

The first and most important step in ML is to generate robust datasets for training the ML model. The selection of suitable data can be deceptive in ML, which is why so much emphasis is placed on the visualization of the datasets⁹⁸. The construction of a dataset is task-oriented; that is, the final prediction plays a decisive role in what type of data should be collected.

The study of ML in HEAs mainly focuses on the formation of single-phase solid solutions (i.e., BCC, FCC, and HCP), while some work has been carried out on mechanical properties such as hardness and modulus. Compared to traditional metallic materials, HEAs are newcomers that have been studied for only nearly two decades. To date, most HEA data have been collected from published experimental work or simulation methods. Miracle and Senkov’s review summarizes a dataset containing 648 entries of HEAs in different systems¹⁴. Based on this dataset, Zhuang et al. constructed a dataset composed of 401 HEAs, which consists of 174 SS phases, 54 intermetallics (IM), and 173 SS + IM phases, by removing some multiple alloys with the same composition⁹⁹. Later, in 2020, Gao et al. built a dataset consisting of 1252 samples—625 single-phase and 627 multi-phase alloys—covering binaries and multi-component systems¹⁰⁰. Besides experimental data, computational methods, such as high-throughput ab initio and DFT-based approaches, are used alternatively to produce phase formation information. Curtarolo et al. developed a high-throughput ab initio method called LTVC (Lederer–Toher–Vecchio–Curtarolo) to predict the transition temperature of multi-component systems⁸⁸. In this way, a dataset containing a total of 1798 unique equiatomic compositions was constructed, consisting of 117 binaries, 441 ternaries, 1110 quaternaries, and 130 quinaries. Based on this dataset, Vecchio et al. built a data-driven workflow for predicting the composition–phase–structure relationship¹⁰¹.

Besides the phase formation data, there are property datasets of HEAs. Using the integrated CALPHAD-ML approach, Sun and Lu et al. predicted the hardness of Ti–Zr–Nb–Ta refractory HEA, which included building a database of 100 quaternary alloys, training the ML model, hardness prediction, and experimental verification¹⁰². A database composed of alloy composition and hardness data for the Ti–Zr–Nb–Ta RHEAs was established by combining CALPHAD. To search for high-entropy ceramics, Vecchio et al. performed an ML framework on 56 previously reported entropy-formation ability values, including nine synthesized compositions, six single phase, and three multi-phases. The high-entropy ceramics in the dataset are mainly composed of eight carbide-forming metal elements (Hf, Nb, Ta, Ti, Mo, V, W, and Zr)¹⁰³. Regarding the modulus, Chen et al. combined first principles and ML to predict the elasticity of severely lattice-distorted HEAs with experimental validation. The ML models were trained on 6826 ordered inorganic compounds from the Materials Project database to predict the Voigt–Reuss–Hill averages of bulk and shear modulus with log-normalization¹⁰⁴. In the case of experimental data for modulus, Roy et al. compiled Young’s modulus consisting of only 87 HEA entries from limited available experimental reports¹⁰⁵. All the above-mentioned datasets are summarized in Table 1.

Table 1 Datasets of phase and mechanical properties for HEAs

Full size table

Despite substantial progress in the construction of datasets for HEAs, the data size improvement is still far from complete. As a result, the results of calculations and predictions based on these databases may deviate significantly from the experimental results. Moreover, when reporting their findings, researchers tend to publish only favorable data, while the bad data points are often dropped. This will lead to the dataset being unbalanced and will affect subsequent ML models’ performance. Therefore, there is an urgent need to develop reliable and robust databases dedicated to HEAs. As such, high-throughput preparation and characterization, as well as HTC, would be a reliable approach to batch production of HEA libraries, including composition and property information.

Phase formation prediction

As a new paradigm for develo** HEAs, the data-to-knowledge ML strategy has the potential to explore complex structures and property space in an efficient way. Additionally, it can also yield valuable insights into the key factors that determine macro-performance and thus guide the design of HEAs with enhanced properties. As mentioned above, ML in the field of HEAs relies on the availability of libraries of compositions, structures, and properties that have been assembled and scrutinized by experimental and computational methods. Considering the different data sizes, phase formation behaviors (i.e., single solid solution formation for HEAs) have attracted much attention from the academic community^{105,106,107,108,109,110,111,112,113,114,115,116,117,118,119,120}. In addition, there are increasing studies on the physical or mechanical properties of HEAs. From the perspective of ML, the two cases above correspond to classification and regression issues, respectively. As such, in this section, we will review ML techniques and propose the possibility of further development of ML in HEAs.

Phase formation behavior is crucial to the performance of HEAs. While computer simulations, such as first-principles calculations and MD simulations, have become a commonly used tool for materials discovery, their computation expense limits their application in the accelerated exploration of potential HEAs. The recent implementation of data-driven techniques has provided a possible alternative for efficiently predicting phase formation in HEAs^{109,110,111,112,117,119,121,122}. ML can recognize the inner data pattern and construct a model to make quick predictions for unseen samples. Based on very sparse data, Raabe et al. proposed an active learning framework, which includes three main steps—targeted composition generation, physics-informed screening, and experimental feedback—to accelerate the design of high-entropy Invar alloys in an almost infinite compositional space (see Fig. 8). Compared with the conventional design approach, which requires years and many experiments, this ML workflow requires only a few months to develop HEAs with desirable properties¹²¹. Wu et al. used ML to successfully predict eutectic HEAs with excellent mechanical properties in the Al–Co–Cr–Fe–Ni HEA system, and analyzed the key elements for forming eutectic HEAs¹¹⁷. Islam et al. established a neural network model to predict the formation of the HEA phase. Cross-validation revealed a predictive accuracy of 83% on this limited data set¹⁰⁹. Amitava et al. used more algorithms to establish multiple prediction models and forecast the different structures of the solid solution (FCC, BCC). The prediction accuracy is over 90%, which is attributed to the fact that the random forest model has overwhelming advantages in dealing with small datasets compared to the artificial neural network algorithm¹¹¹. Thus, understanding and applying multiple ML algorithms is necessary for the prediction of HEA phase formation. Moreover, to solve the data shortage problem of HEAs, Lee utilized a conditional generative adversarial network to find a model distribution that emulates the distribution of known HEAs, then augmented realistic samples based on feature representation, and finally realized the expansion of the original dataset¹¹⁹. The results show that the accuracy of the model is significantly improved due to data augmentation.

**Fig. 8: Schematic flow chart of the active learning framework.**

Compared with the original ML modeling method, using feature engineering to construct a new descriptor can effectively determine the structure–performance relationship¹²³. Material descriptors and models determine the robustness of the ML prediction. Pei et al. carried out the ML modeling analysis of many parameters and the link between the phases, and identified the physical parameters that are crucial to the formation of solid solutions¹⁰⁰, such as volume modulus, melting temperature, etc. Dai et al. used feature engineering and the ML strategy to extend the descriptor dimension from a low dimension originally to a high dimension¹¹⁴. Due to the uniqueness of different algorithm constructions, the best performance model depends on the effective combination of datasets, descriptors, and algorithms. In this regard, Zhang et al. proposed a systematic framework that utilized a genetic algorithm (GA) to efficiently select the ML model and materials descriptors from a huge number of alternatives and demonstrated its efficiency on two-phase formation problems in HEAs¹¹⁴. Generally, the prediction accuracy of the model can be improved through hyperparameter optimization, such as increasing the number of hidden layers and neurons in the neural network¹⁰⁷. Overfitting and underfitting are the common problems that any ML may encounter¹¹³, and there is no exception in the study of predicting HEA phases by ML. Huang et al. found the overfitting phenomenon using ML phase projection. By adjusting the super parameters involved in the training process, training accuracy can always be improved to a higher level⁹⁹. Wen et al. proposed ML models to predict the solid solution strength/hardness of HEAs¹²³. Figure 9 shows the prediction error for the hardness of HEAs by five-fold cross-validation with possible combinations of different features (ξ, δXr, and ε, etc.). All ML models, including random forests (RF), support vector regression (SVR), kernel ridge regression (KRR), Gaussian process (GP), extreme gradient boosting (XGB), and Bayesian regularized neural networks (BRNN), show a basin-like tendency, indicating that too many or too few features will reduce the accuracy. According to “Occam’s razor” principle, simplicity, and interpretability with a minimum number of features are necessary for adequate accuracy. Using more features complicates the interpretation of the model and risks overlearning.

**Fig. 9: Feature selection based on combinations of features from different ML algorithms.**

In the absence of unified evaluation criteria, excessive optimism is often reported¹¹⁶ as a result of overfitting and the use of inappropriate training and test data. It is necessary to propose new standard criteria that can be used to evaluate the true accuracy and performance of ML models. An emphasis on experimental validation and repeatability through code archiving also helps overcome this challenge. The regularization method can be incorporated into the ML model to improve the generalizability of the model¹¹⁹. The hyperparameters of the model can also be optimized by the Bayesian optimization method to obtain good generalizability under the condition of high accuracy. In addition, constructing new rules with strong interpretability and universality through ML is desirable, which can be explored using conformable regression. Therefore, combining experimental results with theoretical guidance to analyze specific target characteristics is imperative to screen new HEAs with good performance¹¹⁵.

Prediction of mechanical properties

As a new kind of structural material that can serve under extreme environments, HEAs exhibit unique mechanical properties, such as high strength and hardness, and low moduli. These properties are generally used as selection parameters in the search for new alloys. This raises the question of whether ML algorithms can be readily used to the search for candidate alloys with better mechanical properties in such a large composition space.

As one of the most typical mechanical properties of HEAs, hardness has strong correlations with other properties, which requires an in-depth understanding. For instance, based on a reliable hardness–strength relationship, complex mechanical tests can be replaced to some extent by efficient and inexpensive hardness tests for a fast and comprehensive assessment of mechanical properties. Hence, develo** data-driven methods, in addition to experimental methods, is essential to effectively calculate, predict, and evaluate the hardness of HEAs. In this regard, several studies have attempted to explore the possibility of ML as an aid in hardness assessment. For example, using the integrated CALPHAD-ML approach, Sun and Lu et al. predicted the hardness of Ti–Zr–Nb–Ta refractory HEAs, which included building a database of 100 quaternary alloys, training the ML model, hardness prediction, and experimental verification, as shown in Fig. 10¹⁰². Menou et al. used a multi-objective optimization GA, together with solid solution hardening and thermodynamic modeling (CALPHAD), to design HEAs with high hardness¹²⁴. Combining the radial basis function neural network algorithm and first-principles calculations, Zhu et al. found the key role of Al and its significant influence on hardness in modeling the Al–Cr–Fe–Ni system¹²⁵. In a similar Al–Co–Cr–Cu–Fe–Ni system, Su et al. formulated a property-orientated materials design strategy combining ML, design of experiment, and feedback from experiment to search for HEAs with high hardness¹²⁶. On this basis, they further proposed ML models, including feature engineering and physical models, to provide insights for predicting the hardness of these HEAs.

**Fig. 10: Hardness distributions as functions of the Ta content.**

In recent years, there have been several studies on the moduli of HEAs. Recent developments in the field of HEAs have sparked interest in using ML to predict moduli. Balasubramanian et al. implemented gradient boost algorithms to predict Young’s modulus (\(E\)) as well as the phase structure of low-, medium-, and HEAs composed of refractory elements. The ML result was in good agreement with the experiments and revealed that the melting temperature and the enthalpy of mixing are the key features determining the \(E\) of refractory HEAs¹⁰⁵. Fewer studies have evaluated the role of ML in the plasticity or strength of HEAs compared to other mechanical properties (e.g., hardness and modulus). A principal reason is that the plasticity and strength data are very sensitive to the preparation process and sample sizes, leading to the poor quality of the original input dataset. Despite the obstacles, some attempts have been made to investigate the possibility of an ML framework for predicting the plasticity and strength of disordered alloys. Recently, Liu et al. constructed a data set through high-throughput preparation of solid solutions using powder metallurgy with Zr–Ti–Nb–O alloys as target materials¹²⁷. Their study provides an enlightening idea for enhancing the plasticity of HEAs by tailoring key features via tuning the element content.

ML force fields

MD simulations are normally conducted with classic interatomic potentials. As these potentials often scale linearly with the number of atoms, they are computationally inexpensive, and the loss in accuracy is ignored to facilitate longer simulations or simulations with large-scale systems that include hundreds of thousands of atoms. However, the construction of force fields and tight-binding parameters is not straightforward. Given this, ML methods can provide a useful option for creating a reliable potential energy representation. Machine learning potentials (MLPs) are mathematical representations of the multidimensional potential-energy surface as a function of atomic positions. Unlike traditional potentials, reference databases of MLPs are usually generated by DFT calculations without experimental information. The other two ingredients required for MLPs are local structural descriptors, such as atom-centered symmetry function descriptors¹²⁸, the smooth overlap of atomic positions¹²⁹, and spectral neighbor analysis potential descriptors^130,131,132 etc., representing atomic configurations and supervised learning models to obtain reliable relations between structure and energy, force, or stress tensor^133,134,135.

MLPs have greatly promoted the studies of structure, thermodynamics, and mechanical properties of HEAs. Short-range ordering (SRO) refers to local chemical/structural ordering, which is a common structural feature in HEAs. It arises from the chemical interactions of constituent elements and significantly affects structural stability, and magnetic and mechanical properties^136,137,138. Meshkov et al. used a low-rank potential in combination with MC simulations to investigate chemical SRO in the equiatomic fcc CoCrFeNi HEA, and demonstrated that Fe and Cr form sublattices¹³⁹. Similar schemes were also employed to study the phase stability, phase transitions, and chemical SRO of the bcc NbMoTaW HEA by Kostiuchenko et al.¹⁴⁰ They claimed that if local lattice distortions are introduced, the single phase stabilizes instead of separating into sublattices until it drops to room temperature. Later on, a new algorithm combining the thermodynamic integration method with moment tensor potentials was developed by Grabowski et al. to study the anharmonic free energy of a five-component VNbMoTaW refractory HEA, which achieved DFT-level accuracy¹⁴¹. DeepMD was also applied to molten TiZrHfNb using ab initio molecular dynamics (AIMD) trajectories¹⁴². Structural analyses of a VZrNbHfTa melt via partial RDFs and SRO parameters were exploited using high-dimensional neural network potential, indicating that vanadium atoms are repulsed by other types of atoms¹⁴³. Another NbMoTaW potential, adopting the SNAP model, was applied to study the complex strengthening mechanisms by modeling Nb segregations to the grain boundaries. Applying the SNAP model, polycrystalline models with and without Monte Carlo/MD simulations were obtained, as shown in Fig. 11a–b¹⁴⁴. Byggmästar et al. developed a set of Gaussian approximation potentials that were used to study segregation and radiation damage of the bcc refractory VNbMoTaW HEA^145,146. The potentials show good accuracy and transferability in terms of elasticity, thermal stability, liquid and defect structure, and surface properties¹⁴⁵. Figure 11c, d shows that the final defect structure of irradiated VNbMoTaW contains only smaller dislocation loops with respect to the pure W. In conclusion, the reduction of interstitial migration, the immovable dislocation loops, and the increase of vacancy mobility together promote the recombination of defects rather than clustering in HEAs¹⁴⁶. In addition, there are some MLPs for medium entropy alloys^147,148,149 and high entropy ceramics^4,5,6. For example, Pak et al. used Canonical Monte Carlo simulations with the ML interatomic potentials to determine the temperature conditions for the formation of single-phase and multi-phase high-entropy ceramics and claimed that for TiZrNbHfTaC₅ produced with electric arc discharge, the single-phase formation temperature was as high as 2000 K⁶.

**Fig. 11: Polycrystalline models obtained via simulation method.**

In general, interatomic potentials based on ML help to address the longstanding dilemma between efficiency and accuracy in MD simulations, but there are still some challenges in this field. First, the completeness of databases organized for the potentials of multicomponent chemically disordered systems is complicated and non-standardized, which is further exacerbated by short- or medium-range orders. Additionally, it is difficult to apply MLPs out of databases due to better flexibility but less extrapolation. Another concern is that MLPs are not based on physical information¹⁵⁰. While active learning approaches¹⁵¹ and physically informed MLPs¹⁵² may be the solutions, further development is still needed.

Outlook

This paper presents a concise review covering several aspects of this rapidly growing field over the past two decades, from high-throughput experiments and computations to the data-driven ML of HEAs. To inspire and spur new ideas, we present some perspectives and possible research directions in HEAs.

High-throughput characterization techniques and high-quality data acquisition for HEAs

To keep pace with continuous advancements in high-throughput material preparation methods, it is crucial to develop high-throughput characterization techniques that offer high resolution, efficiency, and affordability. From a microdomain or in situ measurement perspective, synchrotron X-ray techniques possess exceptional capabilities for high-throughput characterization of a vast array of material samples due to their remarkable brightness, and high temporal and spatial resolution, thereby alleviating the flux bottleneck in high-throughput experiments. In addition, subsequent data crafting with high quality remains an ongoing challenge. Manually extracting data with expert knowledge is a time-consuming task for thousands of articles. Thus, it is increasingly necessary to develop methods for automated data extraction that are both rapid and accurate. Techniques such as web-crawler, natural language processing, or pattern recognition could potentially facilitate the automatic extraction of information from articles or patterns such as SEM, EBSD synchrotron XRD, and others.

Metastable state of HEAs

Due to the multi-principal elements contained in HEAs and the metastable state, there is an urgent need to understand the nonequilibrium thermodynamics of HEAs from both experimental and calculation perspectives. The cooling rates of some high-throughput methods are much higher than those of traditional casting methods used for the preparation of bulk HEAs. In some extreme cases, owing in part to the multi-principal nature of HEAs, the combinational materials libraries made using high-throughput methods can form amorphous structures, which make the properties quite different from bulk HEAs. In terms of high-throughput CALPHAD, to develop a reliable thermodynamic database for HEA systems, the related binary and ternary systems should be gathered and assessed by implementing experiments and calculations.

Analysis of SRO in HEAs

To understand comprehensively the correlations between SRO and properties, and to facilitate the development of innovative alloys, it is imperative to scientifically describe and quantitatively characterize SRO in these compositionally complex alloys. However, the multi-principal element nature of HEAs poses significant challenges for direct experimental observation and accurate description of the SRO. Detailed chemical ordering information can be obtained by combining ML techniques with AIMD simulations or reverse Monte Carlo refinement methods.

Evaluation criteria and interpretability of ML methods for HEAs

In the absence of unified evaluation criteria, excessive optimism is frequently observed, resulting from overfitting and the use of unsuitable training and test data. It is essential to propose new standardized criteria to properly assess the true accuracy and performance of ML models. Prioritizing experimental validation and repeatability through code archiving can also help mitigate this issue. Additionally, the interpretability of ML models remains limited and necessitates bridging existing gaps. There is a need to develop new rules with robust interpretability and universality through ML exploration using appropriate algorithms. Techniques such as partial dependence plots, individual conditional expectation, permutation feature importance, global surrogate, local surrogate (LIME), and SHAP (SHapley Additive exPlanations) exhibit varying technical characteristics that enhance interpretability.

In summary, the future studies of high-throughput experiments, computations, and data-driven ML in HEAs will focus on a comprehensive workflow design, incorporating rational experimental design, automated high-throughput synthesis, fundamental principles of high-throughput materials characterization, computational modeling, and data mining techniques. This multidisciplinary approach will offer a robust framework for the rational design and discovery of materials.

References

Zhang, Y. et al. Microstructures and properties of high-entropy alloys. Prog. Mater. Sci. 61, 1–93 (2014). A comprehensive revier of high entropy alloys.
Article Google Scholar
George, E. P., Raabe, D. & Ritchie, R. O. High-entropy alloys. Nat. Rev. Mater. 4, 515–534 (2019).
Article CAS Google Scholar
Cantor, B. Multicomponent high-entropy Cantor alloys. Prog. Mater. Sci. 120, 100754 (2021).
Article CAS Google Scholar
Dai, F.-Z. et al. Theoretical prediction on thermal and mechanical properties of high entropy (Zr_0.2Hf_0.2Ti_0.2Nb_0.2Ta_0.2)C by deep learning potential. J. Mater. Sci. Technol. 43, 168–174 (2020).
Article CAS Google Scholar
Dai, F.-Z. et al. Temperature dependent thermal and elastic properties of high entropy (Ti_0.2Zr_0.2Hf_0.2Nb_0.2Ta_0.2)B₂: molecular dynamics simulation by deep learning potential. J. Mater. Sci. Technol. 72, 8–15 (2021).
Article CAS Google Scholar
Pak, A. Y. et al. Machine learning-driven synthesis of TiZrNbHfTaC₅ high-entropy carbide. npj Comput. Mater. 9, 7 (2023).
Article CAS Google Scholar
Li, H. et al. Fe-based bulk metallic glasses: glass formation, fabrication, properties and applications. Prog. Mater. Sci. https://doi.org/10.1016/j.pmatsci.2019.01.003 (2019).
Ye, Y. et al. High-entropy alloy: challenges and prospects. Mater. Today 19, 349–362 (2016).
Article CAS Google Scholar
Jain, A. Commentary: the materials project: a materials genome approach to accelerating materials innovation. APL Mater. 1, 011002 (2013).
Ramprasad, R. et al. Machine learning in materials informatics: recent applications and prospects. npj Comput. Mater. 3, 1–13 (2017).
Article Google Scholar
Liu, X. et al. Machine learning-based glass formation prediction in multicomponent alloys. Acta Mater. 201, 182–190 (2020).
Article CAS Google Scholar
Liu, Y. et al. Machine learning in materials genome initiative: a review. J. Mater. Sci. Technol. 57, 113–122 (2020).
Article Google Scholar
de Pablo, J. J. et al. New frontiers for the materials genome initiative. npj Comput. Mater. 5, 1–23 (2019).
Article Google Scholar
Miracle, D. B. & Senkov, O. N. A critical review of high entropy alloys and related concepts. Acta Mater. 122, 448–511 (2017).
Article CAS Google Scholar
Borkar, T. et al. A combinatorial assessment of Al_xCrCuFeNi₂ (0 < x < 1.5) complex concentrated alloys: microstructure, microhardness, and magnetic properties. Acta Mater. 116, 63–76 (2016). This article discusses a novel combinatorial approach for assessing composition–microstructure–microhardness–magnetic property relationships of laser deposited compositionally graded Al_xCrCuFeNi₂ (0 < x < 1.5) complex concentrated alloys.
Article CAS Google Scholar
Knoll, H. et al. Combinatorial alloy design by laser additive manufacturing. Steel Res. Int. 88, 1600416 (2017).
Article Google Scholar
Li, M. et al. Evaluation of microstructure and mechanical property variations Al_xCoCrFeNi high entropy alloys produced by a high-throughput laser deposition method. Intermetallics 95, 110–118 (2018).
Article CAS Google Scholar
Li, M. & Flores, K. M. Laser processing as a high-throughput method to investigate microstructure–processing–property relationships in multiprincipal element alloys. J. Alloys Compd. 825, 154025 (2020).
Article CAS Google Scholar
Melia, M. A. et al. High-throughput additive manufacturing and characterization of refractory high entropy alloys. Appl. Mater. Today 19, 100560 (2020).
Article Google Scholar
Moorehead, M. et al. High-throughput synthesis of Mo–Nb–Ta–W high-entropy alloys via additive manufacturing. Mater. Des. 187, 108358 (2020).
Article CAS Google Scholar
Pegues, J. W. et al. Exploring additive manufacturing as a high-throughput screening tool for multiphase high entropy alloys. Addit. Manuf. 37, 101598 (2021).
CAS Google Scholar
Huang, X. et al. Machine learning assisted modelling and design of solid solution hardened high entropy alloys. Mater. Des. 211, 110177 (2021).
Article CAS Google Scholar
Tsai, P. & Flores, K. M. High-throughput discovery and characterization of multicomponent bulk metallic glass alloys. Acta Mater. 120, 426–434 (2016).
Article CAS Google Scholar
Kelly, P. J. & Arnell, R. D. Magnetron sputtering: a review of recent developments and applications. Vacumm 56, 159–172 (2000).
Article CAS Google Scholar
Ding, S. et al. Combinatorial development of bulk metallic glasses. Nat. Mater. 13, 494–500 (2014).
Article CAS PubMed Google Scholar
Liu, Y. et al. Combinatorial development of antibacterial Zr–Cu–Al–Ag thin film metallic glasses. Sci. Rep. 6, 1–8 (2016).
Google Scholar
Kauffmann, A. et al. Combinatorial exploration of the high entropy alloy system Co–Cr–Fe–Mn–Ni. Surf. Coat. Technol. 325, 174–180 (2017).
Article CAS Google Scholar
**ng, Q. et al. High-throughput screening solar-thermal conversion films in a pseudobinary (Cr, Fe, V)–(Ta, W) system. ACS Comb. Sci. 20, 602–610 (2018).
Article CAS PubMed Google Scholar
Zhang, Y. et al. Compositional gradient films constructed by sputtering in a multicomponent Ti–Al–(Cr, Fe, Ni) system. J. Mater. Res. 33, 3330–3338 (2018).
Article CAS Google Scholar
Li, M.-X. et al. High-temperature bulk metallic glasses developed by combinatorial methods. Nature 569, 99–103 (2019).
Article CAS PubMed Google Scholar
Banko, L. et al. Unravelling composition–activity–stability trends in high entropy alloy electrocatalysts by using a data‐guided combinatorial synthesis strategy and computational modeling. Adv. Energy Mater. 12, 2103312 (2022). A strategy for effective extensions of high-dimensional composition spaces for the exemplary Ru–Rh–Pd–Ir–Pt system covered by combinatorial synthesis was demonstrated.
Article CAS Google Scholar
Marshal, A. et al. Combinatorial evaluation of phase formation and magnetic properties of FeMnCoCrAl high entropy alloy thin film library. Sci. Rep. 9, 1–11 (2019).
Article CAS Google Scholar
Ren, F. et al. Accelerated discovery of metallic glasses through iteration of machine learning and high-throughput experiments. Sci. Adv. 4, eaaq1566 (2018).
Article PubMed PubMed Central Google Scholar
Kube, S. A. et al. Phase selection motifs in high entropy alloys revealed through combinatorial methods: large atomic size difference favors BCC over FCC. Acta Mater. 166, 677–686 (2019).
Article CAS Google Scholar
Datye, A. et al. Accelerated discovery and mechanical property characterization of bioresorbable amorphous alloys in the Mg–Zn–Ca and the Fe–Mg–Zn systems using high-throughput methods. J. Mater. Chem. B 7, 5392–5400 (2019).
Article CAS PubMed Google Scholar
Ding, S. et al. Solidification of Au–Cu–Si alloys investigated by a combinatorial approach. J. Appl. Phys. 111, 114901 (2012).
Article Google Scholar
Zhao, J.-C., Jackson, M. & Peluso, L. Determination of the Nb–Cr–Si phase diagram using diffusion multiples. Acta Mater. 51, 6395–6405 (2003).
Article CAS Google Scholar
Zhao, J.-C. et al. A diffusion multiple approach for the accelerated design of structural materials. MRS Bull. 27, 324–329 (2002).
Article CAS Google Scholar
Zhao, J.-C., Zheng, X. & Cahill, D. G. High-throughput diffusion multiples. Mater. Today 8, 28–37 (2005).
Article CAS Google Scholar
Zhao, J.-C. Combinatorial approaches as effective tools in the study of phase diagrams and composition–structure–property relationships. Prog. Mater. Sci. 51, 557–631 (2006).
Article Google Scholar
Zhao, J.-C., Zheng, X. & Cahill, D. G. High-throughput measurements of materials properties. JOM 63, 40–44 (2011).
Article Google Scholar
Wilson, P., Field, R. & Kaufman, M. The use of diffusion multiples to examine the compositional dependence of phase stability and hardness of the Co–Cr–Fe–Mn–Ni high entropy alloy system. Intermetallics 75, 15–24 (2016).
Article CAS Google Scholar
Chen, W. & Zhang, L. High-throughput determination of interdiffusion coefficients for Co–Cr–Fe–Mn–Ni high-entropy alloys. J. Phase Equilib. Diffus. 38, 457–465 (2017).
Article CAS Google Scholar
Coury, F. G. et al. High-throughput solid solution strengthening characterization in high entropy alloys. Acta Mater. 167, 1–11 (2019).
Article CAS Google Scholar
Ding, W. et al. Diffusion bonding of copper to titanium using CoCrFeMnNi high-entropy alloy interlayer. Intermetallics 129, 107027 (2021).
Article CAS Google Scholar
Tsai, K.-Y., Tsai, M.-H. & Yeh, J.-W. Sluggish diffusion in Co–Cr–Fe–Mn–Ni high-entropy alloys. Acta Mater. 61, 4887–4897 (2013).
Article CAS Google Scholar
Kucza, W. et al. Studies of “sluggish diffusion” effect in Co–Cr–Fe–Mn–Ni, Co–Cr–Fe–Ni and Co–Fe–Mn–Ni high entropy alloys; determination of tracer diffusivities by combinatorial approach. J. Alloys Compd. 731, 920–928 (2018).
Article CAS Google Scholar
Wang, T. et al. Effect of reactive alloy elements on friction stir welded butt joints of metallurgically immiscible magnesium alloys and steel. J. Manuf. Processes 39, 138–145 (2019).
Article Google Scholar
Wang, T. et al. Towards heterogeneous Al_xCoCrFeNi high entropy alloy via friction stir processing. Mater. Lett. 236, 472–475 (2019).
Article CAS Google Scholar
Sinha, S. et al. Immiscible nanostructured copper–aluminum–niobium alloy with excellent precipitation strengthening upon friction stir processing and aging. Scr. Mater. 164, 42–47 (2019).
Article CAS Google Scholar
Agrawal, P. et al. Friction stir gradient alloying: a high-throughput method to explore the influence of V in enabling HCP to BCC transformation in a γ-FCC dominated high entropy alloy. Appl. Mater. Today 21, 100853 (2020).
Article Google Scholar
Shukla, S. et al. Friction stir gradient alloying: a novel solid-state high throughput screening technique for high entropy alloys. Mater. Today Commun. 23, 100869 (2020).
Article CAS Google Scholar
Tong, L. & Reddy, R. G. Synthesis of titanium carbide nano-powders by thermal plasma. Scr. Mater. 52, 1253–1258 (2005).
Article CAS Google Scholar
Zhu, B. et al. Fast and high‐throughput synthesis of medium‐and high‐entropy alloys using radio frequency inductively coupled plasma. Adv. Eng. Mater. 23, 2001116 (2021).
Article CAS Google Scholar
Shi, Y. et al. High-throughput synthesis and corrosion behavior of sputter-deposited nanocrystalline Al_x(CoCrFeNi) 100 − x combinatorial high-entropy alloys. Mater. Des. 195, 109018 (2020).
Article CAS Google Scholar
Haase, C. et al. Combining thermodynamic modeling and 3D printing of elemental powder blends for high-throughput investigation of high-entropy alloys—towards rapid alloy screening and design. Mater. Sci. Eng. A 688, 180–189 (2017).
Article CAS Google Scholar
Kaufmann, K. et al. Crystal symmetry determination in electron diffraction using machine learning. Science 367, 564–568 (2020).
Article CAS PubMed Google Scholar
Kaufmann, K. et al. Efficient few-shot machine learning for classification of EBSD patterns. Sci. Rep. 11, 8172 (2021).
Article CAS PubMed PubMed Central Google Scholar
Tsutsui, K. et al. Microstructural diagram for steel based on crystallography with machine learning. Comput. Mater. Sci. 159, 403–411 (2019).
Article CAS Google Scholar
Yoo, Y. K. et al. Identification of amorphous phases in the Fe–Ni–Co ternary alloy system using continuous phase diagram material chips. Intermetallics 14, 241–247 (2006).
Article CAS Google Scholar
Hui, J. et al. High-throughput investigation of crystal-to-glass transformation of Ti–Ni–Cu ternary alloy. Sci. Rep. 9, 1–8 (2019).
Article CAS Google Scholar
Joress, H. et al. A high-throughput structural and electrochemical study of metallic glass formation in Ni–Ti–Al. ACS Comb. Sci. 22, 330–338 (2020).
Article CAS PubMed PubMed Central Google Scholar
Haque, M. & Saif, M. A review of MEMS-based microscale and tensile and bending testing. Exp. Mech. 43, 248–255 (2003).
Uchic, M. D. et al. Sample dimensions influence strength and crystal plasticity. Science 305, 986–989 (2004).
Article CAS PubMed Google Scholar
McCluskey, P. J. et al. Precipitation and thermal fatigue in Ni–Ti–Zr shape memory alloy thin films by combinatorial nanocalorimetry. Acta Mater. 59, 5116–5124 (2011).
Article CAS Google Scholar
Kim, H.-J. et al. High-throughput analysis of thin-film stresses using arrays of micromachined cantilever beams. Rev. Sci. Instrum. 79, 045112 (2008).
Article PubMed Google Scholar
Figiel, H., Zogał, O. & Yartys, V. Effect of iron content on the microstructure evolution, mechanical properties and wear resistance of FeXCoCrNi high-entropy alloy system produced via MA-SPS Parisa. J. Alloys Compd. 404, 1 (2005).
Article Google Scholar
Arunkumar, S. Overview of small punch test. Met. Mater. Int. 26, 719–738 (2020).
Article Google Scholar
Cai, Y. et al. Fracture and wear mechanisms of FeMnCrNiCo + x(TiC) composite high-entropy alloy cladding layers. Appl. Surf. Sci. 543, 148794 (2021).
Article CAS Google Scholar
Marshal, A. et al. Combinatorial synthesis of high entropy alloys: introduction of a novel, single phase, body-centered-cubic FeMnCoCrAl solid solution. J. Alloys Compd. 691, 683–689 (2017).
Article CAS Google Scholar
Wei, T. et al. Scanning tip microwave near‐field microscope. Appl. Phys. Lett. 68, 3506–3508 (1996).
Article CAS Google Scholar
Gao, C., Duewer, F. & **ang, X.-D. Quantitative microwave evanescent microscopy. Appl. Phys. Lett. 75, 3005–3007 (1999).
Article CAS Google Scholar
Turchinskaya, M. et al. Rapid constructing magnetic phase diagrams by magneto-optical imaging of composition spread films. J. Mater. Res. 19, 2546–2548 (2004).
Article CAS Google Scholar
Plimpton, S. Fast parallel algorithms for short-range molecular dynamics. J. Comput. Phys. 117, 1–19 (1995).
Article CAS Google Scholar
Kresse, G. & Furthmüller, J. Efficiency of ab initio total energy calculations for metals and semiconductors using a plane-wave basis set. Comput. Mater. Sci. 6, 15–50 (1996).
Article CAS Google Scholar
Raicu, I. (ed) Many-task Computing: Bridging the Gap Between High-Throughput Computing and High-performance Computing (University of Chicago, 2009).
Ong, S. P. et al. Python materials genomics (pymatgen): a robust, open-source python library for materials analysis. Comput. Mater. Sci. 68, 314–319 (2013).
Article CAS Google Scholar
Jain, A. et al. FireWorks: a dynamic workflow system designed for high‐throughput applications. Concurrency Comput. Pract. Exper. 27, 5037–5059 (2015).
Article Google Scholar
Mathew, K. et al. Atomate: a high-level interface to generate, execute, and analyze computational materials science workflows. Comput. Mater. Sci. 139, 140–152 (2017).
Article Google Scholar
Wang, G. et al. ALKEMIE: an intelligent computational platform for accelerating materials discovery and design. Comput. Mater. Sci. 186, 110064 (2021).
Article CAS Google Scholar
Yang, X. et al. MatCloud: a high-throughput computational infrastructure for integrated management of materials simulation, data and resources. Comput. Mater. Sci. 146, 319–333 (2018).
Article CAS Google Scholar
Kirklin, S. et al. The open quantum materials database (OQMD): assessing the accuracy of DFT formation energies. npj Comput. Mater. 1, 1–15 (2015).
Article Google Scholar
Soven, P. Coherent-potential model of substitutional disordered alloys. Phys. Rev. 156, 809 (1967).
Article CAS Google Scholar
Tian, F. A review of solid-solution models of high-entropy alloys based on ab initio calculations. Front. Mater. 4, 36 (2017).
Article Google Scholar
Aitken, Z. H., Sorkin, V. & Zhang, Y.-W. Atomistic modeling of nanoscale plasticity in high-entropy alloys. J. Mater. Res. 34, 1509–1532 (2019).
Article CAS Google Scholar
Santodonato, L. J. et al. Predictive multiphase evolution in Al-containing high-entropy alloys. Nat. Commun. 9, 4520 (2018).
Article CAS PubMed PubMed Central Google Scholar
C, S. C. A. B. et al. AFLOWLIB.ORG: a distributed materials properties repository from high-throughput ab initio calculations. Comput. Mater. Sci. 58, 227–235 (2012).
Article Google Scholar
Lederer, Y. et al. The search for high entropy alloys: a high-throughput ab initio approach. Acta Mater. 159, 364–383 (2018).
Article CAS Google Scholar
Kaufman, L. & Bernstein, H. (eds) Computer Calculation of Phase Diagrams. With Special Reference to Refractory Metals (Academic Press, 1970).
Zhang, C. et al. Computational thermodynamics aided high-entropy alloy design. JOM 64, 839–845 (2012).
Article CAS Google Scholar
Gurao, N. & Biswas, K. In the quest of single phase multi-component multiprincipal high entropy alloys. J. Alloys Compd. 697, 434–442 (2017).
Article Google Scholar
Chen, H.-L., Mao, H. & Chen, Q. Database development and Calphad calculations for high entropy alloys: challenges, strategies, and tips. Mater. Chem. Phys. 210, 279–290 (2018).
Article CAS Google Scholar
Klaver, T., Simonovic, D. & Sluiter, M. H. Brute force composition scanning with a CALPHAD database to find low temperature body centered cubic high entropy alloys. Entropy 20, 911 (2018).
Article CAS PubMed PubMed Central Google Scholar
Yang, S. et al. Revisit the VEC rule in high entropy alloys (HEAs) with high-throughput CALPHAD approach and its applications for material design—a case study with Al–Co–Cr–Fe–Ni system. Acta Mater. 192, 11–19 (2020).
Article CAS Google Scholar
Feng, R. et al. High-throughput design of high-performance lightweight high-entropy alloys. Nat. Commun. 12, 4329 (2021).
Article CAS PubMed PubMed Central Google Scholar
Cubuk, E. D. et al. Identifying structural flow defects in disordered solids using machine-learning methods. Phys. Rev. Lett. 114, 108001 (2015).
Article CAS PubMed Google Scholar
Wang, A. Y.-T. et al. Machine learning for materials scientists: an introductory guide toward best practices. J. Chem. Mater. 32, 4954–4965 (2020).
Article CAS Google Scholar
Jablonka, K. M. et al. Big-data science in porous materials: materials genomics and machine learning. Chem. Rev. 120, 8066–8129 (2020).
Article CAS PubMed PubMed Central Google Scholar
Huang, W., Martin, P. & Zhuang, H. Machine-learning phase prediction of high-entropy alloys. Acta Mater. 169, 225–236 (2019).
Article CAS Google Scholar
Pei, Z. et al. Machine-learning informed prediction of high-entropy solid solution formation: beyond the Hume–Rothery rules. npj Comput. Mater. 6, 1–8 (2020).
Article Google Scholar
Kaufmann, K. & Vecchio, K. S. Searching for high entropy alloys: a machine learning approach. Acta Mater. 198, 178–222 (2020). A novel high-throughput approach called “ML-HEA” was proposed to predict the solid solution forming ability by coupling thermodynamic and chemical features with a random forest machine learning model.
Article CAS Google Scholar
Sun, Y. et al. Prediction of Ti–Zr–Nb–Ta high-entropy alloys with desirable hardness by combining machine learning and experimental data. Appl. Phys. Lett. 119, 201905 (2021). This work combines a machine learning (ML) model with phase diagram calculations (CALPHAD) to design Ti–Zr–Nb–Ta refractory HEAs with a desirable hardness.
Article CAS Google Scholar
Kaufmann, K. et al. Discovery of high-entropy ceramics via machine learning. npj Comput. Mater. 6, 9 (2020).
Article Google Scholar
Kim, G. et al. First-principles and machine learning predictions of elasticity in severely lattice-distorted high-entropy alloys with experimental validation. Acta Mater. 181, 124–138 (2019).
Article CAS Google Scholar
Roy, A. et al. Machine learned feature identification for predicting phase and Young’s modulus of low-, medium-and high-entropy alloys. Scr. Mater. 185, 152–158 (2020).
Article CAS Google Scholar
Guo, S. et al. Effect of valence electron concentration on stability of fcc or bcc phase in high entropy alloys. J. Appl. Phys. 109, 103505 (2011).
Article Google Scholar
Crisci, C., Ghattas, B. & Perera, G. A review of supervised machine learning algorithms and their applications to ecological data. Ecol. Modell. 240, 113–122 (2012).
Article Google Scholar
Najafabadi, M. M. et al. Deep learning applications and challenges in big data analytics. J. Big Data 2, 1–21 (2015).
Article Google Scholar
Islam, N., Huang, W. & Zhuang, H. Machine learning for phase selection in multi-principal element alloys. Comput. Mater. Sci. 150, 230–235 (2018).
Article CAS Google Scholar
Jha, R. et al. Combined machine learning and CALPHAD approach for discovering processing-structure relationships in soft magnetic alloys. Comput. Mater. Sci. 150, 202–211 (2018).
Article CAS Google Scholar
Choudhury, A. et al. Structure prediction of multi-principal element alloys using ensemble learning. Eng. Comput. 37, 1003–1022 (2020).
Zhou, Z. et al. Machine learning guided appraisal and exploration of phase design for high entropy alloys. npj Comput. Mater. 5, 1–9 (2019).
Article CAS Google Scholar
Bu, C. & Zhang, Z. Research on overfitting problem and correction in machine learning. J. Phys. Conf. Ser. https://doi.org/10.1088/1742-6596/1693/1/012100 (2020).
Dai, D. et al. Using machine learning and feature engineering to characterize limited material datasets of high-entropy alloys. Comput. Mater. Sci. 175, 109618 (2020).
Article CAS Google Scholar
Li, R. et al. High-throughput calculations for high-entropy alloys: a brief review. Front. Mater. 7, 290 (2020).
Article Google Scholar
Sparks, T. D. et al. Machine learning for structural materials. Annu. Rev. Mater. Res. 50, 27–48 (2020).
Article CAS Google Scholar
Wu, Q. et al. Uncovering the eutectics design by machine learning in the Al–Co–Cr–Fe–Ni high entropy system. Acta Mater. 182, 278–286 (2020).
Article CAS Google Scholar
Krishna, Y. V., Jaiswal, U. K. & Rahul, M. Machine learning approach to predict new multiphase high entropy alloys. Scr. Mater. 197, 113804 (2021).
Article CAS Google Scholar
Lee, S. Y. et al. Deep learning-based phase prediction of high-entropy alloys: optimization, generation, and explanation. Mater. Des. 197, 109260 (2021).
Article CAS Google Scholar
Machaka, R. Machine learning-based prediction of phases in high-entropy alloys. Comput. Mater. Sci. 188, 110244 (2021).
Article CAS Google Scholar
Rao, Z. et al. Machine learning-enabled high-entropy alloy discovery. Science 378, 78–85 (2022).
Article CAS PubMed Google Scholar
Pei, Z. et al. Toward the design of ultrahigh-entropy alloys via mining six million texts. Nat. Commun. 14, 54 (2023).
Article CAS PubMed PubMed Central Google Scholar
Wen, C. et al. Modeling solid solution strengthening in high entropy alloys using machine learning. Acta Mater. 212, 116917 (2021).
Article CAS Google Scholar
Menou, E. et al. Computational design of light and strong high entropy alloys (HEA): obtainment of an extremely high specific solid solution hardening. Scr. Mater. 156, 120–123 (2018).
Article CAS Google Scholar
Qiao, L. et al. Modelling and prediction of hardness in multi-component alloys: a combined machine learning, first principles and experimental study. J. Alloy. Compd. 853, 156959 (2021).
Article CAS Google Scholar
Wen, C. et al. Machine learning assisted design of high entropy alloys with desired property. Acta Mater. 170, 109–117 (2019).
Article CAS Google Scholar
Si, S. et al. Study on strengthening effects of Zr–Ti–Nb–O alloys via high throughput powder metallurgy and data-driven machine learning. Mater. Des. 206, 109777 (2021).
Article CAS Google Scholar
Behler, J. Atom-centered symmetry functions for constructing high-dimensional neural network potentials. J. Chem. Phys. 134, 074106 (2011).
Article PubMed Google Scholar
Bartók, A. P., Kondor, R. & Csányi, G. On representing chemical environments. Phys. Rev. B 87, 184115 (2013).
Article Google Scholar
Thompson, A. P. et al. Spectral neighbor analysis method for automated generation of quantum-accurate interatomic potentials. J. Comput. Phys. 285, 316–330 (2015).
Article CAS Google Scholar
Chen, C. et al. Accurate force field for molybdenum by machine learning large materials data. Phys. Rev. Mater. 1, 043603 (2017).
Article Google Scholar
Li, X.-G. et al. Quantum-accurate spectral neighbor analysis potential models for Ni–Mo binary alloys and fcc metals. Phys. Rev. B 98, 094104 (2018).
Article CAS Google Scholar
Behler, J. & Parrinello, M. Generalized neural-network representation of high-dimensional potential-energy surfaces. Phys. Rev. Lett. 98, 146401 (2007).
Article PubMed Google Scholar
Bartók, A. P. et al. Gaussian approximation potentials: the accuracy of quantum mechanics, without the electrons. Phys. Rev. Lett. 104, 136403 (2010).
Article PubMed Google Scholar
Shapeev, A. V. Moment tensor potentials: a class of systematically improvable interatomic potentials. Multiscale Model. Simul. 14, 1153–1173 (2016).
Article Google Scholar
Lei, Z. et al. Enhanced strength and ductility in a high-entropy alloy via ordered oxygen complexes. Nature 563, 546–550 (2018).
Article CAS PubMed Google Scholar
Ding, Q. et al. Tuning element distribution, structure and properties by composition in high-entropy alloys. Nature 574, 223–227 (2019).
Article CAS PubMed Google Scholar
Zhang, L. et al. The effect of randomness on the strength of high-entropy alloys. Acta Mater. 166, 424–434 (2019).
Article CAS Google Scholar
Meshkov, E. et al. Sublattice formation in CoCrFeNi high-entropy alloy. Intermetallics 112, 106542 (2019).
Article CAS Google Scholar
Kostiuchenko, T. et al. Impact of lattice relaxations on phase transitions in a high-entropy alloy studied by machine-learning potentials. npj Comput. Mater. 5, 1–7 (2019). This work proposed an efficient computational method based on machine-learning potentials and combined Monte Carlo simulations to study phase stability, phase transitions, and chemical short-range order of HEAs.
Article Google Scholar
Grabowski, B. et al. Ab initio vibrational free energies including anharmonicity for multicomponent alloys. npj Comput. Mater. 5, 1–6 (2019).
Article CAS Google Scholar
Balyakin, I. & Rempel, A. Machine learning interatomic potential for molten TiZrHfNb. AIP Conf. Proc. 2313, 030037 (2020).
Balyakin, I. et al. Ab initio molecular dynamics and high-dimensional neural network potential study of VZrNbHfTa melt. J. Phys. Condens. Matter 32, 214006 (2020).
Article CAS PubMed Google Scholar
Li, X.-G. et al. Complex strengthening mechanisms in the NbMoTaW multi-principal element alloy. npj Comput. Mater. 6, 1–10 (2020).
Article Google Scholar
Byggmästar, J., Nordlund, K. & Djurabekova, F. Gaussian approximation potentials for body-centered-cubic transition metals. Phys. Rev. Mater. 4, 093802 (2020).
Article Google Scholar
Byggmästar, J., Nordlund, K. & Djurabekova, F. Modeling refractory high-entropy alloys with efficient machine-learned interatomic potentials: defects and segregation. Phys. Rev. B 10, 104101 (2021).
Article Google Scholar
Jafary-Zadeh, M. et al. Applying a machine learning interatomic potential to unravel the effects of local lattice distortion on the elastic properties of multi-principal element alloys. J. Alloys Compd. 803, 1054–1062 (2019).
Article CAS Google Scholar
Kostiuchenko, T. et al. Short-range order in face-centered cubic VCoNi alloys. Phys. Rev. Mater. 4, 113802 (2020).
Article CAS Google Scholar
Zhao, L. et al. Anomalous dislocation core structure in shock compressed bcc high-entropy alloys. Acta Mater. 209, 116801 (2021).
Article CAS Google Scholar
Behler, J. Perspective: machine learning potentials for atomistic simulations. J. Chem. Phys. 145, 170901 (2016).
Article PubMed Google Scholar
Podryabinkin, E. V. & Shapeev, A. V. Active learning of linearly parametrized interatomic potentials. Comput. Mater. Sci. 140, 171–180 (2017).
Article CAS Google Scholar
Pun, G. P. et al. Physically informed artificial neural networks for atomistic modeling of materials. Nat. Commun. 10, 1–10 (2019).
Article CAS Google Scholar
Meng, H. et al. Formation ability descriptors for high-entropy diborides established through high-throughput experiments and machine learning. Acta Mater. 256, 119132 (2023).
Article CAS Google Scholar
Jaafreh, R. et al. Machine learning guided discovery of super-hard high entropy ceramics. Mater. Lett. 306, 130899 (2022).
Article CAS Google Scholar
**ong, J., Shi, S.-Q. & Zhang, T.-Y. Machine learning of phases and mechanical properties in complex concentrated alloys. J. Mater. Sci. Technol. 87, 133–142 (2021).
Article Google Scholar
Bhandari, U. et al. Yield strength prediction of high-entropy alloys using machine learning. Mater. Today Commun. 26, 101871 (2021).
Article CAS Google Scholar
Wang, J. et al. A neural network model for high entropy alloy design. npj Comput. Mater. 9, 60 (2023).

Download references

Acknowledgements

This research was supported by National Natural Science Foundation of China (Nos. 52130108, 52301213, 52071024, and 52271003), Guangdong Basic and Applied Basic Research Foundation (No. 2022A1515110805), National Key R&D Program of China (No. 2022YFA1603801), the Funds for Creative Research Groups of China (No. 51921001), the Open Fund of the China Spallation Neutron Source Songshan Lake Science City (No. KFKT2023B11), Program for Changjiang Scholars and Innovative Research Team in University of China (No. IRT_14R05), and State Key Lab of Advanced Metals and Materials (No. 2022-ZD01).

Author information

Authors and Affiliations

Bei**g Advanced Innovation Center for Materials Genome Engineering, State Key Laboratory for Advanced Metals and Materials, University of Science and Technology Bei**g, Bei**g, China
Lu Zhichao, Liu ** Lu
Songshan Lake Materials Laboratory, Dongguan, China
Lu Zhichao & Ma Dong
Institute for Materials Intelligent Technology, Liaoning Academy of Materials, Shenyang, China
Liu ** Lu

Authors

Lu Zhichao
View author publications
You can also search for this author in PubMed Google Scholar
Ma Dong
View author publications
You can also search for this author in PubMed Google Scholar
Liu **ongjun
View author publications
You can also search for this author in Zhao** Lu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Zhichao L. and Zhao** L. conceived the idea and concept, wrote the initial manuscript, and validated the discussion. X.L. revised the paper. D.M. and Zhao** L. supervised the work, led the project, and contributed to the final writing. All authors discussed and approved the final manuscript.

Corresponding authors

Correspondence to Liu ** Lu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Materials thanks Vineeth Venugopal, Ben Breitung, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editors: Eun Soo Park and John Plummer.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhichao, L., Dong, M., **ongjun, L. et al. High-throughput and data-driven machine learning techniques for discovering high-entropy alloys. Commun Mater 5, 76 (2024). https://doi.org/10.1038/s43246-024-00487-3

Download citation

Received: 01 May 2023
Accepted: 03 April 2024
Published: 17 May 2024
DOI: https://doi.org/10.1038/s43246-024-00487-3
Springer Nature Limited

Associated content

High-entropy alloys and ceramics

Collection 01 April 2022

High-throughput and data-driven machine learning techniques for discovering high-entropy alloys

Abstract