We have reanalyzed the rich plethora of ground state configurations of the asymmetric Wigner bilayer system that we had recently published in a related diagram of states [Antlanger et al., Phys. Rev. Lett. 117, 118002 (2016)], comprising roughly 60 000 state points in the phase space spanned by the distance between the plates and the charge asymmetry parameter of the system. In contrast to this preceding contribution where the classification of the emerging structures was carried out “by hand,” we have used for the present contribution machine learning concepts, notably based on a principal component analysis and a k-means clustering approach: using a 30-dimensional feature vector for each emerging structure (containing relevant information, such as the composition of the configuration as well as the most relevant order parameters), we were able to reanalyze these ground state configurations in a considerably more systematic and comprehensive manner than we could possibly do in the previously published classification scheme. Indeed, we were now able to identify new structures in previously unclassified regions of the parameter space and could considerably refine the previous classification scheme, thereby identifying a rich wealth of new emerging ground state configurations. Thorough consistency checks confirm the validity of the newly defined diagram of states.
REFERENCES
We refer to Ref. 1 to a review on various machine learning tools applied to physical systems.
See Ref. 2 for an in-depth discussion on different, problem specific similarity measures in data science problems. Methods from unsupervised machine learning1 can be used to analyze a data set of feature vectors (or of order parameters in our case) for certain similarity measures in the features that may permit us to algorithmically organize the elements of the data set into an initially unknown set of categories.2
Depending on the particularly applied clustering algorithm the number of clusters may be a preset parameter to the algorithm or may even be identified by the algorithm during execution.
The total number of possible compositions at each value of η is given that for N1 > 1, and only counting the monolayer structure with N2 = 0 and N1 = 1 once. For N = 40, we thus have Ntot = 401.
For a discussion about alternative dimensional reduction and clustering tools and a justification for opting for PCA and k-means clustering in this manuscript, we refer to the supplementary material, Sec. III.
The “elbow” in a linear scree plot,66 such as shown in the main panel of Fig. 3, is defined as the point where the contribution of the significant eigenvalues (here realized via the PEVs) seems to level off.4 Notably, this is a subjective measure as multiple elbows can occur in a scree plot. In this contribution, we opted for the second elbow (i.e., for i-values larger than i = 9) in Fig. 3—as opposed to the first elbow (at i = 4)—in an effort to include more information and to be on the safe side in our further analysis (cf., the supplementary material, Figs. 7 and 8, for the significance of PCi>4 in the state diagram of the asymmetric Wigner bilayer system).
It should be noted that we do not claim that the first three PCs are sufficient to fully describe the structural diversity of the asymmetric Wigner bilayer system. We have decided to use PC1 through PC3 in Fig. 5 simply for illustrative purposes in an effort to motivate the need for a more refined investigation of the diagram of states. Using other permutations of PC1 through PC9 will lead to similar plots that differ, however, in significant details, which, in turn, allows for a more systematic classification of structural phases in the system.
Nowadays, it is easily possible to train a neural network in a supervised way with the objective of performing classification tasks.1 For our purposes, such a task would be to classify structural data into a number of K different categories (identified, for instance, by unsupervised clustering), which would allow us to directly classify a structure from its geometric, structural data, e.g., via coordinates and lattice vectors.67 The output of the classifier would then be the probability of a structure falling into any of the K clusters or families (when using “softmax” activation in the output layer of the neural network and “categorical cross-entropy loss” during training1), which may give additional insight when comparing competing structures.
We report at this occasion that for all K-values both k- and k∗-means clustering results nicely correlate when evaluating the adjusted mutual information scores between the k- and k∗-means clustering, i.e., as defined in the Appendix E; for more details and graphical representations, we refer to Ref. 40.
Although most of the suggested ground state candidates identified by the evolutionary algorithm in Refs. 11–13 are very likely to represent the true ground state configurations of the asymmetric Wigner bilayer system at a given state point, there is no rigorous proof that they are, indeed, the ground states. Thus, whenever we use the term “ground state solutions,” we refer to “ground state candidate solutions” of the asymmetric Wigner bilayer system.
Also, for the k∗-means clustering, we rely on the set of analytically labeled data, w(sym), of the entire data set, X(asym), which correspond to the ground state solutions of the symmetric case, A = 1, in the evaluation of , given by Eq. (D1): we, respectively, compare in the labels w(sym) with and , i.e., the fraction of the samples and , which, respectively, corresponds to the known ground state structures of the symmetric Wigner bilayer system.