We outline a machine learning strategy for quantitively determining the conformation of AB-type diblock copolymers with excluded volume effects using small angle scattering. Complemented by computer simulations, a correlation matrix connecting conformations of different copolymers according to their scattering features is established on the mathematical framework of a Gaussian process, a multivariate extension of the familiar univariate Gaussian distribution. We show that the relevant conformational characteristics of copolymers can be probabilistically inferred from their coherent scattering cross sections without any restriction imposed by model assumptions. This work not only facilitates the quantitative structural analysis of copolymer solutions but also provides the reliable benchmarking for the related theoretical development of scattering functions.

Block copolymers, in which the chemically distinct monomer segments are grouped into discrete blocks along the polymer chain, have played essential roles in science and technology. The self-assembly of block copolymers in bulk, on surfaces, or in solution has been intensively studied because of their domain sizes and rich chemical tunabilities.1,2 In particular, block copolymers spontaneously undergo microphase separation in solutions to form an array of discrete nanostructures with a size range from sub-10 nm to micrometers.3 A suite of micellar structures can be achieved by tuning structural parameters of a block copolymer, such as the chemical compositions of the monomers, the molecular weights, and the copolymer architecture. In addition, solvent quality is one of the most important parameters in forming different self-assembled structures because it dictates the conformations of the polymer chains.

To control the thermodynamic properties and functionality of self-assembled structures by the de novo design of materials at the nanoscale, there is a critical need to understand the molecular configuration of individual copolymer amphiphiles, such as persistence and the contour length of each constituent segment.4,5 Prominent among the experimental tools for determining conformations of copolymers is the technique of small angle scattering, including both the neutron and x ray.6 Central to this existing experimental approach is the development of a theoretical model of static two-point correlation functions, through which the conformations of copolymers can be quantitatively described in terms of the optimized structural parameters obtained from regression analysis of measured coherent scattering intensity.

In the past few decades, there has been much interest in developing the scattering functions of homopolymers in the presence of excluded volume effects.7–9 In this pursuit, one continued endeavor is to analytically derive the static two-point correlation functions of semiflexible chains with self-avoiding effect on the intra-chain density distribution as a correction of ideal Gaussian chain statistics.10–16 The domain of interest also includes the phenomenological approaches, which develop the composite scattering functions of semiflexible chains by phenomenologically joining the asymptotic expressions of coherent scattering intensity manifesting at different length scales using computer simulations.17 However, the question of whether a scattering function of semiflexible diblock copolymers consisting of two conformationally distinct segments can be developed based on these two existing approaches has not been addressed unambiguously. Here, the two-point spatial correlation of an AB-type deblock copolymer in the reciprocal Q space, SQ, takes the form

SQ=SAAQ+SBBQ+SABQ,
(1)

where SAAQ, SBBQ, and SABQ represent the coherent scattering contributions from the intra-chain correlation of segment A, the intra-chain correlation of segment B, and the cross correlation between segments A and B, respectively. SABQ can be explicitly expressed as

SABQ=iAiBbibiδriδrjeiQrirj,
(2)

where bi and bj are the bound scattering length of positions i and j located in segments A and B, respectively, and ⟨⟩ represents the ensemble average. While SAAQ and SBBQ in Eq. (1) can be modeled satisfactorily,17,18 defining an analytical expression of SABQ is a formidable challenge because of the difficulty in identifying the functional expression of the relative positional vector rirj, given the heterogeneous intra-chain flexibility. Moreover, as indicated in Fig. 1(a), for a copolymer in the asymptotic power law regime6 of 101 < QL < 102, where L is the contour length, the corresponding SDCQ reflects the collective contributions of SAAQ,SBBQ, and SABQ. The development of SQ based on the existing phenomenological approach for homopolymers17 is severely hampered by the complication due to the heterogeneity in segmental flexibility since the parameterized crossover function used to connect different asymptotic regions in reciprocal space for homopolymers is no longer valid. As a result, no scattering function of diblock copolymers with excluded volume effect has been reported thus far. This challenge provides the motivation of our study.

FIG. 1.

(a) The intra-chain structure factor of an AB-type diblock copolymer SQL (black solid curve), the partial structure factors of segment A SAAQL (blue dotted curve), the partial structure factors of segment B SBBQL (red dashed curve), and the cross correlation SABQL (pink dotted-dashed curve). The data are presented in a dimensionless unit of QL, where L is the contour length of copolymer. The probed range of each parameter is given by the associated color bar. (b)–(d) The evolution of SQL as a function of lA, the persistence of segment A in the unit of L, γ, the ratio of lA to lB, the persistence of segment B, and f, the ratio of contour length of segment A to that of segment B, respectively. The black curves in (b)–(d) represent the rigid rod scattering function SrodQL. The insets give the evolutions of the corresponding excess structure factor ΔSQL, defined as the ratio S(QL)/SrodQL relative to that of a rigid rod with length L. The slopes of gray lines are −1 and −2, representing the Q dependence of SQL of the rigid rod and Gaussian chain in the intermediate QL range, respectively.

FIG. 1.

(a) The intra-chain structure factor of an AB-type diblock copolymer SQL (black solid curve), the partial structure factors of segment A SAAQL (blue dotted curve), the partial structure factors of segment B SBBQL (red dashed curve), and the cross correlation SABQL (pink dotted-dashed curve). The data are presented in a dimensionless unit of QL, where L is the contour length of copolymer. The probed range of each parameter is given by the associated color bar. (b)–(d) The evolution of SQL as a function of lA, the persistence of segment A in the unit of L, γ, the ratio of lA to lB, the persistence of segment B, and f, the ratio of contour length of segment A to that of segment B, respectively. The black curves in (b)–(d) represent the rigid rod scattering function SrodQL. The insets give the evolutions of the corresponding excess structure factor ΔSQL, defined as the ratio S(QL)/SrodQL relative to that of a rigid rod with length L. The slopes of gray lines are −1 and −2, representing the Q dependence of SQL of the rigid rod and Gaussian chain in the intermediate QL range, respectively.

Close modal

To eliminate critical deficiencies of the existing parametric methods, which are unable to treat the scattering behavior of copolymers realistically in a tractable manner, we propose a data-driven machine learning (ML) approach based on the algorithm of the Gaussian process (GP)19–21 for inversely determining the conformation of copolymers from the coherent scattering intensity SQ. Our approach presents a conceptually drastic departure from the conventional protocol of scattering data analysis: The relevant conformational parameters are now probabilistically inferred from the expressive features of SQ, instead of deterministically obtained based on its functional expression that must be specified a priori.

The prerequisite of every quantitative inverse problem of scattering is to identify an explicit mathematical relation between the experimentally coherent scattering cross section and the relevant parameters so that the structure of studied systems can be described based on the geometric picture built upon the optimized parameters obtained from regression analysis of collected spectra. Before establishing this relation, it is instructive to quantitatively examine the structure factor of the copolymer as a function of conformational parameters. In this study, we first identified a set of conformational parameters Y=lA,γ,f, where lA is the persistence of segment A in the unit of contour length of copolymer L, γ is the ratio of lB, the persistence of segment B, to lA, and f is the ratio of contour length of segment B to the contour length L of a copolymer. These parameters provide a unique description of the conformation of a diblock copolymer with contour length L by applying a constraint γ ≥ 1. Using Monte Carlo (MC) simulations (see  Appendix A), an extensive library of intra-copolymer structure factors SQLtraining based on experimentally relevant Ytraining was generated. Here, we present the simulated structure factor in a dimensionless unit of QL. Experimentally, L can be precisely determined in the intermediate Q range of collected spectra. As indicated in Fig. 1, continuous variations of lA, γ, and f resulted in smooth changes in SQL. One can therefore infer that these parameters are statistically correlated in the vector space of SQL. To further highlight the expressiveness of the two-point correlation of data, the scattering function of a rigid rod with length L, which is termed as SrodQL, was used as the comparative reference in this study. The excess structure factor ΔSQL was defined as the deviations of SQL from SrodQL in logarithms, namely, ΔSQLlnSQLlnSrodQL.

Each simulated ΔSQL is represented by a single point in this vector space of the structure factor. Therefore, by labeling each point with the values of lA, γ, and f, one can qualitatively examine the statistical dependence of ΔSQL and X. For each simulated ΔSQL, there are 65 sampled Q points. The dimension of this vector is 65, and we further denote it as R65. Obviously, the distribution of these data points in R65 cannot be directly visualized. To facilitate the inspection of data distribution in R65, we conducted a singular vector decomposition (SVD) ( Appendix B) to identify the orthogonal coordinates. By recasting the data into the space spanned by the principal components, the intrinsic correlations of these parameters can be highlighted. Our analysis shows that the first three principal axes retain more than 95% of the variance of original data. Therefore, the vector space R3 spanned by their singular vectors SVD0, SVD1, and SVD2 given in Fig. 2(a) is sufficiently expressive to demonstrate the data correlation. The insets of Figs. 1 and 2(a) show that the conformational difference between the copolymer and the reference system of the rigid rod causes a positive deviation in the structure factor, as reflected by the shape of SVD0 and its retained variance of 70%, within the range of 1 < QL < 100. SVD1 (20%) and SVD2 (5%) are responsible for subtle changes in ΔSQL. As demonstrated in (b)–(d) of Fig. 2 and the associated insets, the data points representing the simulated ΔSQL are seen to distribute over a twisted two-dimensional manifold: The black crosses on the origin represent ΔSQL of a rigid rod with contour length L, and the red edges represent that of homopolymers characterized by different persistent lengths. Upon changing lA, γ, and f, the characteristic developments of ΔSQL are observed as reflected by the color evolution. The nonuniform data distribution reflects that the dependence of ΔSQL on these parameters is highly nonlinear, which again manifests the mathematical challenge of developing a parametric expression of ΔSQL based on lA, γ, and f.

FIG. 2.

(a) The first three singular vectors identified to provide the orthogonal coordinates for evaluating the statistical correlation between conformational parameters and SQL. Each computationally generated SQL is represented by a point in the vector space spanned by SVD0, SVD1, and SVD2. One can further label each point by the corresponding conformational parameter, and the results for lA, γ, and f are given in (b)–(d), respectively.

FIG. 2.

(a) The first three singular vectors identified to provide the orthogonal coordinates for evaluating the statistical correlation between conformational parameters and SQL. Each computationally generated SQL is represented by a point in the vector space spanned by SVD0, SVD1, and SVD2. One can further label each point by the corresponding conformational parameter, and the results for lA, γ, and f are given in (b)–(d), respectively.

Close modal

In what follows, we outline the mathematical foundation of our nonparametric approach, which establishes the relation between ΔSQL and Y probabilistically, based on Gaussian process.19–21 Readers are referred to  Appendix C for more details of the implementation. lA, γ, and f are normally distributed random variables in R, which follow multivariate Gaussian statistics. Three covariance matrices KKlA,Kγ,Kf are further modeled by a radial basis function (RBF) kernel20–22 to define the statistical relationship of each sample. It is important to mention that K is determined by the data distribution in R65. The so-called training process in the ML procedure is to identify the optimized {K} using a kernel function to quantify data correlation in R65 using ΔSQLtraining as input. For an experimentally measured SmQL, its conformational parameters Ym must follow the Gaussian statistics described by K. Judging from the similarity between ΔSmQL and SQLtraining, which is quantitatively decided by their Euclidian distance in R, Ym can be obtained by probabilistic inference. The scattering data analysis is therefore a linear operation of conditioning K by ΔSmQL to extract the desired Ym and the statistical errors as the mean and variance of the conditional univariate Gaussian distribution. It is important to verify the feasibility of this ML-based approach for inversely determining the conformation of copolymers using scattering. For this benchmarking purpose, another library of the structure factor containing 1266 samples, SQLtesting, is computationally generated based on a new set of Ytesting different from Ytraining corresponding to SQLtraining. In (a)–(c) of Fig. 3, we present the comparison of computational inputs Ytesting, which is termed as ground truths according to the convention of information science,20,21 and those inverted from conditioning K using SQLtesting. A varying degree of statistical uncertainty is first noted: We found that the inverted conformational parameters are strongly correlated to their corresponding ground truth. For lnlA, lnγ, and f, the coefficients of determination23R2 are found to be 0.9992, 0.9749, and 0.9144 and the values of mean-absolute percentage error (MAPE)24 of the inverted conformational parameters for the ground truths are found to be 0.51%, 7.05%, and 5.46% for f, respectively. The average values of the confident intervals of these parameters returned by Gaussian Process Regression (GPR) are 0.46%, 6.93%, and 6.44%, which are also of the same magnitude. The observation reflects the difference in the susceptibility of SQL toward the change in these parameters. As indicated by the probability density distributions given in (d)–(f) of Fig. 3, the majority of the extracted parameters are in quantitative agreement with the ground truths. However, as indicated by the dispersion of data, especially in (b) and (c) of Fig. 3, there remains certain SQL whose inverted conformation parameters significantly deviate from the corresponding ground truths. We found that the numerical deviation becomes more severe in copolymers characterized by more asymmetric segmental chain length, namely, larger or smaller f.

FIG. 3.

The comparison of (a) lnlA, (b) lnγ, and (c) f inverted by the ML protocol (y axis) from SQLtesting and the corresponding computational references of lnlA,lnγ,ftesting (x axis). The numerical accuracy is quantitatively evaluated by the correlation coefficient23R2, as well as the mean-absolute percentage error (MAPE)23 of the inverted conformational parameters with respect to the corresponding reference inputs. The probability density profile of errors for the inverted lnlA, lnγ, and f are given in (d)–(f), respectively. As indicated by the FWHM, the majority of predicted parameters are all narrowly distributed within the peak region centered at the ground truths.

FIG. 3.

The comparison of (a) lnlA, (b) lnγ, and (c) f inverted by the ML protocol (y axis) from SQLtesting and the corresponding computational references of lnlA,lnγ,ftesting (x axis). The numerical accuracy is quantitatively evaluated by the correlation coefficient23R2, as well as the mean-absolute percentage error (MAPE)23 of the inverted conformational parameters with respect to the corresponding reference inputs. The probability density profile of errors for the inverted lnlA, lnγ, and f are given in (d)–(f), respectively. As indicated by the FWHM, the majority of predicted parameters are all narrowly distributed within the peak region centered at the ground truths.

Close modal

To investigate the origin of this observed disagreement, MC simulations were conducted to generate SQL based on the inverted parameters (red circles in Fig. 3). The corresponding ground truths (white circles) and the results are given in Fig. 4 as red curves and black dashed curves, respectively. In general, these two sets of SQL are indistinguishable in the numerical scales of Fig. 4. The fact that they are essentially indistinguishable provides compelling evidence, demonstrating that the origin of this numerical inaccuracy is the mathematical properties of SQL, instead of our developed GP-based ML inversion algorithm. As the consequence of Fermi’s golden rule,25 the structural information offered by any elastic scattering experiment is the statistical average of static two-point spatial correlation. With the inherent information due to coarse-graining by ensemble averaging, any higher-order correlation is not explicitly registered by the structure factor in reciprocal space or, equivalently, the pair distribution function in real space. The structural difference may be more discernible in higher-order spatial correlation, which is not accessible by scattering. Moreover, although all the structural inversion problem of scattering experiments is theoretically underpinned by the existence of a one-to-one mapping between the structural variables and the two-point static correlation function,26 the numerical sensitivity of a two-point correlator to any variation in the relevant parameters still needs to be taken into consideration.27 As indicated by the residual plots given in the insets, their numerical differences are merely 2%–4% and only manifest in the range of 10 ≲ QL ≲ 100. The statistical error in small angle neutron scattering (SANS) coherent intensity within this QL range is generally around 5% within reasonable measurement time.6 The difference between these two SQL is therefore indistinguishable experimentally. Given the additional data smearing encountered in practical experimental conditions, such as instrument resolution and polydispersity of materials, it is essentially impossible to quantify the conformation of a copolymer especially for the highly asymmetric ones, precisely. The reason can also be intuitively appreciated from Fig. 1(a): Assuming that f is exceedingly large, the measured SQL can be well approximated by SAAQL. It is not surprising that the conformation of segment B cannot be accurately inverted from SQL. It is instructive to mention that this issue of structural degeneracy in spectral inversion is not unique to the conformational study of copolymers: It has been known that a fluid characterized by radially symmetric repulsive interaction potential is structurally equivalent to a hard-sphere system with an effective radius.26,28,29 Again, the static structure, in terms of two-point spatial correlation functions, is only equivalent up to a certain numerical resolution.27 

FIG. 4.

(a)–(c) The MC reconstructed SQL (red curves) based on the conformational parameters of red circles given in (a)–(c) of Fig. 3 and the black dashed curves that represent SQL of the reference inputs (white circles). The numerical difference between them is nearly indiscernible. The corresponding residual plots are given in the insets.

FIG. 4.

(a)–(c) The MC reconstructed SQL (red curves) based on the conformational parameters of red circles given in (a)–(c) of Fig. 3 and the black dashed curves that represent SQL of the reference inputs (white circles). The numerical difference between them is nearly indiscernible. The corresponding residual plots are given in the insets.

Close modal

We have outlined a ML strategy, based on the framework of the Gaussian process, to inversely determine the conformation of diblock copolymers from their coherent scattering. By treating the probability distributions of the relevant conformational parameters in the structure factor vector space, we have demonstrated that the conformation of copolymers can be inversely determined without relying on any pre-determined analytical model of scattering functions.

The present work also demonstrates that a scattering study of polymers with complex conformation is of interest in its own right: Complemented by computer simulations, the application of our proposed ML approach should be able to clarify the subtle connection between their scattering signature and the properly identified conformational parameters, thereby facilitating the related quantitative scattering characterization. Moreover, there has been considerable interest in the theoretical description of the structure of interacting polymer solutions based on the mathematical framework of the Ornstein–Zernike equation.26,30 There exists a reasonable amount of computational results to indicate the important factors to be considered in developing a proper structural description of these highly correlated systems in the context of two-point static correlation.26,30 Now that a ML inversion tool for dilute polymer solutions has been established, perhaps the more challenging inversion problems of these highly correlated systems can be addressed based on the GPR approach or other ML-based schemes for elastic scattering.31–33 

A portion of this research used resources at the Spallation Neutron Source and Center for Nanophase Materials Sciences, two DOE Office of Science User Facilities operated by the Oak Ridge National Laboratory. C.-H.T. and S.-Y.C. acknowledge support from the Ministry of Science and Technology of Taiwan under Grant No. MOST 108-2221-E007-054-MY3. Y.W. was supported by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences, Early Career Research Program Award KC0402010, under Contract No. DE-AC05-00OR22725. Y.S. was supported by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences, Materials and Science and Engineering Division. B.G.S. acknowledges support from the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences Data, Artificial Intelligence and Machine Learning at the DOE Scientific User Facilities Program, under Award No. 34532. The authors acknowledge the National Center for High-Performance Computing of Taiwan for providing computational and storage resources.

This manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (https://www.energy.gov/downloads/doe-public-access-plan).

The authors have no conflicts to disclose.

The data that support the findings of this study are available from the corresponding author upon reasonable request. The codes for MC simulation and GPR are available for evaluation at https://github.com/ch-tung/polymer_chain/tree/main/worm_like_micelle/batchscatter/block.

A discrete Kratky–Porod model34 composed of N beads connected by bonds of length l0 = L/N was used to model the worm-like chain. Based on Boltzmann statistics, the distribution of zenith angle between successive bonds follows Pθ=1Mexpaθ22sinθ, where a is the chain flexibility and M is the normalization factor.26 In this framework, the persistence length for sufficiently large N can be expressed as b=l01cosθal0. The azimuthal angle ϕ are randomly sampled from uniform distribution between 0 to 2π. Intra-chain self-intersection was checked by the criteria that the distance between two non-successive segments should not be less than 0.1b.14 Each SQ is obtained by averaging 1000 trajectories, which satisfy the condition of self-avoiding.

Through MC simulations, SQL was obtained based on the conformational parameters of logbA2,1, logγ0,2, and f0,1, respectively. Each conformation parameter was sampled uniformly from these intervals. Overall, 6331 SQL were generated by thoroughly sampling each conformational state where one fifth of generated SQL with f0.3,0.7 were assigned as the test set and excluded from the training process. The dimension of the vector space of SQL was 65 because there were 65 sampled QL points in each simulated SQL. Each SQL was therefore represented by a point in this R65 vector space. Visualization of the distribution of these points can be facilitated by dimensionality reduction: The data were arranged into a 65 × 6331 matrix F. Using singular value decomposition (SVD),23F can be decomposed into F = UΣVT, where U and V are unitary matrices, whose columns are the eigenvectors of FFT and FTF, respectively. Σ2 is a diagonal matrix whose entries are the eigenvalues of FFT. F can be further centralized, and by definition, FFT63311 is the covariance matrix C, which contains the information of correlation between different data points. C can be expressed as C=UΣ2UT63311 where the column vectors in U are the principal axes, which form an orthonormal basis in R65. The eigenvalues of Σ263311 are the percentages of the variance of the original data projected onto each corresponding principal axis. The variance of original data was found to be mostly retained by the first three singular value ranks. This principal component analysis allows us to re-express the data as a set of three orthogonal variables to extract the intrinsic correlations of original data.

In the context of Gaussian Process (GP), a function g relating S(QL) and lA, γ, and f can be formulated as gGPμ,k in terms of a prior mean function μ and a prior covariance function k. Given a training set (X, Y), where X represents the n sets of S(QL) in the training set and Y represents the corresponding regression targets of lA, γ, and f, the purpose of the ML process is to determine μ and k from the knowledge of training data. Following the standard procedure of GP,19–21 a constant function was used to specify μ. The n × n covariance matrix KXX specifies the correlations between the training data pairs modeled by the radial basis function (RBF) kernel.20–22 Specifically, for x, x′ in X, kernels kx,x as entries of KXX were formulated by the following squared exponential expression:

kx,x=expxx22l2,
(C1)

where l is the correlation length, which measures the similarity between training data points. The Gaussian observational noise term was added to the kernel matrix K = KXX + σ2I, where σ describes the variance of observational noise. Given a test point X*, the goal of GPR is to estimate Y* = g(X*), where the covariance matrix of the test set is denoted as KX*X*, and KXX* consists of entries measuring the correlations between training and test points. For simplicity, KKX, K*KXX*, and K**KX*X*. For a S(QL) measured from the ergodic conditions, the relevancy of its lA, γ, and f with those in the training sets should follow the same correlation patterns deduced from the training process. Therefore, they can be determined from this joint distribution,

YY*NμXμX*,KK*TK*K**,

where T indicates the matrix transposition and N denotes the normal distribution. The hyperparameters were determined by maximizing the log marginal likelihood using the gradient descent algorithm during training. Given the Gaussian likelihood, Y* and its variance can be predicted from the posterior pY*|X,X*,Y defined by the conditioned Gaussian distribution,

EY*|X,X*,Y=μX*+K*K1Y,
VarY*|X,X*,Y=K**K*K1K*T.

In this paper, we adopted the sklearn GaussianProcessRegressor library19,35 to implement GPR for the consideration of efficiency and easy deployment.

1.
T. P.
Lodge
, “
Block copolymers: Past successes and future challenges
,”
Macromol. Chem. Phys.
204
,
265
273
(
2003
).
2.
C. M.
Bates
and
F. S.
Bates
, “
50th anniversary perspective: Block polymers—Pure potential
,”
Macromolecules
50
,
3
22
(
2017
).
3.
A. O.
Moughton
,
M. A.
Hillmyer
, and
T. P.
Lodge
, “
Multicompartment block polymer micelles
,”
Macromolecules
45
,
2
19
(
2012
).
4.
K.
Binder
, “
Phase transitions in polymer blends and block copolymer melts: Some recent developments
,”
Adv. Polym. Sci.
112
,
183
299
(
1994
).
5.
K. E. B.
Doncom
,
L. D.
Blackman
,
D. B.
Wright
,
M. I.
Gibson
, and
R. K.
O’Reilly
, “
Dispersity effects in polymer self-assemblies: A matter of hierarchical control
,”
Chem. Soc. Rev.
46
,
4119
4134
(
2017
).
6.
Neutron, X-Ray and Light Scattering: Introduction to an Investigative Tool for Colloidal and Polymetric Systems
, edited by
Th.
Zemband
,
P.
Lindner
(
North-Holland
,
Amsterdam
,
1991
).
7.
P.
Schurtenberger
, in
Neutron, X-Rays and Light. Scattering Methods Applied to Soft Condensed Matter
, edited by
P.
Lindner
and
Th.
Zemb
(
North-Holland
,
Amsterdam
,
2002
).
8.
H.-P.
Hsu
,
W.
Paul
, and
K.
Binder
, “
Scattering function of semiflexible polymer chains under good solvent conditions
,”
J. Chem. Phys.
137
,
174902
(
2012
).
9.
H.-P.
Hsu
,
W.
Paul
, and
K.
Binder
, “
Estimation of persistence lengths of semiflexible polymers: Insight from simulations
,”
Polym. Sci., Ser. C
55
,
39
59
(
2013
).
10.
G.
Porod
, “
X-ray and light scattering by chain molecules in solution
,”
J. Polym. Sci.
10
,
157
166
(
1953
).
11.
P.
Sharp
and
V. A.
Bloomfield
, “
Light scattering from wormlike chains with excluded volume effects
,”
Biopolymers
6
,
1201
1211
(
1968
).
12.
J. A.
Schellman
, “
Flexibility of DNA
,”
Biopolymers
13
,
217
226
(
1974
).
13.
M. G.
Bawendi
and
K. F.
Freed
, “
A Wiener integral model for stiff polymer chains
,”
J. Chem. Phys.
83
,
2491
2496
(
1985
).
14.
M.
Rawiso
,
R.
Duplessix
, and
C.
Picot
, “
Scattering function of polystyrene
,”
Macromolecules
20
,
630
648
(
1987
).
15.
A. L.
Kholodenko
, “
Persistence length and related conformational properties of semiflexible polymers from Dirac propagator
,”
J. Chem. Phys.
96
,
700
713
(
1992
).
16.
S.
Stepanow
, “
Statistical mechanics of semiflexible polymers
,”
Eur. Phys. J. B
39
,
499
512
(
2004
).
17.
J. S.
Pedersen
and
P.
Schurtenberger
, “
Scattering functions of semiflexible polymers with and without excluded volume effects
,”
Macromolecules
29
,
7602
7612
(
1996
).
18.
W.-R.
Chen
,
P. D.
Butler
, and
L. J.
Magid
, “
Incorporating intermicellar interactions in the fitting of SANS data from cationic wormlike micelles
,”
Langmuir
22
,
6539
6548
(
2006
).
19.
C. E.
Rasmussen
and
C. K. I.
Williams
,
Gaussian Processes for Machine Learning
(
The MIT Press
,
Cambridge
,
2005
).
20.
K. P.
Murphy
,
Machine Learning: A Probabilistic Perspective
(
The MIT Press
,
Cambridge
,
2012
).
21.
C. M.
Bishop
,
Pattern Recognition and Machine Learning
(
Springer
,
New York
,
2006
).
22.
J.
Shawe-Taylor
and
N.
Cristianini
,
Kernel Methods for Pattern Analysis
(
Cambridge University Press
,
Cambridge
,
2004
).
23.
G.
Strang
,
Introduction to Linear Algebra
, 5th ed. (
Wellesley Cambridge Press
,
Wellesley
,
2016
).
24.
A.
de Myttenaere
,
B.
Golden
,
B.
Le Grand
, and
F.
Rossi
, “
Mean absolute percentage error for regression models
,”
Neurocomputing
192
,
38
48
(
2016
).
25.
W.
Marshall
and
S.
Lovesey
,
Theory of Thermal Neutron Scattering: The Use of Neutrons for the Investigation of Condensed Matter
(
Clarendon Press
,
Oxford
,
1971
).
26.
J.-P.
Hansen
and
I.
McDonald
,
Theory of Simple Liquids: With Applications to Soft Matter
(
Academic Press
,
Amsterdam
,
2013
).
27.
H.
Wang
,
F. H.
Stillinger
, and
S.
Torquato
, “
Sensitivity of pair statistics on pair potentials in many-body systems
,”
J. Chem. Phys.
153
,
124106
(
2020
).
28.
L.
Verlet
and
J. J.
Weis
, “
Equilibrium theory of simple liquids
,”
Phys. Rev. A
5
,
939
952
(
1971
).
29.
H. C.
Andersen
,
J. D.
Weeks
, and
D.
Chandler
, “
Relationship between the hard-sphere fluid and fluids with realistic repulsive forces
,”
Phys. Rev. A
4
,
1597
1607
(
1971
).
30.
K. S.
Schweizer
and
J. G.
Gurro
, “
PRISM theory of the structure, thermodynamics, and phase transitions of polymer liquids and alloys
,”
Adv. Polym. Sci.
116
,
319
377
(
1994
).
31.
T.
Kanazawa
,
A.
Asahara
, and
H.
Morita
, “
Accelerating small-angle scattering experiments with simulation-based machine learning
,”
J. Phys. Mater.
3
,
015001
(
2020
).
32.
M. G.
Wessels
and
A.
Jayaraman
, “
Machine learning enhanced computational reverse engineering analysis for scattering experiments (CREASE) to determine structures in amphiphilic polymer solutions
,”
ACS Polym. Au
1
,
153
164
(
2021
).
33.
P.
Tomaszewski
,
S.
Yu
,
M.
Borg
, and
J.
Rönnols
, “
Machine learning-assisted analysis of small angle x-ray scattering
,” arXiv:2111.08645v1.
34.
O.
Kratky
and
G.
Porod
, “
Röntgenuntersuchung gelöster fadenmoleküle
,”
Recl. Trav. Chim. Pays-Bas
68
,
1106
1123
(
1949
).
35.
F.
Pedregosa
,
G.
Varoquaux
,
A.
Gramfort
,
V.
Michel
,
B.
Thirion
,
O.
Grisel
,
M.
Blondel
,
P.
Prettenhofer
,
R.
Weiss
,
V.
Dubourg
,
J.
Vanderplas
,
A.
Passos
,
D.
Cournapeau
,
M.
Brucher
,
M.
Perrot
, and
E.
Duchesnay
, “
Scikit-learn: Machine learning in Python
,”
J. Mach. Learn. Res.
12
,
2825
2830
(
2011
).