X-ray computed microtomography (μCT) is an innovative, versatile, and nondestructive technique that has been used extensively to investigate bio-based systems in multiple application areas. Progress in this field has brought countless studies using μCT characterization, revealing three-dimensional (3D) material structures and quantifying features such as defects, pores, secondary phases, filler dispersions, and internal interfaces. Recently, x-ray computed tomography (CT) beamlines coupled to synchrotron light sources have also enabled computed nanotomography (nCT) and four-dimensional (4D) characterization, allowing in situ, in vivo, and in operando studies from the micro- to the nanostructure. This increase in temporal and spatial resolution produces a deluge of data that must be processed, often in real time, to provide feedback during experiments. To overcome this issue, deep learning techniques have emerged as powerful tools that automate the processing of large amounts of data and exploit the full capabilities of the beamlines. In this context, this review outlines applications, synchrotron capabilities, and data-driven processing, focusing on the urgency of combining computational tools with experimental data. We bring a recent overview of this topic to researchers and professionals working in this and related areas, as well as to readers making their first contact with x-ray CT techniques and deep learning.
I. INTRODUCTION
X-ray computed tomography (CT) is a powerful noninvasive technique that delves into the structure of materials and reveals details of their internal morphology in three spatial dimensions (3D).1 Initially, x-ray CT was employed as a pillar of medical applications, acquiring images with a spatial resolution on the millimeter scale.2–5 At the beginning of the 1980s, the first x-ray computed microtomography (μCT) images were obtained from biological structures,6 reaching a spatial resolution on the μm scale. This improvement enabled morphometric analysis and quantification, which was impractical to perform with other imaging techniques.1,6–18 In the same decade, the emergence of synchrotron light sources made it possible to map the first images of organisms using small, monochromatic x-ray sources.5,19 The continuous evolution of synchrotron beamlines has now reached the nanoscale imaging of materials, opening new perspectives in CT.14 In the early 2000s, μCT was expanded to bio-based applications to investigate plants, animals, soils, and synthetic biomaterials,19,20 considerably increasing the number of publications related to this technique.21,22 Recently, the extreme brilliance of synchrotron sources has opened new possibilities in time-resolved x-ray tomography (4DCT) and allowed a new understanding of these systems through in situ, in vivo, and in operando characterization.14,23,24 In this scenario, the first fourth-generation synchrotron light sources have emerged as unprecedented characterization tools for nano- and time-resolved CT, as their high brightness and coherence enable higher resolution and faster data acquisition (Fig. 1, top).25
Concurrently with the advances in x-ray CT, machine learning (ML) and neural networks have developed and culminated in solutions for challenging problems of model estimation on large datasets, such as those produced by x-ray CT at synchrotron beamlines. In 1958, the perceptron emerged as the first supervised learning algorithm for binary classification tasks, built around a linear prediction function.26 This development is considered the predecessor of neural networks. Then, backpropagation appeared as an effective algorithm to train feedforward neural networks by estimating the weights and biases of the model.27 In the following decades, neural networks evolved into more complex architectures, such as the multilayer perceptron, consisting of fully connected layers of neurons, and the LeNet architecture, a successful case of a convolutional neural network (CNN).28,29 Next, the dropout regularization technique was proposed to improve the training stage of neural networks by reducing overfitting.30 Finally, deep learning (DL) emerged as a powerful computational tool, allowing the design of end-to-end solutions for visual recognition tasks that demand both representation learning and predictive classification, such as the U-Net architecture.31 Recently, neural networks have increased their ability to accurately model large datasets by adopting attention mechanisms, inspired by human cognitive attention,32 and self-supervised/unsupervised learning, which aims to reduce the dependence on labeled data (Fig. 1, bottom).33
This review aims to compile introductory and recent literature to inspire and guide non-expert users of x-ray CT, as well as those with basic expertise, to advance toward state-of-the-art synchrotron capabilities and data-driven processing. To reach this goal, the review is designed as a gateway for further studies and is structured as follows: Sec. II is devoted to applications of x-ray CT, outlining research questions that have been addressed in recent studies of biological systems and resources, thus giving an overview of bio-based applications. Section III presents the frontiers of capabilities for beamlines of fourth-generation synchrotrons and how these capabilities are expanding the limits of what can be measured in space and time. Section IV discusses the deluge of data generated by cutting-edge x-ray CT beamlines and how computational techniques can handle this challenge. Section V looks further into the future and provides an outlook and conclusions for this review. Therefore, this review provides a unique perspective on the recent literature addressing bio-based systems characterized by x-ray CT, bringing together an overview of recent synchrotron capabilities and data-driven processing and updating the state of the art in these fields.
II. SELECTED BIO-BASED APPLICATIONS OF X-RAY TOMOGRAPHY
The potential scientific applications of x-ray CT are countless. This section presents a selection of applications (by no means exhaustive of the potential of the technique) related to biological systems and resources: plants, animals, soils, and synthetic bio-based materials.1,19,41 The presentation in Sec. II is driven by the research questions being addressed with the noninvasive 3D capabilities of x-ray CT, referencing studies performed both with laboratory systems and at synchrotrons, using conventional 3D imaging as well as in situ, in vivo, and time-resolved experiments. This section provides an up-to-date overview of applications, aiming to motivate further in-depth studies, including the cutting-edge advances in synchrotron and computational capabilities discussed in Secs. III and IV of this review.
A. Plants: Cell walls, wood materials, and biomass conversion
Plants have a hierarchical multiscale structure, typical of biological matter, organized from molecules to whole organisms (or even ecological systems).42,43 Plant cell walls encase the cells and are a key structural element of this hierarchy. Cell walls are of two types, primary and secondary, both made as nanocomposites based on cellulose microfibrils. The primary cell walls are synthesized during cell growth; they are thin, pliant, and highly hydrated. The secondary cell walls are deposited after cell growth ceases; they are thicker (typically 1–8 μm) and provide strength and rigidity to plant tissues.44 With vessel diameters and cell sizes of about 10–100 μm, the spatial resolution of μCT reveals the plant's cellular and tissue structures, contrasting the cell walls with the cell interiors containing air or water. Based on this common structural foundation, the use of μCT has advanced in several research fields such as plant biology, wood science, and biomass conversion into renewable fuels and chemicals, as detailed in the following paragraphs and illustrated in Fig. 2.
Imaging techniques are workhorses of plant science, and x-ray CT is opening new avenues for research, as recently reviewed by Piovesan et al.45 Figure 2(a) shows examples of images from the Arabidopsis plant, where cells of the plant tissues can be resolved in 3D.46 With this level of resolution, μCT has been employed to investigate phenomena that are inherently three-dimensional, such as mass transport. Xylem embolism, the blockage of water transport from roots to leaves that may result in plant death, has been intensively researched using μCT,47,48 including many studies conducted in vivo.49–52 Food grains and fruits also stand out as an important part of plant anatomy, for which μCT permits analysis of texture, size, shape, and internal structure, which are important parameters for food quality.45,53
Wood is a natural material used by humanity since ancient times. Related natural materials (e.g., bamboo) and engineered forms of wood (e.g., plywood) complete this picture.56 3D images from μCT have found several applications in wood science, including visualization of wood biodegradation caused by fungi [Fig. 2(b)] and identification of wooden cultural heritage.54,57 Moreover, the technique has been expanded toward in situ experiments to reveal the structural changes promoted by wood chemical pulping,58 and the transport of fluids into the complex wood pore space, with implications, for instance, for the application of wood adhesives.59–61 In situ studies have also been developed to investigate wood micro-scale deformations under mechanical stress.62,63
Plant biomass is also a vast resource that can partly replace fossil fuels with renewable alternatives. A highlight is given to residues from forestry (e.g., sawdust), agriculture (e.g., wheat straw), and agroindustry (e.g., sugarcane bagasse) usable as feedstocks for bioenergy and biorefining.64 Tomograms resolving the cellular and tissue structures have been used to reveal water retention within the cells of processed biomass65 and mineral particulate contaminants adhered to and impregnated into disrupted biomass tissues.66 Moreover, x-ray CT revealed structural changes promoted by hydrothermal treatments that enhance biomass digestibility in bioconversion processes.67 In other studies of biomass processing, μCT provided microscopic visualizations of biomass combustion,68 chemical delignification,69,70 and pyrolysis.71,72 In all these fields of application, from plant science to biomass conversion, 3D images from CT have also been used as inputs for realistic representations of biomass morphology in computational modeling [Fig. 2(c)].45,55
B. Animals: Soft and hard tissues
Soft (non-mineralized) and hard (mineralized) tissues constitute the main structural formations of animals. Hard tissues (e.g., bones and teeth) owe their stiffness to the process of tissue mineralization. Soft tissues, on the other hand, present a certain degree of elasticity, as in ligaments, muscles, and tendons, and, in some cases (e.g., internal organs), are filled mostly with aqueous solution.73 In this context, x-ray CT provides structural information on animal anatomy, supporting the investigation of functionality, development, lesions, disease progression, fractures, and regeneration, as illustrated in Fig. 3.
Considering the numerous examples of soft tissues characterized by x-ray CT, 3D images of animals assist in the quantitative analysis of mutant and experimental phenotypes and in the generation of 3D anatomical data.74 As an illustrative example, tomograms of stained heart and lung allow observation of the orientation of the fibrous muscular tissue and the blood vessels, which are essential for organ functioning, as illustrated in Fig. 3(a).75 μCT has also been employed to map the whole neuronal architecture of animal brains [Fig. 3(b)], permitting an understanding of their spatial organization and connectivity.76–78 In the area of food science, the characterization of meat as a soft tissue has also been highlighted in relation to the distribution and amount of fat in salami, beef, and lamb, which directly affect their taste and texture.79
In relation to hard tissues, μCT has been widely applied in the mapping of porous structures, lesions, fractures, and tissue regeneration. For example, tomograms of corals can reveal their structure of channels, which is crucial to understanding their directional growth.80 As illustrated in Fig. 3(c), tomograms of primate molar teeth reveal their whole structure and their displacement under load, helping to clarify the relationship between the morphology of the periodontal ligament space and tooth movement.81,82 Furthermore, rendered images of human cortical bone demonstrated the effect of bone aging on the enlargement of Haversian canals, as indicated by the colorscale in Fig. 3(d), allowing for a better understanding of the causes of bone thinning.83–85 Other examples consolidate the advantages of x-ray CT in describing bone regeneration in rat tibia,86 trabecular bone mass degeneration in mice,83 metabolic control of collagen synthesis in bone junctions,87,88 and the complex lacuno-canalicular network (LCN) structure contained in bone.89
C. Soils: Pores and rhizospheres
Soil consists of solids and voids. With this duality, the description of soil structure has followed both a solid perspective focused on the mineralogy, size, shape, and state of aggregation of the solid matter and a pore perspective focused on the size, shape, and connectivity of the pore space and its occupation by microorganisms, water, and air,90,91 as shown in Fig. 4. From the solid perspective, sizing is used to classify soil texture—sand (50 μm–2 mm), silt (2–50 μm), and clay (<2 μm)—and hierarchy of aggregation—microaggregates (20–250 μm) and macroaggregates (>250 μm). These size-based classifications indicate the key length scales in soil structure.90 From the pore perspective, x-ray μCT has opened new horizons to soil science by enabling the visualization and analysis of the pore space of undisturbed soil with spatial resolution on the μm scale.91–93
The soil pore space governs a wide range of processes, such as the transport of water, nutrients, and contaminants; stabilization of organic matter; and microbial activity. Figure 4(a) exhibits examples of pore morphologies and pore filling with water and air.94 Pore-scale imaging techniques, most notably x-ray μCT, became essential to investigate such pore-scale phenomena.94,97,98 As an example of an urgent field of study, the reduction of agricultural N2O emissions and carbon sequestration in soils are recognized by the Intergovernmental Panel on Climate Change (IPCC) as important strategies for the mitigation of climate change.99 Research in this field employed μCT to reveal the role of soil pores on the microscale mechanisms of N2O emissions,100 and the stability of soil carbon.91,101
At the interface of plant and soil sciences, the rhizosphere has also attracted substantial attention, and the use of x-ray μCT in this research field was discussed in recent reviews.45,102 These reviews highlight that root–soil interactions are critical to plant growth and crop yield, regulating the acquisition of water and nutrients by the plants. X-ray μCT is a nondestructive method that, coupled to root segmentation algorithms, can delineate the configuration of the root system in space and time. Contrast agents for imaging the organic matter in the soil [Fig. 4(b)]95,103 and image segmentation algorithms tailored for roots in soils [Fig. 4(c)] are key ongoing developments in this field.96,104 An area that may be transformed by the new imaging capabilities is the understanding of the micromechanics of root development in soil.105
D. Synthetic bio-based materials: Pores, scaffolds, and interfaces
Synthetic bio-based materials are here understood as materials that are synthesized from building blocks obtained from biological resources. In contrast to a natural material like wood, which preserves the native biological structure, the morphology of synthetic materials is given by man-made material processing. In this field of research, μCT allows the correlation of processing parameters with the 3D architecture of the obtained materials, revealing porosity, scaffold structure, and interface features that influence properties and functionalities.106,107 Aiming at a better appreciation of the diversity of synthetic bio-based materials that have been investigated by μCT, tomographic images of an organic-based aerogel, a biocarbon-based foam, polymer composites reinforced with biofillers, and green electronic components are shown in Fig. 5.
The porous architecture of organic-based aerogels depends on processing parameters and constituents, such as ice growth and additive addition, respectively. These variables set the physical–chemical conditions for the homogeneity,108 higher porosity,109 mechanical robustness,110 and even thermal resistance of organic-based aerogels.111 As an illustrative example, cellulose nanofibril (CNF)-based aerogels with natural rubber latex (NR) were produced with varying proportions of these two renewable ingredients. The μCT images of CNF/NR aerogels correlated the wall connectivity and porous microstructure with the addition of NR, as shown in the 2D slices for different CNF/NR ratios [Fig. 5(a)].112 In this case, x-ray CT helps tune the aerogel microstructure according to its application, such as metal ion capture,113,114 selective oil and organic solvent absorption,112,115,116 and pesticide incorporation for water remediation.117 The porous structure of biocarbon-based foams has also been explored for adsorbents in effluent treatment, with the relevant porosity and specific surface area analyzed from tomograms [Fig. 5(b)].118,119
With regard to polymeric composites, the dispersion, distribution, compatibility, and agglomeration of biofillers in organic matrices can be mapped by μCT. Tomograms of low-density polyethylene (LDPE) composites show the dispersion, distribution, and compatibility of the modified cellulose fibers (yellow) in the polymer matrix (red) [Fig. 5(c)]. These images enable the optimization of the filler incorporation process and of the filler distribution in the polymeric matrix, correlating them with mechanical performance,120 as also reported for polymer blends.121 In addition, in situ x-ray tomography enables the observation of dynamic physical phenomena in polymers, such as melting, bubbling, and ashing, promoting a better understanding of polymer processing.122
In relation to bioscaffolds, numerous works have addressed their porous structure in view of application to bone regeneration.124–128 Process parameters such as temperature, combined with the ratio of additives, affect the final morphology of these materials, impacting their suitability for cell attachment.124,129,130 In situ tomography during the sintering of bioscaffolds allows better control of their final architecture, such as the degree of porosity and the pore size.131 Aiming at 3D bioscaffolds that are mechanically strong (similar to bone) and have interconnectivity appropriate for tissue development, several studies explored 3D bioglass (BG) templating obtained by the sol-gel process,124 3D printing combined with renewable matrices,132,133 and organic phases (such as polymers).125 In these cases, x-ray CT has been applied to map the BG structure, observing the effect of simulated body fluid (SBF) on the internal porous structure, the effect of morphology and composition on bone formation,124–127 and even antibacterial activity.134
μCT also stands out as a relevant method for the structural characterization of green electronic devices based on biocarbon, allowing the correlation of electrical and morphological properties.123,135 Lignin and cellulose nanofiber-based carbon aerogels demonstrated a promising hierarchical structure when applied as electrodes in supercapacitors, exhibiting exceptional electrochemical performance.135 Additionally, tomographic images have been used to adjust the microfabrication process of bio-based electronic interfaces, enabling mapping, for example, of the permeabilization of pyrolyzed paper with polydimethylsiloxane (PDMS) aiming at flexible electrodes [Fig. 5(d)].123
Thereby, μCT has been consolidated as a powerful nondestructive tool to analyze bio-based materials, as demonstrated in this section. The possibility of correlating the morphology of these systems with processability, chemical treatments, biofunctionality, and electronic performance enables researchers to comprehend their final mechanical, physical, chemical, biological, and other intrinsic characteristics. On the other hand, a better understanding of these systems increasingly requires high spatial and temporal resolutions, leading to nano- and four-dimensional characterization, as discussed extensively in Sec. III.
III. THE X-RAY COMPUTED TOMOGRAPHY BEAMLINE CAPABILITIES
The year 2022 marks the 75th anniversary of the discovery of synchrotron radiation. From 1947 until today, more than 60 synchrotron laboratories and free electron laser facilities have been built around the world.136 Since 1968, when the first spectrum was measured in the Tantalus I storage ring, until today, with the recent emergence of fourth-generation facilities, more than 170 000 articles, in many different areas of knowledge, have been published using this cutting-edge tool.137 The performance of these facilities is commonly characterized by a beam property called brilliance, which expresses how well the flux is collimated and how small the source is. Brilliance is calculated as the spectral flux divided by the source cross section and the solid angle of emission.138 Green-field fourth-generation machines, such as Sirius and MAX IV, increased brilliance by two orders of magnitude compared to third-generation sources.
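For reference, a common working definition (the exact normalization conventions vary among facilities, so this is a conventional form rather than a universal one) is

\[
B = \frac{\Phi}{A_{\mathrm{source}}\,\Omega}\,, \qquad
[B] = \mathrm{photons\;s^{-1}\,mm^{-2}\,mrad^{-2}\,(0.1\%\;bandwidth)^{-1}},
\]

where Φ is the spectral flux (photons per second within a 0.1% bandwidth), A_source is the source cross-sectional area, and Ω is the solid angle into which the beam is emitted.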
Another parameter worth mentioning is beam coherence, which reflects the fact that a real beam is not a perfect plane wave. As this beam property is detailed elsewhere in the literature,138,139 we only mention that, with the advent of fourth-generation synchrotrons, the coherence and coherent flux of the beam have increased considerably compared to x-ray beams produced by previous-generation machines. This fact enables plane wave coherent diffraction imaging (plane wave CDI) experiments, which were already possible with synchrotrons of previous generations but were little explored by the scientific community because of the experimental limitations associated with the reduced sample size (a few micrometers) and long measurement times. This 3D imaging technique has great potential to increase the spatial resolution of x-ray images, with a theoretical limit of 10 nm, but it still presents validation challenges, such as the difficulty of phase determination and constraints on object dimensions.140
There are several commercially available benchtop instruments that specify pixel sizes ranging from the micro- to the nanoscale,141 and very fast scans (on the scale of seconds for a 3D image) when operating at relatively low resolution.14,19,142 In this review article, we focus on the nearly point-like x-ray sources of synchrotron beamlines. We discuss full-field x-ray imaging techniques, both in the contact (absorption) and Fresnel (phase contrast) regimes, as most of the well-established x-ray CT beamlines in the world work in these conditions—for more information regarding x-ray imaging regimes, see McMorrow and Als-Nielsen143 and Salditt et al.144 We also highlight x-ray CT at fourth-generation machines, which have recently come into operation, to discuss some of the new possibilities that this type of facility brings to scientific fields.93,145–147 Bio-based materials are usually light (low atomic number) samples for high-energy x-rays, and most x-ray microscopy techniques rely on the contact regime (e.g., x-ray attenuation) as the measurement method. Thus, it is often imperative to increase the contrast of the sample by using heavy-element solutions to impregnate soft materials (e.g., animal tissues), bringing their radiodensity closer to that of heavier materials.148,149 Iodine-based contrast enhancement is one of the most used methods, but there are plenty of solutions that can be used to better resolve particular tissues or samples according to their composition and/or fixation procedures.150 On the other hand, synchrotrons offer the possibility of exploring phase-contrast imaging techniques, which can not only reveal fine structures with subtle changes in refractive index inside the sample but also significantly reduce the deposited radiation dose. This technique relies on refraction, i.e., changes in the beam trajectory when traveling through objects composed of different materials, rather than on attenuation or absorption, and it can reveal features previously considered "invisible" to x-rays in the contact regime.144,150 At the same time, as phase retrieval does not require energy absorption, one can use higher-energy x-rays, thus reducing the radiation dose.144 Although conventional methods in the biomedical area, such as the microtomy used in histological analysis, remain the gold standard for fine anatomical studies, being able to use molecular markers to identify particular proteins or molecules, their destructive nature prevents in-depth reconstruction of the whole 3D sample.151 Thus, μCT, as a nondestructive technique, complements this information and allows fine internal characterization of these samples.
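The two contrast mechanisms mentioned above can be summarized by the complex x-ray refractive index, conventionally written as

\[
n = 1 - \delta + i\beta,
\]

where the decrement δ governs refraction (and hence the phase shift exploited in phase-contrast imaging) and β governs absorption (the contact-regime contrast). For light, low-atomic-number materials at hard x-ray energies, δ typically exceeds β by roughly two to three orders of magnitude, which is why phase-sensitive techniques can reveal features that are nearly invisible in the absorption signal while depositing a lower dose.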
Two topics will be explored in more detail in this section, with illustrative examples of applications, regarding the powerful advances enabled by fourth-generation beamline capabilities. The first is x-ray computed nanotomography (nCT), which greatly benefits from the reduction of the x-ray source size in fourth-generation synchrotrons; this reduction is essential for full-field CT measurements carried out in the contact and Fresnel regimes, as the image resolution depends on the size of the x-ray source, the beam magnification, and the pixel size of the detector.144 The second topic is time-resolved CT (4DCT), which greatly benefits from the high flux of the beam generated in synchrotron light sources.
A. X-ray computed nanotomography
With the rapid development of nanotechnology, a non-intrusive morphological technique has been required for a better comprehension of bio-based systems at the nanoscale. High spatial resolution is obtained through the use of objective lenses (e.g., Fresnel zone plates, FZP) or elliptically shaped (Kirkpatrick–Baez, KB) mirrors for x-ray phase-contrast zoom tomography.14,152 Moreover, the temporal resolution at the nanoscale has been increased by implementing Zernike phase rings, permitting faster nCT experiments with an adequate contrast-to-noise ratio.153 Conversely, innovative beamline designs in extremely brilliant fourth-generation synchrotrons use focusing optics (KB systems) that focus the x-ray beam onto a nanometric region (i.e., creating a nano-focus) and thus produce a cone beam, which magnifies the image geometrically and dispenses with magnification via indirect detection systems (i.e., visible-light microscopes). Consequently, the use of direct x-ray detectors considerably increases the detection efficiency, by a factor of ∼100 to 1000, decreasing the exposure time of each projection and increasing the temporal resolution of the measurement.154 In this context, nCT has been developed as a powerful method to describe distinct 3D nanostructures, achieving spatial resolutions below 100 nm, as described for bio-based systems in Fig. 6.
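In this cone-beam geometry, the magnification follows the standard point-projection relation (a textbook approximation rather than a beamline-specific formula):

\[
M = \frac{z_{1}+z_{2}}{z_{1}}, \qquad p_{\mathrm{eff}} = \frac{p_{\mathrm{det}}}{M},
\]

where z1 is the focus-to-sample distance, z2 is the sample-to-detector distance, p_det is the physical pixel size of the detector, and p_eff is the effective pixel size at the sample. Placing the sample close to the nano-focus (small z1) therefore yields large magnification without optical lenses downstream of the sample.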
For example, this high spatial resolution is illustrated by the 3D imaging of nanoporous bio-based materials. The complex hierarchical porous structure of algae, from the micro- to the nanoscale, was observed using nCT [Fig. 6(a)].35 Moreover, nCT revealed structural changes at multiple scales of lignocellulosic biomass promoted by hydrothermal treatments [Fig. 6(b)].67 In neuroscience, nCT has enabled imaging of neural tissue with sufficient resolution (voxel size of 30 nm) for individual neurons to be observed in Drosophila melanogaster tissue.155 The nanoscale resolution achieved in this study allowed the authors to describe individual motor and sensory neurons contained in the Drosophila leg.156 In all these cases, the miniaturization of samples was essential to ensure good spatial resolution, combined in some cases with the staining of samples with heavy metals.
B. Time-resolved x-ray computed tomography
Aiming at quantitative three-dimensional structural investigation resolved over time, 4DCT permits the correlation of morphological changes over short time steps (≤1 s) with the simultaneous exploration of mechanical, thermal, physical, and chemical events in several bio-based systems.19,157–160 In other words, numerous events can be observed in the in situ, in vivo, and in operando modes by 4DCT, as summarized in Fig. 7.
The evolution of the internal microstructure of CNF aerogels under mechanical compression is a representative example of in situ x-ray CT imaging [Fig. 7(a)]. The renderings of the compressed aerogels exposed cell-wall buckling/bending mechanisms, followed by the total collapse of the aerogel walls.161 Other complex multiphase systems have been characterized in situ by x-ray CT, such as crystallization in basaltic magmas,162 grain compaction,163,164 wood micro-scale deformations under mechanical stress,62,63 combustion of wood,165 phosphorous release in soil,166 fluid flow in porous media applied to enhanced oil recovery167 or to in situ groundwater remediation [Fig. 7(b)],168–172 collapse of a water drop within a sucrose suspension,173 and the bubbling of beer foam over time [Fig. 7(c)].36 Almost all of these in situ analyses required the development of setups that permit carrying out the tests during the x-ray scans.161
In vivo analysis is another important procedure to understand the physiology of various biological systems. Gradual movements of the sample can be captured over seconds by in vivo analysis, generating a tomographic movie.174 The same analysis can be performed for mitral valve deformation175,176 and for bone strain under mechanical compression in short-time-step acquisitions.177 It is important to note that in vivo analysis requires an acquisition time shorter than the sample movement to avoid motion artifacts in the tomograms.174,178 Consequently, an equivalent dose is delivered in a short time, increasing the so-called dose rate, which is critical for biological systems.179,180 Its effects are not completely understood, but cells and organs may be severely affected, prohibiting very fast 3D measurements in live systems. The same concern applies when using a cone beam since, as the resolution increases, the illuminated area decreases and the dose increases. In summary, every detail of the measurement (e.g., beam geometry, beam profile, peak energy, energy resolution, illuminated area, and scan time) in biological systems must be considered and tested (or simulated) before the experiment to verify its feasibility.
In the area of organic electronics, the in operando mode can be performed by combining x-ray imaging with electrical characterization techniques, such as electrochemical impedance spectroscopy (EIS).181,182 In operando studies of organic electronics address the morphological changes of polymeric electrolytes and of the cathode catalysts of fuel cells.182–184
Even though numerous examples characterized by nCT and 4DCT have been presented in this section, these techniques are still little explored compared to conventional (benchtop) x-ray CT. This scenario correlates with the challenge of real-time processing of the large amounts of data generated by x-ray CT coupled to synchrotron light sources, which is discussed extensively in Sec. IV.
IV. TOMOGRAPHIC DATA PROCESSING CHALLENGES: REAL-TIME DEEP LEARNING TECHNIQUES ON LARGE DATASETS
With the large amount of data generated at synchrotron beamlines, semi-manual processing is time-consuming and unfeasible.185 Furthermore, a single tomographic image can have hundreds or thousands of classes for segmentation, making semi-manual segmentation even more impractical.186–188 There are also challenges in data storage and handling, but in the context of our discussion of techniques and tools for acquiring and processing x-ray CT data, these topics fall outside our scope. To address the challenge of data processing at synchrotron beamlines, machine learning (ML) has emerged as a powerful solution and will be covered in this section. ML is a subfield of artificial intelligence that provides algorithms and techniques designed to teach machines to perform tasks by identifying correlations and patterns in a given set of data to acquire knowledge.189 The learned relations are consolidated into a machine learning model that can be used for predictions or to understand the patterns in unknown data. In the context of tomographic data processing, ML has been widely used for a variety of tasks, such as the removal of artifacts190–192 and assistance in digital image segmentation and recognition.188,193–195
We can categorize machine learning techniques into two main types: supervised and unsupervised approaches.189 Supervised learning techniques learn a task and estimate a machine learning model by taking as input a set of labeled data to learn from, i.e., data containing both the inputs and the outputs. The most common supervised tasks are classification and regression, for learning categorical and numerical outputs, respectively. Examples of supervised learning algorithms include ridge regression, random forests,196 K-nearest neighbors,197 support vector machines,198 and the multilayer perceptron.28 In turn, unsupervised learning learns a task by searching for structures and hidden patterns using only input data, without outputs (labels). The most common unsupervised tasks are dimensionality reduction and clustering. Examples of well-known unsupervised learning algorithms include principal component analysis (PCA), K-means, hierarchical clustering, and autoencoders.199–201 Nowadays, deep learning-based techniques, defined as neural networks with multiple hidden layers, have emerged as state-of-the-art ML approaches due to their high capability of generalization, flexibility, and accuracy. Unlike many classic ML techniques, however, DL-based approaches require large amounts of data to estimate the thousands or millions of parameters that compose the models, making this kind of technique a natural choice to solve image segmentation problems at synchrotron beamlines.31,202
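To make the distinction concrete, the sketch below contrasts a supervised and an unsupervised workflow using scikit-learn. It is a minimal illustration on synthetic feature vectors (e.g., per-voxel descriptors one might extract from tomograms), not a recipe taken from the cited works, and all variable names are hypothetical.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.decomposition import PCA
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
features = rng.normal(size=(1000, 16))          # e.g., per-voxel texture descriptors
labels = features[:, 0] + features[:, 1] > 0    # known phase labels (supervised case)

# Supervised: learn a mapping from inputs to known outputs (classification).
clf = RandomForestClassifier(n_estimators=100).fit(features[:800], labels[:800])
print("held-out accuracy:", clf.score(features[800:], labels[800:]))

# Unsupervised: no labels; look for structure via dimensionality reduction + clustering.
embedded = PCA(n_components=2).fit_transform(features)
clusters = KMeans(n_clusters=2, n_init=10).fit_predict(embedded)
print("cluster sizes:", np.bincount(clusters))
```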
Considering the context of large data volumes generated at synchrotron beamlines, deep learning (DL) approaches emerge as a promising solution that benefits from these large-scale datasets and, more importantly, can deal with the diverse computer vision problems related to tomographic image processing and understanding. Among the several deep learning techniques, convolutional neural networks (CNN) are established in the processing of complex computational data concerning visual computing tasks.203,204
Convolutional neural networks are a powerful type of neural network that comprises a hierarchical structure built from stacked layers capable of generating local spatial features by leveraging the fact that nearby information (pixels in an image) is typically correlated.31 This is referred to in the literature as local spatial information. Another important characteristic of a CNN is its ability to produce low-level features (e.g., edges and corners) in the earlier layers and high-level features (e.g., more complex structures) in the later layers.
Another key aspect of this technique relies on the possibility of devising end-to-end computer vision applications, which include finding suitable representations for the input data (representation learning) and designing classifiers to operate on top of those representations (classification). In contrast to the classic machine learning pipeline, which considers the design of these two phases separately, deep learning techniques allow representation learning and classifiers to be designed together in a single framework, whose trainable parameters are estimated considering the synergy of these two phases. This favors the construction of sophisticated domain-specific learning methods without the need for hand-crafted designs, which are usually engineering intensive. The CNN must be trained before it can automatically process new input data. Training produces an error with respect to the reference data: the difference between the model approximation and the reference data, defined by a loss function, is minimized via backpropagation with stochastic gradient descent optimizers, which search for a minimum of this loss function.204–206 Furthermore, the learning rate needs to be adjusted, typically through optimization steps that adapt it as the network calculates errors.207
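The training procedure just described maps directly onto a few lines of a deep learning framework. The sketch below, written with PyTorch as an assumption (the review does not prescribe a framework), shows a small CNN trained with a cross-entropy loss, backpropagation, a stochastic gradient descent optimizer, and a scheduler that adapts the learning rate; the dummy data loader is purely illustrative.

```python
import torch
import torch.nn as nn

# A small CNN: early layers extract low-level features, later layers combine
# them into higher-level features used for the per-patch classification.
model = nn.Sequential(
    nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(),
    nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(),
    nn.Linear(32, 2),                       # two output classes, e.g., pore vs. solid
)
loss_fn = nn.CrossEntropyLoss()             # difference between prediction and reference
optimizer = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=2, gamma=0.5)

# Dummy stand-in for a real loader of labeled image patches (hypothetical data).
loader = [(torch.randn(8, 1, 64, 64), torch.randint(0, 2, (8,))) for _ in range(5)]

for epoch in range(3):
    for patches, targets in loader:
        optimizer.zero_grad()
        loss = loss_fn(model(patches), targets)
        loss.backward()                     # backpropagation of the error
        optimizer.step()                    # gradient-descent update of weights and biases
    scheduler.step()                        # adapt the learning rate during training
```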
The success of CNNs came from technological advances on one hand, represented by the efficient use of fast graphics processing units (GPUs),208 and theoretical and algorithmic advances on the other,209 represented by the use of rectified linear units (ReLUs)210 and the dropout regularization technique.211 CNN architectures may have 10–20 layers of ReLUs, hundreds of millions of weights, and billions of connections between units, which provide efficiency in computational data processing.31,212 Therefore, DL has provided great advances in the learning processes of computer vision tasks containing high-dimensional data. In other words, DL can build its own computational model from a training dataset and even take advantage of classic algorithms such as the filtered back projection (FBP).213 Considering tomographic images, DL allows the processing of unknown images, as opposed to conventional algorithms based on a known routine. In terms of applications, DL can be applied to improve image quality, recognize and classify objects, segment phases of images, and generate labels automatically,214 as shown in the comparative scheme in Fig. 8.
A. Noise reduction and restoration in x-ray tomography
DL has been used to deal with problems of image quality that arise under certain x-ray imaging conditions, such as low-dose signals and sparse-angle acquisition.215–217 The main challenge of dealing with low-dose x-ray imaging is the decreasing signal-to-noise ratio.217 In this case, the image can suffer from several artifacts, including streaking.218–232 Furthermore, ring artifact correction233,234 and Gaussian noise reduction are commonly applied to improve the quality of tomographic images.
In this direction, U-Net-like architectures235 have been proposed to take advantage of both prior and phase-solved reconstructions to improve the quality of tomographic images. In addition to the U-Net architecture, other kinds of DL have been proposed toward improving the quality of x-ray-based data, such as generative adversarial networks (GAN).236,237 GANs are composed of two main components: the discriminator and the generator networks. The first component is responsible for learning noise properties from the original noisy data to guide the generator network, which, in turn, is responsible for denoising an input image and generating noisy synthetic images. The synthetic images are then used to challenge the discriminator, improving its ability to discriminate noisy images from clean images. The main differences among GAN variants lie in the way the discriminator and generator are estimated. For further reading on this topic, we recommend additional references.238–240 Finally, for synchrotron nCT, DL can mitigate random jitter effects, enhancing spatial resolution and reducing acquisition time.241–243
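As a concrete, deliberately simplified illustration of the adversarial idea, the sketch below trains a denoising generator against a discriminator on paired noisy/clean slices. It is one common variant among the several GAN formulations cited above, not a reproduction of any of them; the tiny networks and the synthetic data pairs are assumptions for illustration only.

```python
import torch
import torch.nn as nn

class Generator(nn.Module):          # denoiser: noisy slice -> estimated clean slice
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(1, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, 1, 3, padding=1),
        )
    def forward(self, x):
        return self.net(x)

class Discriminator(nn.Module):      # judges whether a slice looks like a clean one
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(1, 32, 3, stride=2, padding=1), nn.LeakyReLU(0.2),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.LeakyReLU(0.2),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(64, 1),
        )
    def forward(self, x):
        return self.net(x)

g, d = Generator(), Discriminator()
opt_g = torch.optim.Adam(g.parameters(), lr=1e-4)
opt_d = torch.optim.Adam(d.parameters(), lr=1e-4)
bce = nn.BCEWithLogitsLoss()

# Synthetic noisy/clean pairs standing in for a real paired dataset (illustrative).
clean_stack = torch.rand(10, 4, 1, 64, 64)
loader = [(c + 0.1 * torch.randn_like(c), c) for c in clean_stack]

for noisy, clean in loader:
    # Discriminator step: tell clean slices apart from denoised (generated) ones.
    fake = g(noisy).detach()
    loss_d = bce(d(clean), torch.ones(clean.size(0), 1)) + \
             bce(d(fake), torch.zeros(fake.size(0), 1))
    opt_d.zero_grad(); loss_d.backward(); opt_d.step()
    # Generator step: fool the discriminator while staying close to the reference.
    denoised = g(noisy)
    loss_g = bce(d(denoised), torch.ones(noisy.size(0), 1)) + \
             nn.functional.l1_loss(denoised, clean)
    opt_g.zero_grad(); loss_g.backward(); opt_g.step()
```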
B. Image segmentation in x-ray tomography
The flexibility of x-ray imaging techniques in measuring physical properties of different material types imposes several challenges in designing effective segmentation methods for a wide range of problems or target applications (e.g., porous media in soils and discrimination of biological tissues). In this regard, DL has been used successfully to devise methods able to deal with complex problems. Fully convolutional networks (FCN) and U-Net networks are prominent architectures for designing end-to-end solutions for several tomography-related problems.235,244 In Xu's work, an FCN was used to devise an image decomposition algorithm capable of segmenting bones from tissues in a dual-energy CT application.245 Ben-Cohen et al. proposed the use of stacked FCN networks to design an end-to-end solution for liver segmentation and lesion detection tasks.246 Similarly, a cascade of FCN networks was employed to segment human abdominal organs, including arteries, veins, the stomach, and the pancreas, among others. The U-Net approach has also been used in a wide range of x-ray-based applications, such as soil analysis and x-ray-based medical image segmentation.247,248
The great ability of both FCN-like and U-Net-like architectures to produce accurate segmentation maps lies in the introduction of skip-connection mechanisms. In both approaches, skip connections help estimate accurate segmentation maps, especially at boundaries, by recovering fine-grained details, even in deep architectures. This is because both kinds of architecture are composed of two sub-networks, an encoder and a decoder. He et al.249 later suggested that skip connections (also known as residual connections) may also prevent the vanishing gradient problem in deeper architectures.
Usually, the encoder sub-network consists of sequences of layers composed of two 3 × 3 convolutions, each followed by a ReLU activation function, batch normalization, and a max pooling operation, although other backbone networks can be used as the encoder (in Sec. IV C we discuss backbone alternatives aimed at real-time processing). On each layer, the encoder thus reduces the spatial dimensions and increases the number of channels or feature maps. As a result, the encoder sub-network learns meaningful features that help predict the class of each pixel but loses the ability to perform fine-grained segmentation.250 In turn, the decoder sub-network increases the size of the feature maps and decreases the number of channels to recover spatial information and produce fine-grained segmentation. To achieve this goal, a decoder sub-network comprises an up-sampling operation (e.g., bi-linear interpolation) of the feature maps followed by a 2 × 2 transpose convolution, and a concatenation operation that fuses the feature maps from the corresponding encoder stage via skip connections. The ability of this kind of architecture to produce highly accurate segmentation maps makes the U-Net a reference method for image segmentation, especially in x-ray-based medical image analysis, and has inspired the design of several other architectures, for instance, the V-Net,251 which was designed to handle volumetric data. For more details on the U-Net and its variants, we recommend the surveys by Siddique et al.252 and Hesamian et al.253
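The encoder/decoder pattern described above can be written compactly. The sketch below is a minimal, two-level U-Net-style network, far shallower than the published architectures, assuming PyTorch and single-channel tomographic slices; it is an illustration of the structure, not an implementation from the cited works.

```python
import torch
import torch.nn as nn

def double_conv(c_in, c_out):
    # two 3x3 convolutions, each followed by batch normalization and ReLU
    return nn.Sequential(
        nn.Conv2d(c_in, c_out, 3, padding=1), nn.BatchNorm2d(c_out), nn.ReLU(),
        nn.Conv2d(c_out, c_out, 3, padding=1), nn.BatchNorm2d(c_out), nn.ReLU(),
    )

class TinyUNet(nn.Module):
    def __init__(self, n_classes=2):
        super().__init__()
        self.enc1 = double_conv(1, 32)
        self.enc2 = double_conv(32, 64)
        self.pool = nn.MaxPool2d(2)                        # encoder: halve spatial size
        self.up = nn.ConvTranspose2d(64, 32, 2, stride=2)  # decoder: 2x2 transpose conv
        self.dec1 = double_conv(64, 32)                    # 64 = 32 (skip) + 32 (upsampled)
        self.head = nn.Conv2d(32, n_classes, 1)            # per-pixel class scores

    def forward(self, x):
        s1 = self.enc1(x)                    # fine-grained features kept for the skip
        s2 = self.enc2(self.pool(s1))        # deeper, more semantic features
        up = self.up(s2)                     # recover spatial resolution
        cat = torch.cat([s1, up], dim=1)     # skip connection: concatenate encoder maps
        return self.head(self.dec1(cat))

logits = TinyUNet()(torch.randn(1, 1, 128, 128))   # -> shape (1, 2, 128, 128)
```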
C. Drawbacks and limitations of DL-based approaches in x-ray applications
Each analyzed sample requires different x-ray CT image acquisition parameters, such as radiation dose, acquisition time, detector resolution, and x-ray source. Additionally, different CT scans of the same sample will produce different datasets, i.e., with different texture, contrast, intensity, morphology, artifacts, and even noise characteristics. These parameters lead to a unique image domain. A potential downside of DL as a data-driven method is the inaccuracy of outputs obtained on image domains different from the training domain, usually proportional to the distance between the domains. This problem is known as distributional shift.254–258
To address distributional shift, various techniques have been proposed, such as domain adaptation and data augmentation. Data augmentation increases the variety and size of the dataset by scaling, rotating, changing the contrast of, and adding noise to the original data.259–263 In domain adaptation, the model is trained to work in a new image domain, usually with adversarial neural networks such as generative adversarial networks (GANs), which can also be used for data augmentation through image synthesis.264–267 The advantage of these methods is that they can operate in an unsupervised manner. This is particularly valuable since most DL-based image segmentation of synchrotron x-ray CT data relies on producing a ground truth specific to each dataset and image domain in order to train a CNN such as the U-Net, which in turn requires an expert in the particular field of the sample. Works on medical images are leading the recent advances in DL-based methods for x-ray CT data, compared to reports related to synchrotron facilities. The design of efficient DL architectures is incipient in recent synchrotron x-ray-based developments, which mainly use threshold-based segmentation techniques. While such techniques are less computationally expensive than DL-based methods, they lack accuracy in solving complex segmentation problems. In turn, medical CT benefits from recent developments in deep learning to devise advanced methods for x-ray image segmentation. However, in both research communities, recent works focus on achieving high accuracy to the detriment of efficiency.
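A minimal example of the augmentation operations mentioned above (scaling, rotation, contrast changes, and added noise), written with torchvision transforms as an assumption; the parameter ranges are illustrative, not values taken from the cited studies.

```python
import torch
from PIL import Image
from torchvision import transforms

class AddGaussianNoise:
    """Add zero-mean Gaussian noise to a tensor image (illustrative noise model)."""
    def __init__(self, std=0.05):
        self.std = std
    def __call__(self, img):
        return img + self.std * torch.randn_like(img)

augment = transforms.Compose([
    transforms.RandomAffine(degrees=15, scale=(0.9, 1.1)),  # rotation and scaling
    transforms.ColorJitter(contrast=0.2),                   # contrast variation
    transforms.ToTensor(),
    AddGaussianNoise(std=0.05),                              # synthetic detector noise
])

# Usage on a dummy grayscale slice; in practice the pipeline is applied to each
# training slice before batching, multiplying the effective dataset size.
slice_img = Image.new("L", (64, 64))
augmented = augment(slice_img)          # -> tensor of shape (1, 64, 64)
```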
Nowadays, several open-source software packages implement DL-based image segmentation methods, such as Biomedisa230 and Annotat3D.262 However, these packages still depend on a high-performance computational infrastructure, limiting their use to a few laboratories. In this context, other open-source platforms resurface, such as Caffe, currently established in the plug-and-play cloud-based DL solution CDeep3M,268 which democratizes access to data segmentation for users without access to high-performance computing resources. The development of increasingly advanced and optimized DL-based algorithms makes it possible to extract the maximum amount of information from tomographic data, showing the interdependence between advances in experimental capabilities and computation.
D. Designing deep learning architectures for real-time data processing
Real-time segmentation is an essential component in several computer vision applications, including autonomous vehicles269,270 and robotics.271 The requirement of a fast response for fast decision-making has motivated several studies on understanding the learning processes of CNNs and the mechanisms that lead to the most effective and efficient architectures. In the context of x-ray image analysis at synchrotron beamlines, the need for real-time data analysis arises from demands for processing large datasets with low response latency. Modern synchrotron beamlines that produce time-resolved x-ray CT with high spatial resolution require algorithms that leverage the power of high-performance computing (HPC) environments to enable real-time applications, such as artificial intelligence-assisted segmentation and scientific visualization of large datasets.272 In addition to the use of HPC environments, the approaches discussed below can also be used to devise deep learning architectures that run on devices with computational constraints, such as FPGAs.273,274
This section aims to provide a comprehensive overview of the most prominent approaches for designing efficient yet effective CNN architectures for real-time x-ray image analysis. Next, we discuss some approaches that have been used to reach this goal.
1. Lightweight backbones
The development of lightweight backbones is one of the earliest approaches for achieving real-time segmentation. A CNN architecture can be understood as two components: representation learning and classification. The representation learning component aims to extract features and, thus, automatically discover representations (or subspaces) that encode information useful for the classification task. Usually, the representation learning component is called the backbone of the network, and it concentrates most of the layers that compose a CNN architecture, requiring most of the computation. In turn, the classification component contains fewer layers and, thus, is expected to require fewer computations. Therefore, lightweight backbones are backbones that require fewer computational resources to extract meaningful representations, which directly impacts the overall computations, including the estimation of values for trainable parameters. For this reason, the number of parameters and the number of floating-point operations (FLOPs) performed by the network are the two main metrics used to compare the efficiency of CNNs.275
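As a usage example of the first of these metrics, the snippet below compares the parameter counts of a conventional backbone and a lightweight one from torchvision; it is a sketch for illustration, and any pair of backbones could be substituted.

```python
from torchvision import models

def count_params(model):
    # total number of trainable parameters, the first efficiency metric above
    return sum(p.numel() for p in model.parameters() if p.requires_grad)

heavy = models.resnet50(weights=None)             # conventional backbone
light = models.mobilenet_v3_small(weights=None)   # lightweight backbone
print(f"ResNet-50 parameters:         {count_params(heavy) / 1e6:.1f} M")
print(f"MobileNetV3-Small parameters: {count_params(light) / 1e6:.1f} M")
```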
In a recent study, Tan and Le276 proposed a family of CNNs called EfficientNets, which produce classification results with models about 8.4× smaller and 6.1× faster at inference than comparable non-optimized architectures. To achieve these results, the authors performed a systematic study of model scaling to identify strategies for decreasing the number of network parameters by considering three aspects: the network depth, the network width, and the resolution of the input data, as illustrated in Fig. 9. Nowadays, there is strong evidence that the depth of a neural network plays an important role in generating more complex and richer features, which is desirable for segmenting complex objects or regions.249,276,277 In turn, both the width of a neural network and the resolution of the input data can produce more fine-grained representations that lead to better classification results, especially on the boundaries of an object or region of interest, where most misclassifications usually lie. With this in mind, the authors proposed a compound coefficient that allows the design of efficient and effective backbones. Finally, Fan et al. proposed FFBNet, which combines a lightweight backbone with a feature fusion block, essentially an approach for feature aggregation.275
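In the notation of that work (our summary of the cited formulation), the compound coefficient φ scales the three dimensions jointly,

\[
\text{depth: } d = \alpha^{\phi}, \qquad
\text{width: } w = \beta^{\phi}, \qquad
\text{resolution: } r = \gamma^{\phi},
\]

subject to \(\alpha\,\beta^{2}\,\gamma^{2} \approx 2\) with \(\alpha, \beta, \gamma \geq 1\), so that increasing φ by one approximately doubles the computational cost while distributing the additional capacity across depth, width, and input resolution in a balanced way; the constants are determined by a small grid search in the original work.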
2. Feature aggregation
Feature aggregation aims to enrich representations by aggregating light and powerful detectors, usually built upon lightweight backbones, toward improving accuracy with minimal efficiency loss. As illustrated in Fig. 10, the idea of fusing features from different layers of lightweight backbones emerged to make CNN architectures effective without losing efficiency. As a successful example of this approach, we can mention Fan's work,275 in which the authors proposed a feature fusion block method with two main components: the feature aggregation block (FAB) and the dense feature pyramid (DFP). Similar to Tan and Le,276 the authors observed that lightweight backbones may not produce representations with enough spatial and semantic information, which significantly impacts the qualitative results of classification. The authors observed that shallow backbones can produce rich feature maps, but these lack spatial information. To overcome this limitation, they proposed fusing feature maps by selecting layers from different depths of the backbone, since earlier layers are known to capture low-level features, while deeper layers capture high-level features. As a result, the aggregation of these layers, with their different spatial scales and high-level semantic features, can produce strong representations.
Finally, the dense feature pyramid (DFP) component aims to improve multi-scale detection by concatenating high-level layers with shallow layers, enhancing the detection of small objects. In comparison with similar strategies, such as dense connections,278 the DFP component has fewer trainable parameters, which favors the construction of lightweight backbones.
3. Attention mechanisms
Attention is an important concept in artificial neural networks, and it is present in several state-of-the-art algorithms such as BERT,279 Transformers,280 and Vision Transformers (ViT).281 Inspired by the cognitive attention of humans, studies on attention mechanisms in computer vision focus on guiding the training process of CNNs by enhancing the parts of the input data that really matter for learning a specific task,282 while ignoring non-relevant information. This concept was introduced by Bahdanau et al.283 as a mechanism to improve neural machine translation systems. Later, Xu et al.284 proposed an adapted version of this mechanism for computer vision tasks, including object detection and image caption generation. Nowadays, the algorithms for attention mechanisms in computer vision and image understanding can be categorized according to the kind of information, or domain, on which the network concentrates its attention, which includes spatial attention (where to pay attention), channel attention (what to pay attention to), temporal attention (when to pay attention), and, finally, branch attention (which task to pay attention to).
Recently, the adoption of attention mechanisms in real-time segmentation algorithms has emerged as an effective approach to designing attention with low computational cost.32 According to the authors, to reach high-accuracy detections, a CNN model must be able to produce representations with a rich spatial context, which in turn requires large receptive fields, and with fine spatial details, which favor fine-grained segmentation and, thus, the segmentation of high-resolution inputs. However, both requirements are usually achieved with structures that demand high computational costs. To overcome this limitation, the authors introduced a fast spatial attention mechanism, using self-attention to capture rich spatial context with a lower computational effort. Also, to segment high-resolution inputs efficiently, the authors reduce the intermediate feature stages of the network and fuse the features of such stages.
Finally, a more recent attention-based approach proposes a two-branch network, BiAttnNet, with a Bilateral Attention structure that concentrates all attention modules in a Detail Branch dedicated to selecting semantic details. The Detail Branch is built entirely from AttnTrans modules, which provide an alternative to regular convolutions. An AttnTrans module can be understood in three parts: spatial attention, channel attention, and tensor reshaping. In summary, spatial attention is obtained by averaging the input tensor over the channel dimension, while channel attention is obtained by averaging over the spatial dimensions. The two attention maps are applied to the input tensor by multiplying all three together. Then, a tensor reshaping operation is applied using a 1 × 1 convolution.285
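A minimal sketch of the attention-by-averaging idea just described, assuming PyTorch; it follows the textual description above rather than the published BiAttnNet code, and the module and variable names are hypothetical.

```python
import torch
import torch.nn as nn

class AvgAttention(nn.Module):
    """Spatial and channel attention by averaging, followed by a 1x1 convolution."""
    def __init__(self, channels):
        super().__init__()
        self.reshape = nn.Conv2d(channels, channels, kernel_size=1)  # tensor reshaping part

    def forward(self, x):                            # x: (batch, channels, height, width)
        spatial = x.mean(dim=1, keepdim=True)        # average over channels -> spatial map
        channel = x.mean(dim=(2, 3), keepdim=True)   # average over space -> channel map
        attended = x * spatial * channel             # multiply all three together
        return self.reshape(attended)                # 1x1 convolution

out = AvgAttention(32)(torch.randn(2, 32, 64, 64))   # -> shape (2, 32, 64, 64)
```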
V. CONCLUSION AND OUTLOOK
This review highlights the emerging trends and applications of μCT to characterize diverse bio-based systems, such as plants, animals, soils, and synthetic bio-based materials. The fundamental design of x-ray CT coupled to synchrotron beamlines is addressed, considering their capabilities related to nCT as well as to in situ, in vivo, and in operando 4DCT characterization. To overcome challenges related to tomographic data, advanced computational algorithms based on machine learning and deep learning are outlined in this review, focusing on the urgency of combining computational tools with experimental data.
Future prospects for x-ray CT point toward increased access to fourth-generation synchrotron beamlines around the world, requiring robust setups, tomographic data management, and processing pipelines for fast and accurate analysis. While there are several strategies to design deep learning architectures for real-time data processing, as pointed out in this review, we observe that the adoption of those strategies to tackle x-ray CT-related problems is still incipient, especially for the x-ray CT produced by modern synchrotron beamlines, which can generate images with very high resolution, thus revealing the nanostructures of materials.
We hypothesize that the main challenges in adopting modern machine learning methods for x-ray tomographic data produced by modern synchrotrons are twofold: the lack of reliable ground-truth data, especially for nano-tomographic data, since most deep learning-based methods work in a supervised regime; and the lack of both explainability and interpretability of complex machine learning models, which makes application-oriented analysis by specialists difficult. We believe that recent advances in self-supervised and unsupervised learning,286,287 and in the explainability of machine learning models, may mitigate the aforementioned challenges.288,289 In summary, we believe that state-of-the-art methods for tomographic data must be able to handle large volumes of data with as low latency as possible; operate in scenarios in which manual data labeling is very costly and time consuming; and provide explainable, or at least interpretable, models so that specialists can judge when to trust the predictions and inferences made by complex models, enabling a more confident use of AI to answer research questions in the relevant field of expertise.
ACKNOWLEDGMENTS
The authors would like to thank the Brazilian National Council for Scientific and Technological Development-CNPq (DTI-A/SisNANO Grant Nos. 380312/2020-4, 302334/2022-0 and 380173/2022-0), INCT Nanocarbono, and the São Paulo Research Foundation—FAPESP (Grant Nos. 2014/50884-5, 2017/18139-6, 2017/02317-2, 2018/16453-8, 2020/08651-4, and 2021/03097-1) for supporting this work.
AUTHOR DECLARATIONS
Conflict of Interest
The authors have no conflicts to disclose.
Author Contributions
Pedro Ivo Cunha Claro, Egon Borges, Gabriel Ravanhani Schleder, Nathaly Archilha, Allan Pinto, Murilo Carvalho, Carlos Driemeier, Adalberto Fazzio, and Rubia Figueredo Gouveia contributed equally to this work.
Pedro Ivo Cunha Claro: Conceptualization (equal); Investigation (equal); Methodology (equal); Project administration (equal); Supervision (equal); Visualization (equal); Writing – original draft (equal); Writing – review & editing (equal). Egon Borges: Writing – original draft (equal); Writing – review & editing (equal). Gabriel Ravanhani Schleder: Writing – original draft (equal); Writing – review & editing (equal). Nathaly Archilha: Writing – original draft (equal); Writing – review & editing (equal). Allan Pinto: Writing – original draft (equal); Writing – review & editing (equal). Murilo Carvalho: Writing – original draft (equal); Writing – review & editing (equal). Carlos Driemeier: Writing – original draft (equal); Writing – review & editing (equal). Adalberto Fazzio: Writing – review & editing (equal). Rubia Figueredo Gouveia: Conceptualization (equal); Funding acquisition (equal); Investigation (equal); Methodology (equal); Project administration (equal); Supervision (equal); Visualization (equal); Writing – original draft (equal); Writing – review & editing (equal).
DATA AVAILABILITY
The data that support the findings of this study are available from the corresponding author upon reasonable request.