Strategies for effective transfer learning for HRTEM analysis with neural networks

Simulated dataset generation

Simulated datasets were generated using Construction Zone Rangel DaCosta et al., 2024 for structure sampling and generation, and a combination of the Prismatic Rangel DaCosta et al., 2021 and abTEM Madsen & Susi, 2021 software packages for image simulation. HRTEM simulation was performed using the multislice algorithm, with an accelerating voltage of 300kV, a real space sampling of 0.1 A˚\text{\AA} , and 8 frozen phonons. Images were then downsampled to a resolution 0.2 A˚\text{\AA} , after which defocus and electron dosage effects were applied; for all images, we applied chromatic aberration effects corresponding to focal spread of 10 A˚\text{\AA}.

For each structure, we simulate the full structure and the nanoparticles alone in vacuum. We generate ground-truth segmentation labels using the nanoparticle-in-vacuum simulation by applying a phase threshold against the exit wave, which generates masks that are consistent across imaging conditions and apply a small Gaussian blurring kernel to smoothen the segmentation label. We visually optimized the threshold to π/16\pi/16 for all structures, and the Gaussian kernel sizes to 1 A˚\text{\AA} , for the C, Fe, and Co3_3O4_4 structures, and 1.6 A˚\text{\AA} for all other structures. We calibrate zero defocus per structure to the minimum contrast condition of the nanoparticles in vacuum, and determined heuristically with a two-stage grid search of defocus values with grid samplings of 1 and 0.1 A˚\text{\AA} and using the variance of the image intensity as a proxy for image contrast. Electron doseage effects are applied via sampling of a Poisson distribution after scaling pixel values by the effective dose. Before training, input images are standardized to a mean of 0 and variance of 1, per image.

Source:Jupyter Notebook
Example training images from datasets used in this study. Scale bar is 2nm.

Figure 2.1:Example training images from datasets used in this study. Scale bar is 2nm.

Three dataset series were constructed in total: one varying in image defocus, one varying in electron doseage, and one varying in the nanoparticle structures. Example images from each dataset can be interactively viewed and compared in Figure 2.1. Across all datasets, nanoparticles were placed atop 12.8nm by 12.8nm amorphous carbon substrates between 2nm and 8nm thick. For the defocus dataset, we generate a series of CdSe nanoparticle structures. Each structure contains between one and five spherical wurtzite CdSe nanoparticles ranging between 3-5 nm in diameter, with each nanoparticle containing up to two stacking faults and separated from other nanoparticles by a distance of at least 1nm. Eleven defocus conditions were sampled post-simulation for each image from -25 to +25nm in steps of 5nm with a standard deviation of 1.5nm. The five defocus datasets between -10nm and +10nm defocus were used for model pretraining and all were used for transfer learning,resulting in 50 total possible dataset combinations (excluding repeat defoci) across both training stages. For the defocus datasets, we applied an electron doseage of 400 eper A˚2e-\text{per \AA}^{-2}. For the noise datasets, we utilize the same CdSe nanoparticle structures and simulations, a fixed defocus of -10nm with a standard deviation of 1.5nm, and electron doseages of 30, 75, 150, 300, and 600 eper A˚2e-\text{per \AA}^{-2}. For the structure varying datasets, we generate 18 different nanoparticle structure distributions corresponding to 2 size distributions and 9 crystal structures varying in chemistry and lattice symmetry (c.f. Table 2.1). Nanoparticle sizes are uniformly distributed with radii between 1 and 2nm (small) or between 2.5 and 3.5nm (large), with between 1 and 5 nanoparticles per structure, placed atop the substrate as previously described for the defocus dataset. For the imaging conditions, we use a fixed defocus of -10nm with a standard deviation of 1.5nm and apply an electron doseage of 300 eper A˚2e-\text{per \AA}^{-2}.

Table 2.1:Lattice structures used for nanoparticle generation.

StructureSpace GroupLattice parameters ( A˚\text{\AA} )
ZnS1863.8227 (a), 6.2607 (c)
CdSe1864.2985 (a), 7.0152 (c)
Cu2253.6157
Ag2254.0782
C2273.5673
Si2275.430
Fe3O42278.394
Co3O42278.065
Fe2292.866
References
  1. Rangel DaCosta, L., Sytwu, K., Groschner, C. K., & Scott, M. C. (2024). A Robust Synthetic Data Generation Framework for Machine Learning in High-Resolution Transmission Electron Microscopy (HRTEM). Npj Comput Mater, 10(1), 1–11. 10.1038/s41524-024-01336-0
  2. Rangel DaCosta, L., Brown, H. G., Pelz, P. M., Rakowski, A., Barber, N., O’Donovan, P., McBean, P., Jones, L., Ciston, J., Scott, M. C., & Ophus, C. (2021). Prismatic 2.0 – Simulation Software for Scanning and High Resolution Transmission Electron Microscopy (STEM and HRTEM). Micron, 151, 103141. 10.1016/j.micron.2021.103141
  3. Madsen, J., & Susi, T. (2021). The abTEM Code: Transmission Electron Microscopy from First Principles. Open Research Europe, 1(24), 13015. 10.12688/openreseurope.13015.1