Neural structure fields with application to crystal structure autoencoders

Communications Materials 2023
1OMRON SINIC X Corporation2Tohoku University3Toyota Motor Corporation4SOKENDAI5Randeft Inc.6Osaka University

TL;DR Express crystal structures as 3D continuous fields for crystal structure prediction

Overview

Representing crystal structures of materials to facilitate determining them via neural networks is crucial for enabling machine-learning applications involving crystal structure estimation. Among these applications, the inverse design of materials can contribute to explore materials with desired properties without relying on luck or serendipity. Here, we propose neural structure fields (NeSF) as an accurate and practical approach for representing crystal structures using neural networks. Inspired by the concepts of vector fields in physics and implicit neural representations in computer vision, the proposed NeSF considers a crystal structure as a continuous field rather than as a discrete set of atoms. Unlike existing grid-based discretized spatial representations, the NeSF overcomes the tradeoff between spatial resolution and computational complexity and can represent any crystal structure. We propose an autoencoder of crystal structures that can recover various crystal structures, such as those of perovskite structure materials and cuprate superconductors. Extensive quantitative results demonstrate the superior performance of the NeSF compared with the existing grid-based approach.

Crystal Structures as Continuous Fields

The crystal structure of an inorganic material is a regular and periodic arrangement of atoms in 3D space. This periodic arrangement is usually described by a minimum repeatable pattern called a unit cell. The atoms in a unit cell have no explicit order, and their quantity varies from one to hundreds in number.

Because ML models generally accept fixed-dimensional and consistently ordered tensors for processing, determining crystal structures via the models is challenging.

To address this problem, we represent a crystal structure as a continuous vector field tied to 3D space, rather than as a discrete set of atoms. The proposed approach, named neural structure fields (NeSF), uses two types of vector fields: position field and species field. These fields implicitly represent the positions and species of the atoms in the unit cell, respectively.

neural structure fields

To illustrate the concept of the NeSF, assume that we are given target material information as a fixed-dimensional vector, z\bm{z}, and consider the problem of recovering a crystal structure from z\bm{z}. Input z\bm{z} may specify, for example, information on the crystal structure of a material or some desired criteria for materials to be produced.

The NeSF uses a neural network, ff, as a vector field on 3D Cartesian coordinates p\bm{p}, conditioned on the target material information, z\bm{z}:

s=f(p,z)\bm{s} = f(\bm{p}, \bm{z})

In the position field, network ff is trained to output a 3D vector pointing from query point p\bm{p} to its nearest atom position a\bm{a} in the crystal structure of interest. Thus, we expect output s\bm{s} to be ap\bm{a} - \bm{p}. If the position field is ideally trained, we can retrieve position a\bm{a} of the nearest atom at any query point p\bm{p} as p+f(p,z)\bm{p} + f(\bm{p}, \bm{z}).

Analogously, the species field is trained to output a categorical probability distribution that indicates the species of the nearest atom.

Application to Crystal Autoencoders

As an application of the NeSF, we present an autoencoder of crystal structures. In this autoencoder, crystal structures are embedded into vectors z\bm{z} (called latent vectors) via an encoder, and then the NeSF acts as a decoder to recover the input crystal structures from z\bm{z}.

crystal autoencoders

Autoencoders are typically used to learn latent vector representations of data via self-supervised learning, in which the input data can supervise the learning via a reconstruction loss.

Benchmark

The NeSF-based autoencoder outperforms an existing voxel-based method, ICSG3D (Court et al., 2020), in reconstruction accuracies.

benchmark results benchmark results

Interpolation of Materials

Interestingly, the learned latent space with the NeSF enables us to produce interpolated crystal structures between a pair of materials.

latent space interpolation 1
Latent space interpolation from ZnS to CdS

latent space interpolation 2
Latent space interpolation from MgO to NaCl

Citation

@article{chiba2023nesf,
  author  = {Naoya Chiba and 
             Yuta Suzuki and
             Tatsunori Taniai and
             Ryo Igarashi and
             Yoshitaka Ushiku and
             Kotaro Saito and 
             Kanta Ono
             },
  title   = {Neural structure fields with application to crystal structure autoencoders},
  journal = {Communications Materials},
  volume  = {4},
  number  = {1},
  pages   = {106},
  year    = {2023},
  month   = {12},
  day     = {12},
  doi     = {10.1038/s43246-023-00432-w}
}