What is rascaline

Rascaline is a library for the efficient computing of representations for atomistic machine learning also called “descriptors” or “fingerprints”. These representation can be used for atomistic machine learning (ML) models including ML potentials, visualization or similarity analysis.

There exist several libraries able to compute such structural representations, such as DScribe, QUIP, and many more. Rascaline tries to distinguish itself by focussing on speed and memory efficiency of the calculations, with the explicit goal of running molecular simulations with ML potentials. In particular, memory efficiency is achieved by using the metatensor to store the structural representation. Additionally, rascaline is not limited to a single representation but supports several:

representation

description

gradients

Spherical expansion

Atoms are represented by the expansion of their neighbor’s density on radial basis and spherical harmonics. This is the core of representations in SOAP (Smooth Overlap of Atomic Positions)

positions, strain, cell

SOAP radial spectrum

Atoms are represented by 2-body correlations of their neighbors’ density

positions, strain, cell

SOAP power spectrum

Atoms are represented by 3-body correlations of their neighbors’ density

positions, strain, cell

LODE Spherical Expansion

Core of representations in LODE (Long distance equivariant)

positions

Sorted distances

Each atom is represented by a vector of distance to its neighbors within the spherical cutoff

no

Neighbor List

Each pair is represented by the vector between the atoms. This is intended to be used as a starting point for more complex representations

positions

AtomicComposition

Obtaining the stoichiometric information of a system

positions, strain, cell