top of page

scPRINT: pre-training on 50 million cells allows robust gene network predictions

  • cyrilrenassia
  • Apr 16
  • 1 min read

Nature Communications


Jérémie Kalfon, Jules Samaran, Gabriel Peyré & Laura Cantini 


Summary


A cell is governed by the interaction of myriads of macromolecules. Inferring such a network of interactions has remained an elusive milestone in cellular biology. Building on recent advances in large foundation models and their ability to learn without supervision, we present scPRINT, a large cell model for the inference of gene networks pre-trained on more than 50 million cells from the cellxgene database. Using innovative pretraining tasks and model architecture, scPRINT pushes large transformer models towards more interpretability and usability when uncovering the complex biology of the cell. Based on our atlas-level benchmarks, scPRINT demonstrates superior performance in gene network inference to the state of the art, as well as competitive zero-shot abilities in denoising, batch effect correction, and cell label prediction. On an atlas of benign prostatic hyperplasia, scPRINT highlights the profound connections between ion exchange, senescence, and chronic inflammation.


Recent Posts

See All
The extended mobility of plasmids

Nucleic Acid Research Maria Pilar Garcillán-Barcia, Fernando de la Cruz, Eduardo P C Rocha Summary Plasmids play key roles in the...

 
 
 

Comments


bottom of page