Virtual cells aim to replicate and predict real cell behavior using AI models, accelerating disease analysis, drug development, and cellular research. Instead of manually formulating equations, AI directly learns from cellular data to predict intracellular interactions
Why AI Companies Are Racing to Build a Virtual Human Cell
Virtual cells could make it faster and easier to discover new drugs.
https://time.com/7324119/what-is-virtual-cell/

Virtual Cell Challenge
Papers
A Cross-Species Generative Cell Atlas Across 1.5 Billion Years of Evolution: The TranscriptFormer Single-cell Model
Single-cell transcriptomics has revolutionized our understanding of cellular diversity, but integrating this knowledge across evolutionary distances remains challenging. Here we present TranscriptFormer, a family of generative foundation models representing a cross-species generative cell atlas trained on up to 112 million cells spanning 1.53 billion years of evolution across 12 species. TranscriptFormer jointly models genes and transcripts using a novel generative architecture, enabling it to function as a virtual instrument for probing cellular biology. In zero-shot settings, our models demonstrate superior performance on both in-distribution and out-of-distribution cell type classification, with robust performance even for species separated by over 685 million years of evolutionary distance. TranscriptFormer can also perform zero-shot disease state identification in human cells and accurately transfers cell type annotations across species boundaries. Being a generative model, TranscriptFormer can be prompted to predict cell type-specific transcription factors and gene-gene interactions that align with independent experimental observations. This work establishes a powerful framework for integrating and interrogating cellular diversity across species as well as offering a foundation for in silico experimentation with a generative single-cell atlas model. ### Competing Interest Statement The authors have declared no competing interest.
https://www.biorxiv.org/content/10.1101/2025.04.25.650731v1

Google
Teaching machines the language of biology: Scaling large language models for next-generation single-cell analysis
David van Dijk, Assistant Professor, Yale University, and Bryan Perozzi, Research Scientist, Google Research
https://research.google/blog/teaching-machines-the-language-of-biology-scaling-large-language-models-for-next-generation-single-cell-analysis/


Seonglae Cho