VarLand: A pipeline to map the structural landscape of missense variants at the proteome scale

Author
Keywords
Abstract

Missense variant pathogenicity often arises from disruptions to protein structural features. The integration of large-scale genetic sequencing into clinical workflows, and the availability of accurate artificial intelligence-based protein structure predictions present an opportunity to assess the structure-function relationship of missense variants at a population scale. To harness this potential, we developed VarLand, a computational pipeline that extracts 29 structural and biophysical features from AlphaFold-predicted protein models and nine complementary annotation tools. We applied VarLand to pathogenic missense variants from ClinVar and a population-specific dataset of rare Middle Eastern variants, comparing their feature profiles to high-frequency benign variants from the Genome Aggregation Database (gnomAD). Our analysis confirms that pathogenic variants are significantly enriched in ordered regions, buried residues, and sites with high intramolecular contact density, whereas benign variants preferentially occur in disordered, solvent-exposed regions. However, VarLand also uncovered feature landscape variations across protein functional classes and disease categories, suggesting differences in underlying disease mechanisms. Furthermore, variants from the artificial intelligence-based AlphaMissense database showed a stronger association between structural order and pathogenicity than clinical datasets, indicating residual bias from structure-centric training. These findings demonstrate the effectiveness of multidimensional structural profiling by VarLand to uncover not only broad structure-pathogenicity relationships but also dataset-specific and class-specific deviations, offering deeper insight into disease mechanisms.

Year of Publication
2026
Journal
Journal of Biological Chemistry
URL
https://doi.org/10.1016/j.jbc.2025.111071
DOI
https://doi.org/10.1016/j.jbc.2025.111071
Download citation