Advanced Search
Submit ManuscriptADVANCE ONLINE PUBLICATION
Structural genomics sheds light on protein functions and remote homologs across the insect tree of life
Weiyin Wu1,† , Chunlai Cui2,3,† , Yixiao Zhu1,4,† , Jingxuan Chen1 , Qiancheng Zhuang1 , Yazhou Wang1 , Zicheng Liu2 , Han Gao2 , Guo-Zheng Ou1 , Chao Liu1 , Mei Tao1 , Yun Chen1 , Ronghui Pan5 , Guojie Zhang6 , Hua Cai7 , Jinghua Yang1,4 , Xue-xin Chen1 , Xiaofan Zhou8,* , Sibao Wang2,* , Xing-Xing Shen1,4,6,*
1Zhejiang Key Laboratory of Biology and Ecological Regulation of Crop Pathogens and Insects, State Key Laboratory for Vegetation Structure, Function and Construction, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, Zhejiang, ChinaProtein structure bridges the sequence–function relationship, enabling deep exploration of biological processes across diverse organisms. Insects, the most diverse animal lineage, accounting for over 50% of all described animal species, provide an exceptional system for exploring sequence–structure–function relationships. Here, we reconstructed a comprehensive and well-resolved phylogeny of 4854 insects, spanning all orders. Leveraging this framework, we created an atlas of 13.29 million predicted protein structures from 824 representative species, including 11.63 million newly predicted structures. Structural clustering revealed that proteins with divergent sequences but similar structures could be effectively grouped together. Structural similarity searches against proteins with well-characterized functions yielded annotations for 7.61 million insect proteins, including up to 14% of previously unannotated proteins. We further identified 750 million remote homologs between insect proteins, many of which trace back to ancient branches of the insect phylogeny. Remarkably, despite extensive sequence divergence, cGAS-like receptors (cGLRs) were structurally conserved across all 824 insects. Experimental assays demonstrated that these structurally identified cGLRs play a crucial role in antiviral defense in the yellow fever mosquito. Our findings highlight the significance of structural genomics for understanding protein function and evolution across the tree of life.
https://doi.org/10.1038/s41422-026-01220-0