Lupa

Show document Help

A- | A+ | Print
Title:Fast prediction of protein flexibility
Authors:ID Pražnikar, Jure (Author)
Files:.pdf RAZ_Praznikar_Jure_2026.pdf (10,69 MB)
MD5: 044124A09598BFCA0B997932A8D93F48
 
URL https://academic.oup.com/bioinformatics/advance-article/doi/10.1093/bioinformatics/btag175/8654525?login=false
 
Language:English
Work type:Article
Typology:1.01 - Original Scientific Article
Organization:IAM - Andrej Marušič Institute
Abstract:Motivation Advances in hardware have made molecular dynamics (MD) simulations of protein structures faster and more accessible to the scientific community. However, accurately estimating protein flexibility using MD remains computationally demanding, especially for large systems and long time scales. Several MD-based resources—including MdMD, the DynamD database, and more recently ATLAS and mdCATH—now provide MD trajectories for thousands of proteins, enabling the development of predictive models. Results Here, the Graphlet Degree Vector (GDV) is introduced as a lightweight, fast, and easy-to-implement linear model for predicting protein flexibility directly from atom coordinates. GDV is a 15-dimensional feature vector that captures local packing and the spatial connectivity of each atom with its nearby neighbors. Trained on a subset of globular-like proteins from the ATLAS database, the GDV model achieves a Spearman correlation of 0.828 compared to MD data. The model trained on ATLAS dataset was further evaluated on independent Nuclear Magnetic Resonance and cryo-electron microscopy datasets, demonstrating the robustness and generalizability of the GDV-based approach. A key advantage of the GDV model is that it requires no additional external or experimental data and can be applied in near real time (on the order of 10 seconds) even for large proteins with 20,000 atoms on a standard desktop or laptop. Overall, the results show that a lightweight, fast, and purely coordinate-based model can provide accurate and generalizable predictions of protein flexibility across diverse folds and sizes.
Keywords:protein flexibility, graphlets, predictive model
Publication version:Author Accepted Manuscript
Publication date:15.04.2026
Year of publishing:2026
Number of pages:str. 1-9
Numbering:Vol. , issue , [article no.] btag175
PID:20.500.12556/RUP-22965 This link opens in a new window
UDC:577:547.96
ISSN on article:1367-4811
DOI:10.1093/bioinformatics/btag175 This link opens in a new window
COBISS.SI-ID:275501315 This link opens in a new window
Publication date in RUP:16.04.2026
Views:33
Downloads:2
Metadata:XML DC-XML DC-RDF
:
Copy citation
  
Average score:(0 votes)
Your score:Voting is allowed only for logged in users.
Share:Bookmark and Share


Hover the mouse pointer over a document title to show the abstract or click on the title to get all document metadata.

Record is a part of a journal

Title:Bioinformatics
Publisher:Oxford University Press
ISSN:1367-4811
COBISS.SI-ID:2799124 This link opens in a new window

Document is financed by a project

Funder:ARIS - Slovenian Research and Innovation Agency
Project number:P1-0048-2018
Name:Strukturna biologija

Funder:ARIS - Slovenian Research and Innovation Agency
Project number:I0-0035-2022
Name:Infrastrukturna skupina Univerze na Primorskem

Licences

License:CC BY 4.0, Creative Commons Attribution 4.0 International
Link:http://creativecommons.org/licenses/by/4.0/
Description:This is the standard Creative Commons license that gives others maximum freedom to do what they want with the work as long as they credit the author.

Secondary language

Language:Slovenian
Abstract:Napredek na področju strojne opreme je omogočil, da so simulacije molekulske dinamike (MD) proteinskih struktur hitrejše in dostopnejše znanstveni skupnosti. Vendar natančno ocenjevanje fleksibilnosti proteinov z uporabo MD še vedno ostaja računsko zahtevno, zlasti za velike proteine in daljše časovne skale. Več podatkovnih zbirk, kot so MdMD, DynamD ter novejši ATLAS in mdCATH, zdaj vsebuje MD trajektorije za tisoče proteinov, kar omogoča razvoj napovednih modelov. V tem delu je predstavljen Graphlet Degree Vector (GDV) kot preprost, hiter in enostaven linearni model za napovedovanje fleksibilnosti proteinov neposredno iz koordinat atomov. GDV je 15-dimenzionalni vektor značilk, ki zajema lokalno pakiranje in prostorsko povezanost vsakega atoma z njegovimi bližnjimi sosedi. Model GDV, naučen na podmnožici globularnim podobnih proteinov iz baze ATLAS, doseže Spearmanovo korelacijo 0,828 v primerjavi z MD podatki. Model, naučen na podatkih ATLAS, je bil dodatno ovrednoten na neodvisnih podatkovnih zbirkah jedrske magnetne resonance (NMR) in krioelektronske mikroskopije, kar potrjuje robustnost in posplošljivost pristopa, ki temelji na GDV. Ključna prednost modela GDV je, da ne zahteva dodatnih zunanjih ali eksperimentalnih podatkov in ga je mogoče uporabiti skoraj v realnem času (približno 10 sekund), tudi za velike proteine z 20.000 atomi na običajnem namiznem ali prenosnem računalniku. Na splošno rezultati kažejo, da lahko lahek, hiter model, ki temelji izključno na koordinatah, zagotovi natančne in posplošljive napovedi fleksibilnosti proteinov pri raznolikih zgradbah in velikostih.
Keywords:prožnost proteinov, grafki, napovedni model


Comments

Leave comment

You must log in to leave a comment.

Comments (0)
0 - 0 / 0
 
There are no comments!

Back
Logos of partners University of Maribor University of Ljubljana University of Primorska University of Nova Gorica