SoluProtMutDB: A manually curated database of protein
solubility changes upon mutations

ENPřihlásit se Přihlásit se (EduID)

SoluProtMutDB: A manually curated database of protein solubility changes upon mutations

Přehled o publikaci

J 2022

SoluProtMutDB: A manually curated database of protein solubility changes upon mutations

VELECKÝ, Jan, Marie HAMŠÍKOVÁ, Jan ŠTOURAČ, Miloš MUSIL, Jiří DAMBORSKÝ et. al.

Základní údaje

Originální název

SoluProtMutDB: A manually curated database of protein solubility changes upon mutations

Autoři

VELECKÝ, Jan (203 Česká republika, domácí), Marie HAMŠÍKOVÁ (203 Česká republika, domácí), Jan ŠTOURAČ (203 Česká republika, domácí), Miloš MUSIL (203 Česká republika, domácí), Jiří DAMBORSKÝ (203 Česká republika, domácí), David BEDNÁŘ (203 Česká republika, garant, domácí) a Stanislav MAZURENKO (643 Rusko, domácí)

Vydání

Computational and Structural Biotechnology Journal, Amsterdam, Elsevier, 2022, 2001-0370

Další údaje

Jazyk

angličtina

Typ výsledku

Článek v odborném periodiku

Stát vydavatele

Nizozemské království

Utajení

není předmětem státního či obchodního tajemství

Odkazy

URL

Kód RIV

RIV/00216224:14310/22:00130114

Organizace

Přírodovědecká fakulta – Masarykova univerzita – Repozitář

DOI

http://dx.doi.org/10.1016/j.csbj.2022.11.009

UT WoS

001043880900004

EID Scopus

2-s2.0-85142188313

Klíčová slova anglicky

Mutational database; Protein engineering; Soluble expression; Protein yield; Machine learning; Protein aggregation

Návaznosti

EF15_003/0000469, projekt VaV. EF17_043/0009632, projekt VaV. FW03010208, projekt VaV. GJ20-15915Y, projekt VaV. LM2018121, projekt VaV. LX22NPO5102, projekt VaV. ELIXIR-CZ II, velká výzkumná infrastruktura.

Změněno: 27. 2. 2025 00:50, RNDr. Daniel Jakubík

Anotace

V originále

Protein solubility is an attractive engineering target primarily due to its relation to yields in protein production and manufacturing. Moreover, better knowledge of the mutational effects on protein solubility could connect several serious human diseases with protein aggregation. However, we have limited understanding of the protein structural determinants of solubility, and the available data have mostly been scattered in the literature. Here, we present SoluProtMutDB – the first database containing data on protein solubility changes upon mutations. Our database accommodates 33 000 measurements of 17 000 protein variants in 103 different proteins. The database can serve as an essential source of information for the researchers designing improved protein variants or those developing machine learning tools to predict the effects of mutations on solubility. The database comprises all the previously published solubility datasets and thousands of new data points from recent publications, including deep mutational scanning experiments. Moreover, it features many available experimental conditions known to affect protein solubility. The datasets have been manually curated with substantial corrections, improving suitability for machine learning applications. The database is available at loschmidt.chemi.muni.cz/soluprotmutdb.

Přiložené soubory

https://is.muni.cz/publication/2241002/SoluProtMutDB_A_manually_curated_database_of_protein_solubility_changes_upon_mutations.pdf

Citovat

VELECKÝ, Jan, Marie HAMŠÍKOVÁ, Jan ŠTOURAČ, Miloš MUSIL, Jiří DAMBORSKÝ, David BEDNÁŘ a Stanislav MAZURENKO. SoluProtMutDB: A manually curated database of protein solubility changes upon mutations. Computational and Structural Biotechnology Journal. Amsterdam: Elsevier, 2022, roč. 20, November 2022, s. 6339-6347. ISSN 2001-0370. Dostupné z: https://dx.doi.org/10.1016/j.csbj.2022.11.009.

@article{53247,
   author = {Velecký, Jan and Hamšíková, Marie and Štourač, Jan and Musil, Miloš and Damborský, Jiří and Bednář, David and Mazurenko, Stanislav},
   article_location = {Amsterdam},
   article_number = {November 2022},
   doi = {http://dx.doi.org/10.1016/j.csbj.2022.11.009},
   keywords = {Mutational database; Protein engineering; Soluble expression; Protein yield; Machine learning; Protein aggregation},
   language = {eng},
   issn = {2001-0370},
   journal = {Computational and Structural Biotechnology Journal},
   title = {SoluProtMutDB: A manually curated database of protein solubility changes upon mutations},
   url = {https://www.sciencedirect.com/science/article/pii/S2001037022005025?via%3Dihub},
   volume = {20},
   year = {2022}
}

TY  - JOUR
ID  - 53247
AU  - Velecký, Jan - Hamšíková, Marie - Štourač, Jan - Musil, Miloš - Damborský, Jiří - Bednář, David - Mazurenko, Stanislav
PY  - 2022
TI  - SoluProtMutDB: A manually curated database of protein solubility changes upon mutations
JF  - Computational and Structural Biotechnology Journal
VL  - 20
IS  - November 2022
SP  - 6339-6347
EP  - 6339-6347
PB  - Elsevier
SN  - 2001-0370
KW  - Mutational database
KW  - Protein engineering
KW  - Soluble expression
KW  - Protein yield
KW  - Machine learning
KW  - Protein aggregation
UR  - https://www.sciencedirect.com/science/article/pii/S2001037022005025?via%3Dihub
N2  - Protein solubility is an attractive engineering target primarily due to its relation to yields in protein production and manufacturing. Moreover, better knowledge of the mutational effects on protein solubility could connect several serious human diseases with protein aggregation. However, we have limited understanding of the protein structural determinants of solubility, and the available data have mostly been scattered in the literature. Here, we present SoluProtMutDB – the first database containing data on protein solubility changes upon mutations. Our database accommodates 33 000 measurements of 17 000 protein variants in 103 different proteins. The database can serve as an essential source of information for the researchers designing improved protein variants or those developing machine learning tools to predict the effects of mutations on solubility. The database comprises all the previously published solubility datasets and thousands of new data points from recent publications, including deep mutational scanning experiments. Moreover, it features many available experimental conditions known to affect protein solubility. The datasets have been manually curated with substantial corrections, improving suitability for machine learning applications. The database is available at loschmidt.chemi.muni.cz/soluprotmutdb.
ER  -

VELECKÝ, Jan, Marie HAMŠÍKOVÁ, Jan ŠTOURAČ, Miloš MUSIL, Jiří DAMBORSKÝ, David BEDNÁŘ a Stanislav MAZURENKO. SoluProtMutDB: A manually curated database of protein solubility changes upon mutations. \textit{Computational and Structural Biotechnology Journal}. Amsterdam: Elsevier, 2022, roč.~20, November 2022, s.~6339-6347. ISSN~2001-0370. Dostupné z: https://dx.doi.org/10.1016/j.csbj.2022.11.009.