J
2021
SoluProt: prediction of soluble protein expression in Escherichia coli
HON, Jiří; Martin MARUSIAK; Tomas MARTINEK; Antonín KUNKA; Jaroslav ZENDULKA et al.
Basic information
Original name
SoluProt: prediction of soluble protein expression in Escherichia coli
Authors
HON, Jiří; Martin MARUSIAK; Tomas MARTINEK; Antonín KUNKA; Jaroslav ZENDULKA; David BEDNÁŘ and Jiří DAMBORSKÝ
Edition
Bioinformatics, Oxford (UK), Oxford University Press, 2021, 1367-4803
Other information
Type of outcome
Article in a journal
Country of publisher
United Kingdom of Great Britain and Northern Ireland
Confidentiality degree
is not subject to a state or trade secret
Marked to be transferred to RIV
Yes
RIV identification code
RIV/00216224:14310/21:00119188
Organization
Přírodovědecká fakulta – Repository – Repository
Keywords in English
SOLUBILITY; WEBSERVER; TOPOLOGY; ACCURATE
Links
EF17_043/0009632, research and development project. GJ20-15915Y, research and development project. LM2018131, research and development project. LM2018140, research and development project. 814418, interní kód Repo. 857560, interní kód Repo.
In the original language
Motivation: Poor protein solubility hinders the production of many therapeutic and industrially useful proteins. Experimental efforts to increase solubility are plagued by low success rates and often reduce biological activity. Computational prediction of protein expressibility and solubility in Escherichia coli using only sequence information could reduce the cost of experimental studies by enabling prioritization of highly soluble proteins. Results: A new tool for sequence-based prediction of soluble protein expression in E.coli, SoluProt, was created using the gradient boosting machine technique with the TargetTrack database as a training set. When evaluated against a balanced independent test set derived from the NESG database, SoluProt's accuracy of 58.5% and AUC of 0.62 exceeded those of a suite of alternative solubility prediction tools. There is also evidence that it could significantly increase the success rate of experimental protein studies.
Displayed: 4/5/2026 13:03