Charles Explorer logo
🇨🇿

Exploration of protein sequence embeddings for protein-ligand binding site detection

Publikace na Matematicko-fyzikální fakulta |
2022

Tento text není v aktuálním jazyce dostupný. Zobrazuje se verze "en".Abstrakt

Detection of protein-ligand binding sites is essential not only for protein function investigation but also in fields such as drug discovery or bioengineering. In this paper, we show that the recently-developed pre-trained language models can be used for protein-ligand binding site prediction.

Specifically, we present a neural network architecture where inputs correspond to amino acids embeddings obtained from a protein language model. We show that increasing complexity of the language model improves the predictive performance of the method, eventually leading to results comparable to or surpassing state-of-the-art approaches.

Unlike the existing methods, the presented approach does not require time-consuming computation of evolutionary information, resulting in faster running times.