Charles Explorer logo
🇬🇧

Exploration of protein sequence embeddings for protein-ligand binding site detection

Publication at Faculty of Mathematics and Physics |
2022

Abstract

Detection of protein-ligand binding sites is essential not only for protein function investigation but also in fields such as drug discovery or bioengineering. In this paper, we show that the recently-developed pre-trained language models can be used for protein-ligand binding site prediction.

Specifically, we present a neural network architecture where inputs correspond to amino acids embeddings obtained from a protein language model. We show that increasing complexity of the language model improves the predictive performance of the method, eventually leading to results comparable to or surpassing state-of-the-art approaches.

Unlike the existing methods, the presented approach does not require time-consuming computation of evolutionary information, resulting in faster running times.