The protein databases have been growing exponentially in recent years, thus making the existing methods for similarity retrieval inappropriate concerning the volume of the protein-related data. In this thesis, we focus on similarity retrieval on protein sequence and structure levels.
At both levels, we propose improvements to the existing methods, as well as novel methods for managing proteins from the similarity perspective.