
Ankh-score produces better sequence alignments than AlphaFold3

Abstract: Protein sequence alignment is one of the most fundamental procedures in bioinformatics. Because it has many downstream applications, improvements to this procedure are of great importance. We consider two revolutionary concepts that emerged recently as candidates for improving state-of-the-art alignment methods: AlphaFold and protein embeddings. Alignment improvements can come either from structural alignment of AlphaFold-predicted structures or from scoring based on the similarity of protein embedding vectors. A thorough comparison on many domains from BAliBASE and CDD demonstrates that the Ankh-score method produces much better sequence alignments than structural alignment of AlphaFold3-predicted structures with US-align. Both are better than the traditional method using BLOSUM matrices. This suggests that Ankh embedding vectors may carry information that is not available in the AlphaFold3-predicted structures.
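To make the idea of "scoring based on similarity of protein embedding vectors" concrete, the following is a minimal sketch and not the authors' Ankh-score implementation. It assumes that each residue already has an embedding vector (in practice these would come from a protein language model such as Ankh; here they are hand-made toy vectors), uses cosine similarity as the substitution score in place of a BLOSUM lookup, and applies a plain Needleman-Wunsch global alignment with a linear gap penalty. The function names `cosine` and `embedding_align` and the gap value are illustrative assumptions.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def embedding_align(seq_a, seq_b, emb_a, emb_b, gap=-0.5):
    """Global alignment where the substitution score is the cosine similarity
    of per-residue embeddings (hypothetical stand-in for an embedding score)."""
    n, m = len(seq_a), len(seq_b)
    score = [[0.0] * (m + 1) for _ in range(n + 1)]
    move = [[""] * (m + 1) for _ in range(n + 1)]  # traceback pointers
    for i in range(1, n + 1):
        score[i][0], move[i][0] = i * gap, "U"
    for j in range(1, m + 1):
        score[0][j], move[0][j] = j * gap, "L"
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            options = [
                (score[i - 1][j - 1] + cosine(emb_a[i - 1], emb_b[j - 1]), "D"),
                (score[i - 1][j] + gap, "U"),  # gap in seq_b
                (score[i][j - 1] + gap, "L"),  # gap in seq_a
            ]
            score[i][j], move[i][j] = max(options, key=lambda t: t[0])
    # Follow the pointers back from the bottom-right corner.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        step = move[i][j]
        if step == "D":
            out_a.append(seq_a[i - 1]); out_b.append(seq_b[j - 1])
            i, j = i - 1, j - 1
        elif step == "U":
            out_a.append(seq_a[i - 1]); out_b.append("-")
            i -= 1
        else:
            out_a.append("-"); out_b.append(seq_b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b)), score[n][m]


# Toy per-residue embeddings (assumption: real ones are high-dimensional model outputs).
toy = {"A": [1.0, 0.0, 0.0], "C": [0.0, 1.0, 0.0], "D": [0.0, 0.0, 1.0]}
seq_a, seq_b = "ACD", "AD"
aligned_a, aligned_b, total = embedding_align(
    seq_a, seq_b, [toy[r] for r in seq_a], [toy[r] for r in seq_b]
)
print(aligned_a)  # ACD
print(aligned_b)  # A-D
```

Unlike a BLOSUM matrix, which gives every pair of amino-acid types one fixed score, a per-residue embedding depends on the surrounding sequence, so the same two amino acids can score differently at different positions.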

Related Results

AlphaFold3 at CASP16
The CASP16 experiment provided the first opportunity to benchmark AlphaFold3. In contrast to AlphaFold2, AlphaFold3 can predict the structure of non-protein molecules, and accordin...
COFFEE: an objective function for multiple sequence alignments.
Abstract MOTIVATION: In order to increase the accuracy of multiple sequence alignments, we designed a new strategy for optimizing multiple sequence alignments by gen...
AlphaFold3 at CASP16
Abstract The CASP16 experiment provided the first benchmark for AlphaFold3. In contrast to AlphaFold2 and other methods, AlphaFold3 can also predict the structure o...
TooT-PLM-P2S: Incorporating Secondary Structure Information into Protein Language Models
Abstract In bioinformatics, modeling the protein space to better predict function and structure has benefitted from Protein Language Models (PLMs). Their basis is t...
TooT-PLM-P2S: Incorporating Secondary Structure Information into Protein Language Models
In bioinformatics, modeling the protein space to better predict function and structure has benefitted from Protein Language Models (PLMs). Their basis is the protein’s amino acid s...
ANKH variants associated with ankylosing spondylitis: gender differences
Abstract The ank (progressive ankylosis) mutant mouse, which has a nonsense mutation in exon 12 of the inorganic pyrophosphate regulator gene (ank), exhibits aberrant joint ankylosi...
Multiple Alignments of Data Objects and Generalized Center Star Algorithm
Multiple alignments of strings have been extensively studied as an effective tool to study string-type data such as DNA. In this paper, we generalize the notion of multiple alignme...
