SequenceBouncer: A method to remove outlier entries from a multiple sequence alignment
Abstract
Phylogenetic analyses can take advantage of multiple sequence alignments as input. These alignments typically consist of homologous nucleic acid or protein sequences, and the inclusion of outlier or aberrant sequences can compromise downstream analyses. Here, I describe a program, SequenceBouncer, that uses the Shannon entropy values of alignment columns to identify outlier alignment sequences in a manner responsive to overall alignment context. I demonstrate the utility of this software using alignments of available mammalian mitochondrial genomes, bird cytochrome c oxidase-derived DNA barcodes, and COVID-19 sequences.
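To make the role of column entropy concrete, the following is a minimal sketch in Python and not SequenceBouncer's own code. It computes the Shannon entropy of each alignment column, then ranks sequences by a deliberately simple, hypothetical score (the mean surprisal of each sequence's characters). The function names and the scoring rule are assumptions made for illustration only; the abstract does not specify how SequenceBouncer combines column entropies into an outlier call.

```python
import math
from collections import Counter


def column_entropy(column):
    """Shannon entropy (in bits) of the characters in one alignment column."""
    counts = Counter(column)
    total = len(column)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def alignment_entropies(sequences):
    """Entropy of every column in a list of equal-length aligned sequences."""
    if len({len(s) for s in sequences}) != 1:
        raise ValueError("aligned sequences must all have the same length")
    # zip(*sequences) transposes the rows (sequences) into columns.
    return [column_entropy(col) for col in zip(*sequences)]


def surprisal_scores(sequences):
    """Hypothetical outlier score (an assumption, not the published method):
    the mean of -log2(frequency) of each sequence's character in its column.
    Sequences that carry rare characters in many columns score higher."""
    n = len(sequences)
    freqs = [Counter(col) for col in zip(*sequences)]
    return [
        sum(-math.log2(freqs[i][ch] / n) for i, ch in enumerate(seq)) / len(seq)
        for seq in sequences
    ]


if __name__ == "__main__":
    # Toy alignment: the last sequence disagrees with the others everywhere.
    alignment = ["ACGT", "ACGT", "ACGT", "TGCA"]
    print(alignment_entropies(alignment))
    print(surprisal_scores(alignment))
```

In this toy alignment the fourth sequence receives the highest score because its character is the minority one in every column. A real tool would also need to decide how to treat gaps and ambiguous characters, which this sketch simply counts as ordinary symbols.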
2021