![]()
|
DCMF House-keeping proteins with TaxaMap taxonomy assignmentsProtein sequences were predicted using prokka
and futher annotated via high-throughput homology searching, multiple sequence alignment and molecular phylogenetics
using HAQESAC and MulitHAQ to search each protein against all bacterial proteins in the
UniProt Knowledgebase (download Each prokka protein was subject to a BLAST search against Uniprot bacteria and the other prokka proteins. HAQESAC was used to iteratively generate and clean up Clustal Omega multiple sequence alignments to produce a high quality alignment against a set of close homologues. The neighbor-joining tree implementation of Clustal W2 was used to make a phylogenetic tree (below). (NOTE: These alignments and trees are designed to give an automated first look at a protein. Where individual protein alignment and/or phylogenetic inference details are important, more careful analysis is recommended.) Individual proteins can be looked at in further detail by clicking the protein ID. Paralogues and in-paralogues (products of gene duplication) can be looked at by editing the following URL with the appropriate General Housekeeping Proteinsprotein, protein ID; description, prokka description; inpara, DCMF-specific "in-paralogues" identified by HAQESAC; paralogues, paralogues identified by HAQESAC; genus/family/order/class/phylum, TaxaMap taxonomy predictions based on well-supported HAQESAC clades; boot, bootstrap support (0-1) for TaxaMap clade; spcode, full list of Uniprot taxonomy species codes for HAQESAC clade. Ribosomal Proteinsprotein, protein ID; description, prokka description; inpara, DCMF-specific "in-paralogues" identified by HAQESAC; paralogues, paralogues identified by HAQESAC; genus/family/order/class/phylum, TaxaMap taxonomy predictions based on well-supported HAQESAC clades; boot, bootstrap support (0-1) for TaxaMap clade; spcode, full list of Uniprot taxonomy species codes for HAQESAC clade. © 2017 RJ Edwards. Contact: richard.edwards@unsw.edu.au. |