DCMRF Protein Ga0180325_114556

This is the supplementary data page for predicted protein Ga0180325_114556 (predicted ) from an unusual, dichloromethane-fermenting Peptococcaceae strain, DCMF. DCMF was isolated by the Manefield group at UNSW, then sequenced, assembled and annotated in collaboration with the Edwards lab. Protein sequences were predicted using prokka and the JGI Genome Portal annotation pipeline. Proteins were further annotated via high-throughput homology searching, multiple sequence alignment and molecular phylogenetics, using HAQESAC and MultiHAQ to search each protein against all bacterial proteins in the UniProt Knowledgebase (download
protein, JGI locus tag; ncbi, NCBI protein ID (click ^ to open entry); prokka, prokka protein ID; jgi, JGI ID; description, JGI description; inpara, DCMF-specific "in-paralogues" identified by HAQESAC; paralogues, paralogues identified by HAQESAC; genus/family/order/class/phylum, TaxaMap taxonomy predictions based on well-supported HAQESAC clades; boot, bootstrap support (0-1) for TaxaMap clade; spcode, full list of UniProt taxonomy species codes for HAQESAC clade.

NOTE: HAQESAC only returns the closest homologues, so paralogue lists may be incomplete.

More Proteins: Click here for a full list of JGI-annotated proteins and their TaxaMap classifications.

HAQESAC protein alignment and phylogeny

Each prokka protein was subjected to a BLAST search against UniProt bacteria and the other JGI proteins. HAQESAC was used to iteratively generate and clean up Clustal Omega multiple sequence alignments, producing a high-quality alignment against a set of close homologues. The neighbor-joining tree implementation of Clustal W2 was used to make a phylogenetic tree (below).
(NOTE: These alignments and trees are designed to give an automated first look at a protein. Where individual
protein alignment and/or phylogenetic inference details are important, more careful analysis is recommended.)
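
Neighbor-joining methods build a tree from pairwise distances between aligned sequences. As a rough illustration of that input, the sketch below computes a simple p-distance (the proportion of differing residues over ungapped columns) for every pair in a small alignment. The sequences and names (`query`, `hom1`, `hom2`) are made up for the example, and this is not the distance calculation used by HAQESAC or Clustal W2, which may apply different gap handling and corrections.

```python
from itertools import combinations


def p_distance(seq_a, seq_b, gap="-"):
    """Proportion of differing residues over columns where neither sequence has a gap."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap or b == gap:
            continue  # skip columns with a gap in either sequence
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return differing / compared


def distance_matrix(alignment):
    """Return {(name1, name2): p-distance} for every pair, with names in sorted order."""
    return {
        (n1, n2): p_distance(alignment[n1], alignment[n2])
        for n1, n2 in combinations(sorted(alignment), 2)
    }


# Hypothetical toy alignment; not taken from the real Ga0180325_114556 data.
example = {
    "query": "MKT-LLVA",
    "hom1": "MKTALLVA",
    "hom2": "MRT-LIVA",
}

for pair, dist in distance_matrix(example).items():
    print(pair, round(dist, 3))
```

A matrix like this is only a starting point; where the details of a particular tree matter, a dedicated phylogenetics package is the better choice.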
NCBI proteins have the species code
The full alignment can be accessed via the download link below. Individual UniProt homologues can be retrieved by visiting

Download: Raw protein (fasta) | Sequence alignment (fasta) | Phylogenetic tree (newick | text | png)

NOTE: If download links do not work and/or no alignment/tree appears, either the protein ID is incorrect, the predicted gene was non-coding, or insufficient homologues were found by HAQESAC.

HAQESAC phylogeny

See below for HAQESAC sequence alignment.

HAQESAC multiple sequence alignment
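
The sequence alignment is offered for download in FASTA format. A minimal sketch of reading such a file and checking that it really is an alignment (all sequences the same length) might look like the following. The function names and the in-memory example text are assumptions for illustration only; a downloaded file could be passed in the same way via `open(...)`.

```python
import io


def read_fasta(handle):
    """Parse FASTA text from a file-like object into {name: sequence}.

    The name is the first whitespace-delimited word after '>'.
    """
    records = {}
    name = None
    chunks = []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                records[name] = "".join(chunks)
            header = line[1:].split()
            name = header[0] if header else ""
            chunks = []
        elif name is not None:
            chunks.append(line)
    if name is not None:
        records[name] = "".join(chunks)
    return records


def is_aligned(records):
    """True if every sequence has the same length (gaps included)."""
    return len({len(seq) for seq in records.values()}) <= 1


# Hypothetical two-sequence alignment standing in for a downloaded file.
demo = io.StringIO(">seqA example\nMK-T\nLL\n>seqB\nMKAT\nLL\n")
alignment = read_fasta(demo)
print(alignment, is_aligned(alignment))
```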
© 2019 RJ Edwards. Contact: richard.edwards@unsw.edu.au.