Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   AzCIB_RS21760 Genome accession   NZ_CP011072
Coordinates   4848183..4849682 (-) Length   499 a.a.
NCBI ID   WP_050417818.1    Uniprot ID   -
Organism   Azoarcus sp. CIB     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 4843183..4854682
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  AzCIB_RS21740 (AzCIB_4349) - 4843405..4844349 (-) 945 WP_050417814.1 peptidase U32 family protein -
  AzCIB_RS21745 (AzCIB_4350) - 4844365..4845357 (-) 993 WP_050417815.1 peptidase U32 family protein -
  AzCIB_RS21750 (AzCIB_4351) - 4845371..4846738 (-) 1368 WP_050417816.1 aminotransferase class I/II-fold pyridoxal phosphate-dependent enzyme -
  AzCIB_RS21755 (AzCIB_4352) - 4846925..4848118 (-) 1194 WP_050417817.1 multidrug effflux MFS transporter -
  AzCIB_RS21760 (AzCIB_4353) comM 4848183..4849682 (-) 1500 WP_050417818.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  AzCIB_RS21765 (AzCIB_4354) - 4849795..4850058 (-) 264 WP_050417819.1 accessory factor UbiK family protein -
  AzCIB_RS21770 (AzCIB_4355) - 4850388..4851131 (+) 744 WP_050417820.1 TorF family putative porin -
  AzCIB_RS21775 (AzCIB_4356) glnK 4851182..4851520 (+) 339 WP_050417821.1 P-II family nitrogen regulator -
  AzCIB_RS21780 (AzCIB_4357) - 4851532..4852989 (+) 1458 WP_050417822.1 ammonium transporter -
  AzCIB_RS21785 (AzCIB_4358) - 4853168..4853614 (+) 447 WP_050417823.1 rhodanese-like domain-containing protein -

Sequence


Protein


Download         Length: 499 a.a.        Molecular weight: 53009.62 Da        Isoelectric Point: 8.8417

>NTDB_id=142041 AzCIB_RS21760 WP_050417818.1 4848183..4849682(-) (comM) [Azoarcus sp. CIB]
MSLALVRTRALAGMGAPEVTVEVHLANGLPAFNLVGLPDTEVREARERVRAAIATSQFEFPQRRITVNLAPADLPKEGGR
FDLPIALGILVASGQVDAAGLAKHEFVGELSLDGSLRAVRGGLAMALESGRTGRALVLPAANADEAALARGASVLPAASL
LAVCAHLNSHTALTRRLPPAVAEGRDDEADLAEVKGQLQARRALEVAAAGQHSLLMFGPPGTGKSMLARRLPGLLPPLDE
AEAIECASIQSLEGGFDARRWARRPYRAPHHSASAPALVGGGTTPRPGEISLAHRGVLFLDELPEFDRRVLEALREPLET
GSVTVSRARQRTEFPARFQLVAAMNPCPCGHAGDKGGRCRCTPDQVARYRGRLSGPLLDRMDIVIEVPLLDHAEMTGQAA
GEPSATVRERVAQAWAVQRGRQGGANSHLAPGRVDTLCAPDDQGKALLDHAIRRLNLSARGYHRILKVARTIADLAGAER
VGPAHLAEAIQYRRGLDSR

Nucleotide


Download         Length: 1500 bp        

>NTDB_id=142041 AzCIB_RS21760 WP_050417818.1 4848183..4849682(-) (comM) [Azoarcus sp. CIB]
ATGTCGCTGGCTCTCGTACGCACCAGGGCGCTCGCAGGCATGGGCGCGCCCGAGGTGACGGTGGAAGTCCATCTCGCCAA
CGGCCTGCCCGCATTCAACCTGGTCGGCCTGCCCGACACCGAAGTCCGCGAGGCGCGCGAACGCGTGCGCGCCGCGATCG
CGACCTCGCAGTTCGAATTCCCGCAACGCCGCATCACCGTCAATCTCGCTCCCGCGGACCTCCCCAAGGAGGGCGGGCGC
TTCGACCTGCCGATCGCGCTGGGGATCCTTGTGGCATCGGGGCAGGTGGATGCGGCGGGGCTCGCGAAGCACGAGTTCGT
CGGCGAACTCTCGCTCGACGGCAGCCTGCGCGCGGTGCGCGGCGGCCTCGCGATGGCGCTGGAGAGCGGTCGCACCGGTC
GCGCGCTGGTGCTGCCGGCCGCCAACGCCGACGAGGCGGCGCTCGCGCGCGGCGCTAGCGTACTGCCCGCCGCCAGCCTG
CTTGCCGTGTGCGCACACCTCAACAGCCACACGGCGCTCACTCGGCGCCTGCCGCCGGCGGTCGCCGAGGGGCGGGACGA
CGAAGCGGATCTGGCCGAGGTGAAGGGGCAGCTGCAGGCCCGCCGCGCGCTCGAGGTCGCGGCTGCCGGGCAGCATTCGC
TGCTGATGTTCGGTCCGCCGGGGACCGGCAAGTCCATGCTCGCGCGCCGCCTTCCGGGCCTGCTGCCGCCGCTCGACGAG
GCCGAGGCGATCGAGTGTGCGTCGATCCAGTCGCTCGAAGGCGGTTTCGACGCCCGCCGCTGGGCCCGTCGGCCGTACCG
CGCGCCGCACCACTCGGCCTCTGCGCCGGCGCTCGTCGGCGGCGGCACGACACCGCGGCCCGGCGAAATCAGCCTTGCGC
ACCGCGGCGTGCTCTTCCTCGACGAGCTTCCGGAGTTCGACCGCCGCGTCCTCGAGGCGCTGCGCGAGCCGCTGGAAACC
GGCTCCGTGACGGTGTCGCGCGCACGCCAGCGCACCGAGTTCCCGGCGCGCTTCCAGCTCGTCGCGGCGATGAATCCCTG
TCCCTGCGGCCACGCCGGCGACAAGGGCGGACGCTGCCGCTGCACGCCGGATCAGGTCGCGCGTTATCGCGGACGCCTGT
CGGGGCCGCTGCTCGACCGTATGGACATCGTCATCGAGGTGCCGCTGCTCGACCATGCCGAGATGACCGGGCAGGCGGCG
GGCGAACCCAGCGCGACCGTGCGCGAGCGCGTCGCGCAGGCCTGGGCGGTGCAGCGCGGGCGCCAGGGCGGGGCGAACAG
CCATCTCGCGCCGGGCCGCGTCGATACGCTGTGCGCGCCCGACGACCAGGGCAAGGCGCTGCTCGATCATGCGATCCGGC
GGCTGAATCTGTCCGCACGCGGGTATCATCGCATCCTGAAGGTCGCCCGCACGATCGCGGACCTGGCCGGCGCCGAGCGC
GTCGGTCCCGCGCATCTGGCCGAGGCGATCCAGTACCGGCGCGGCCTCGACTCGCGCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

52.883

100

0.533

  comM Vibrio campbellii strain DS40M4

53.333

99.198

0.529

  comM Vibrio cholerae strain A1552

52.929

99.198

0.525

  comM Glaesserella parasuis strain SC1401

50.902

100

0.509

  comM Legionella pneumophila str. Paris

48.697

100

0.487

  comM Legionella pneumophila strain ERS1305867

48.697

100

0.487

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

43.281

100

0.439


Multiple sequence alignment