Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   CBP36_RS00880 Genome accession   NZ_CP021366
Coordinates   188673..190223 (-) Length   516 a.a.
NCBI ID   WP_086926240.1    Uniprot ID   -
Organism   Acidovorax carolinensis strain P4     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 183673..195223
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CBP36_RS00865 (CBP36_00865) - 185181..185972 (+) 792 WP_086928548.1 ABC transporter ATP-binding protein -
  CBP36_RS00870 (CBP36_00870) - 185969..186748 (+) 780 WP_236748693.1 ABC transporter ATP-binding protein -
  CBP36_RS00875 (CBP36_00875) - 186871..188598 (-) 1728 WP_086926239.1 methyl-accepting chemotaxis protein -
  CBP36_RS00880 (CBP36_00880) comM 188673..190223 (-) 1551 WP_086926240.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  CBP36_RS00885 (CBP36_00885) - 190436..191167 (+) 732 WP_086910986.1 TorF family putative porin -
  CBP36_RS00890 (CBP36_00890) glnK 191214..191552 (+) 339 WP_086910987.1 P-II family nitrogen regulator -
  CBP36_RS00895 (CBP36_00895) amt 191595..193034 (+) 1440 WP_086926241.1 ammonium transporter -
  CBP36_RS00900 (CBP36_00900) - 193415..194293 (+) 879 WP_086926242.1 LysR substrate-binding domain-containing protein -

Sequence


Protein


Download         Length: 516 a.a.        Molecular weight: 54115.94 Da        Isoelectric Point: 7.5522

>NTDB_id=230200 CBP36_RS00880 WP_086926240.1 188673..190223(-) (comM) [Acidovorax carolinensis strain P4]
MSLALVQSRALLGLEAPSVTVEVHLANGLPSFTLVGLADVEVKEARERVRSALQNAGLEFPHNKRITVNLAPADLPKDSG
RFDLPIALGILAASGQIDASRLAGHEFAGELSLSGQLRPVRGALATSLALRAHQVAVRLVLPPGSAEEAALVPGAQVYRA
RHLLDVVRQFLPAGGQESIDDTAATDEGWCLLQPAPLATQAVYADLSDVKGQAAAKRALEIAAAGGHSLLMMGPPGSGKS
MLAQRFAGLLPAMTMEEALESAAVASLAGRFAAGQWAQRPTSQPHHTASAVALVGGGSPPQPGEISLAHHGVLFLDELPE
FPRAALEALREPLETGRITIARAARRAEFPACFQMVAAMNPCPCGFLGSAQRACRCTPDQVARYQGKLSGPLLDRIDLHV
EVPALPPTDLLDAPPGEASAAVRERVDRARLAAITRQGKTNQALQGQEIDTHTALEPAATQFLQTAAARLGWSARSTHRT
LKVARTIADLAGAASTQVAHVAEAVQYRRVLRGPAA

Nucleotide


Download         Length: 1551 bp        

>NTDB_id=230200 CBP36_RS00880 WP_086926240.1 188673..190223(-) (comM) [Acidovorax carolinensis strain P4]
ATGAGTCTTGCATTGGTGCAAAGTCGCGCCCTTCTGGGGCTGGAAGCGCCCTCCGTCACCGTCGAGGTGCATCTGGCCAA
CGGGCTTCCCAGCTTCACCCTGGTCGGCCTGGCCGATGTGGAGGTGAAGGAGGCGCGCGAGCGCGTGCGCTCGGCGCTGC
AGAACGCGGGGCTGGAGTTCCCCCACAACAAGCGCATCACCGTAAACCTTGCACCGGCCGATCTGCCCAAGGACTCCGGG
CGCTTTGACCTGCCGATTGCCCTGGGCATCCTGGCCGCCAGCGGACAGATCGACGCCAGCCGCCTGGCCGGCCATGAGTT
CGCCGGCGAGCTGTCGCTGTCCGGGCAATTGCGCCCTGTGCGCGGCGCACTCGCCACCAGCCTGGCGCTGCGCGCCCACC
AGGTAGCGGTGCGGCTGGTGTTGCCACCGGGCAGCGCCGAAGAAGCCGCCCTGGTGCCCGGCGCGCAGGTTTACCGCGCG
CGCCACCTGCTGGATGTCGTGCGGCAGTTCCTGCCGGCAGGCGGCCAGGAGTCCATCGACGACACCGCTGCCACGGACGA
GGGCTGGTGCCTGCTGCAACCCGCACCGCTGGCCACGCAAGCGGTCTATGCCGACCTTTCGGACGTCAAGGGCCAGGCCG
CCGCCAAACGCGCGCTCGAAATCGCGGCCGCCGGTGGGCACAGCCTGCTGATGATGGGCCCGCCCGGTTCGGGCAAGTCC
ATGCTCGCCCAGCGCTTTGCCGGGCTGCTGCCAGCCATGACGATGGAAGAGGCCCTGGAAAGTGCGGCCGTGGCCAGCCT
GGCGGGTCGCTTCGCTGCCGGGCAATGGGCGCAGCGACCCACCAGCCAGCCCCACCACACGGCATCGGCCGTGGCGCTGG
TGGGCGGCGGCTCGCCGCCGCAGCCAGGAGAAATATCACTGGCCCACCACGGGGTGCTGTTCCTGGACGAGTTGCCGGAG
TTCCCCCGGGCCGCCCTGGAGGCCCTGCGCGAGCCCCTGGAAACCGGGCGCATCACCATTGCGCGCGCCGCCCGGCGCGC
CGAGTTTCCGGCGTGCTTTCAGATGGTGGCCGCCATGAACCCCTGCCCCTGCGGCTTCTTGGGGTCTGCGCAGCGTGCCT
GCCGCTGCACGCCCGACCAGGTGGCGCGCTACCAAGGCAAGCTCAGCGGGCCCCTGCTTGATCGCATCGACCTGCATGTG
GAGGTGCCCGCGTTGCCGCCCACCGACCTGCTCGACGCGCCACCGGGCGAGGCCAGCGCCGCTGTGCGCGAGCGCGTGGA
CCGCGCCCGCCTGGCAGCCATCACCCGCCAGGGCAAAACCAACCAGGCGCTGCAGGGCCAGGAAATCGACACCCACACCG
CCCTGGAGCCCGCCGCCACGCAGTTTCTGCAAACGGCGGCAGCGCGCCTGGGCTGGTCGGCCCGCAGCACCCACCGCACA
CTGAAGGTGGCGCGCACCATTGCCGACCTGGCCGGTGCCGCCAGCACCCAGGTGGCGCATGTGGCAGAGGCCGTGCAATA
CCGGCGCGTGCTGCGCGGGCCTGCCGCCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

52.19

100

0.531

  comM Vibrio cholerae strain A1552

51.65

99.806

0.515

  comM Glaesserella parasuis strain SC1401

51.859

99.031

0.514

  comM Vibrio campbellii strain DS40M4

51.068

99.806

0.51

  comM Legionella pneumophila str. Paris

48.456

100

0.486

  comM Legionella pneumophila strain ERS1305867

48.456

100

0.486

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

43.048

100

0.438


Multiple sequence alignment