Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comM
Type: Machinery gene
Locus tag: ACGI6K_RS19960
Genome accession: NZ_CP171449
Coordinates: 4117763..4119259 (+)
Length: 498 a.a.
NCBI ID: WP_376944608.1
UniProt ID: -
Organism: Azorhizophilus paspali strain ATCC 23833
Function: DNA uptake (predicted from homology)
DNA binding and uptake
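
The coordinates and the protein length listed in this overview are mutually consistent, which a short check makes explicit. This is a minimal sketch with illustrative variable names; it assumes the coordinates are 1-based and inclusive and that the annotated span ends with a stop codon.

```python
# Coordinates of comM as listed in the overview (assumed 1-based, inclusive).
start, end = 4117763, 4119259

# Inclusive span length in base pairs.
length_bp = end - start + 1

# Number of codons in the span; the last codon is assumed to be the stop
# codon, so it does not contribute an amino acid.
codons, remainder = divmod(length_bp, 3)
length_aa = codons - 1

print(length_bp, remainder, length_aa)
```

Under these assumptions the span is 1497 bp, a whole number of codons, and encodes 498 residues, matching the lengths given for the nucleotide and protein sequences.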

Genomic Context


Location: 4112763..4124259
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACGI6K_RS19925 (ACGI6K_19925) - 4112803..4113504 (+) 702 WP_376944597.1 HAD family hydrolase -
  ACGI6K_RS19930 (ACGI6K_19930) - 4113550..4114317 (+) 768 Protein_3905 CPBP family intramembrane glutamic endopeptidase -
  ACGI6K_RS19935 (ACGI6K_19935) sutA 4114384..4114704 (-) 321 WP_376944599.1 transcriptional regulator SutA -
  ACGI6K_RS19940 (ACGI6K_19940) - 4114829..4115254 (-) 426 WP_376944601.1 secondary thiamine-phosphate synthase enzyme YjbQ -
  ACGI6K_RS19945 (ACGI6K_19945) - 4115404..4116720 (-) 1317 WP_376944603.1 ammonium transporter -
  ACGI6K_RS19950 (ACGI6K_19950) glnK 4116753..4117091 (-) 339 WP_012703245.1 P-II family nitrogen regulator -
  ACGI6K_RS19955 (ACGI6K_19955) - 4117456..4117731 (+) 276 WP_376944606.1 accessory factor UbiK family protein -
  ACGI6K_RS19960 (ACGI6K_19960) comM 4117763..4119259 (+) 1497 WP_376944608.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  ACGI6K_RS19965 (ACGI6K_19965) - 4119333..4120511 (+) 1179 Protein_3912 AAA family ATPase -
  ACGI6K_RS19970 (ACGI6K_19970) - 4120508..4121389 (-) 882 WP_376944610.1 LysR family transcriptional regulator -
  ACGI6K_RS19975 (ACGI6K_19975) - 4121504..4121926 (+) 423 WP_376944612.1 DoxX family protein -
  ACGI6K_RS19980 (ACGI6K_19980) - 4121976..4122704 (+) 729 WP_376944614.1 pirin family protein -
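
The displayed location appears to be the comM span extended by 5000 bp on each side. The sketch below checks that reading and computes the intergenic gaps to the two adjacent genes. The 5000 bp flank is an assumption inferred from the numbers, not a documented setting, and the variable names are illustrative.

```python
# comM coordinates and the displayed context window, as listed above.
gene_start, gene_end = 4117763, 4119259
window = (4112763, 4124259)

# Assumption: the window is the gene span plus a fixed flank on each side.
flank = 5000
derived_window = (gene_start - flank, gene_end + flank)

# Adjacent genes from the table (coordinates are 1-based, inclusive).
upstream = (4117456, 4117731)    # ACGI6K_RS19955, UbiK family protein
downstream = (4119333, 4120511)  # ACGI6K_RS19965, AAA family ATPase

# Intergenic gap = number of bases strictly between the two features.
gap_upstream = gene_start - upstream[1] - 1
gap_downstream = downstream[0] - gene_end - 1

print(derived_window == window, gap_upstream, gap_downstream)
```

With these coordinates the derived window equals the listed location, and comM is separated from its neighbours by 31 bp upstream and 73 bp downstream.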

Sequence


Protein


Length: 498 a.a.        Molecular weight: 52422.36 Da        Isoelectric point: 8.3343

>NTDB_id=1061264 ACGI6K_RS19960 WP_376944608.1 4117763..4119259(+) (comM) [Azorhizophilus paspali strain ATCC 23833]
MSLAIVHSRAQVGVEAPAVTVEAHLANGLPALTLVGLPETAVRESKDRVRSAILTSGFDFPARRITLNLAPADLPKDGGR
FDLAIALGILAASEQLPAGSLDGLECLGELALSGGLRPVRGVLPAALAAHAAGRTLVVPRANAEEASLASGLNVLAIDHL
LELAAHLNGRTPLAPYRSSGLLRQTRPYPDLAEVQGQAAAKRALLVAAAGAHNLLLSGPPGTGKTLLASRLPGLLPPLDE
REALEVAAIHSVAGSAPLSAWPQRPFRQPHHSASGPALVGGGSRPRPGEITLAHRGVLFLDELPEFDRKVLEVLREPLES
GEIVIARASDKVRFPARFQLVAAMNPCPCGYLGDPAGRCRCTPEQIQRYRAKLSGPLLDRIDLHVGVARETTALGAPRLD
GPDSAGAAAQVALARNLQLARQGCPNAFLDLPGLHQHCALSDEDRQWLERACERLGLSLRAAHRVLKVARTLADLEALPS
IARAHLAEALQYRPAAHA

Nucleotide


Length: 1497 bp

>NTDB_id=1061264 ACGI6K_RS19960 WP_376944608.1 4117763..4119259(+) (comM) [Azorhizophilus paspali strain ATCC 23833]
ATGTCCCTGGCCATCGTCCACAGCCGTGCCCAGGTAGGCGTCGAGGCGCCCGCCGTCACCGTGGAGGCGCACCTGGCCAA
CGGTTTGCCGGCGTTGACCCTGGTCGGCCTGCCGGAAACCGCGGTCCGCGAGAGCAAGGATCGCGTGCGCAGCGCCATCC
TCACTTCCGGCTTCGATTTCCCGGCACGGCGCATCACCCTCAACCTGGCGCCCGCCGACCTGCCCAAGGACGGCGGACGC
TTCGACCTGGCCATCGCCCTCGGCATCCTCGCCGCCAGCGAGCAATTGCCCGCGGGGTCTCTGGACGGCCTGGAATGCCT
GGGCGAACTGGCCCTCTCCGGCGGCCTGCGCCCGGTTCGGGGCGTGCTGCCCGCCGCGCTGGCCGCGCACGCCGCCGGAC
GCACCCTGGTGGTGCCGCGGGCGAACGCCGAAGAGGCCAGCCTGGCGTCGGGTCTGAACGTGCTGGCGATCGACCACCTG
CTGGAACTGGCCGCCCACCTGAACGGCCGGACCCCGCTCGCGCCCTACCGGTCCAGCGGCCTGCTGCGACAGACGCGCCC
CTACCCCGACCTCGCCGAGGTGCAGGGCCAGGCCGCGGCCAAGCGCGCCCTGCTGGTGGCGGCGGCAGGAGCTCACAATC
TCCTGTTGAGCGGTCCGCCCGGAACCGGCAAGACCTTGCTGGCCAGCCGTCTGCCGGGCCTGCTGCCGCCTTTGGACGAA
CGCGAGGCGCTGGAGGTGGCGGCGATCCATTCGGTGGCCGGTAGCGCGCCGCTCTCCGCCTGGCCGCAGCGGCCGTTCCG
CCAGCCCCATCATAGCGCCTCGGGACCGGCGCTGGTCGGCGGCGGCAGCCGGCCGCGCCCCGGCGAAATCACCCTGGCGC
ACCGGGGCGTGCTGTTTCTCGACGAATTGCCCGAATTCGACCGCAAGGTGCTGGAGGTGCTGCGCGAGCCTCTGGAAAGC
GGCGAAATCGTTATCGCCCGGGCCAGCGATAAGGTGCGCTTTCCGGCGCGCTTCCAGTTGGTGGCGGCGATGAACCCCTG
TCCCTGCGGCTATCTGGGCGACCCCGCCGGCCGTTGTCGCTGCACCCCGGAACAGATCCAGCGTTACCGCGCCAAGCTGT
CCGGCCCGCTGCTCGACCGCATCGACCTGCACGTCGGCGTCGCCCGCGAGACCACCGCCCTGGGCGCGCCGCGCCTGGAC
GGCCCGGACAGCGCCGGCGCCGCCGCCCAGGTGGCGTTGGCGCGCAACCTGCAACTGGCCCGCCAGGGCTGCCCCAATGC
CTTCCTCGACCTGCCCGGGCTGCACCAGCACTGTGCACTGAGCGACGAAGACCGCCAGTGGCTGGAACGCGCCTGCGAAC
GCCTCGGCCTGTCGCTGCGCGCCGCCCACCGCGTCCTCAAGGTGGCACGCACCCTGGCCGATCTGGAGGCGCTGCCGAGC
ATCGCCCGCGCCCACCTGGCCGAAGCGCTGCAATACCGGCCGGCGGCGCATGCCTGA
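
As a quick consistency check between the two records, the start and end of the nucleotide sequence can be translated and compared with the protein sequence. This is a minimal sketch using only the standard library; the helper `translate` and the variable names are illustrative, and it assumes the standard codon assignments (NCBI translation table 1, whose codon-to-amino-acid mapping the bacterial table 11 shares).

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon ('*' marks a stop)."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 24 and last 12 nucleotides of the comM sequence shown above.
first_24_nt = "ATGTCCCTGGCCATCGTCCACAGC"
last_12_nt = "GCGCATGCCTGA"

print(translate(first_24_nt))  # MSLAIVHS
print(translate(last_12_nt))   # AHA*
```

The translated ends match the first eight residues (MSLAIVHS) and the last three residues (AHA) of the protein record, followed by the TGA stop codon.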


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4 56.74 99.799 0.566
  comM Vibrio cholerae strain A1552 56.338 99.799 0.562
  comM Haemophilus influenzae Rd KW20 55 100 0.552
  comM Glaesserella parasuis strain SC1401 54.2 100 0.544
  comM Legionella pneumophila str. Paris 50.198 100 0.508
  comM Legionella pneumophila strain ERS1305867 50.198 100 0.508
  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868 45.219 100 0.456


Multiple sequence alignment