Detailed information    

In silico: bioinformatically predicted

Overview


Name: comGA
Type: Machinery gene
Locus tag: S101267_RS12735
Genome accession: NZ_CP021505
Coordinates: 2463400..2464470 (-)
Length: 356 a.a.
NCBI ID: WP_013352872.1
UniProt ID: A0A9P1JI17
Organism: Bacillus amyloliquefaciens strain SRCM101267
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
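
The coordinates and the protein length listed above can be cross-checked with a few lines of arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function name `cds_lengths` is illustrative and not part of any database API.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for a coding sequence.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that does not contribute a residue to the protein.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt_len, nt_len // 3 - 1


# comGA coordinates as listed in the overview
nt_len, aa_len = cds_lengths(2463400, 2464470)
print(nt_len, aa_len)  # 1071 356
```

Under these assumptions the result agrees with the 1071 bp and 356 a.a. reported for this record.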

Genomic Context


Location: 2458400..2469470
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  S101267_RS12685 (S101267_02546) sipW 2458462..2459046 (-) 585 WP_013352863.1 signal peptidase I SipW -
  S101267_RS12690 (S101267_02547) tapA 2459018..2459689 (-) 672 WP_013352864.1 amyloid fiber anchoring/assembly protein TapA -
  S101267_RS12695 (S101267_02548) - 2459947..2460276 (+) 330 WP_013352865.1 DUF3889 domain-containing protein -
  S101267_RS12700 (S101267_02549) - 2460317..2460496 (-) 180 WP_013352866.1 YqzE family protein -
  S101267_RS12705 (S101267_02550) comGG 2460550..2460927 (-) 378 WP_013352867.1 competence type IV pilus minor pilin ComGG Machinery gene
  S101267_RS12710 comGF 2460929..2461429 (-) 501 WP_013352868.1 competence type IV pilus minor pilin ComGF -
  S101267_RS12715 (S101267_02552) comGE 2461338..2461652 (-) 315 WP_014470662.1 competence type IV pilus minor pilin ComGE Machinery gene
  S101267_RS12720 (S101267_02553) comGD 2461636..2462073 (-) 438 WP_013352869.1 competence type IV pilus minor pilin ComGD Machinery gene
  S101267_RS12725 (S101267_02554) comGC 2462063..2462371 (-) 309 WP_013352870.1 competence type IV pilus major pilin ComGC Machinery gene
  S101267_RS12730 (S101267_02555) comGB 2462376..2463413 (-) 1038 WP_013352871.1 competence type IV pilus assembly protein ComGB Machinery gene
  S101267_RS12735 (S101267_02556) comGA 2463400..2464470 (-) 1071 WP_013352872.1 competence type IV pilus ATPase ComGA Machinery gene
  S101267_RS12740 (S101267_02557) - 2464664..2465614 (-) 951 WP_013352873.1 magnesium transporter CorA family protein -
  S101267_RS12745 (S101267_02558) - 2465761..2467062 (+) 1302 WP_013352874.1 hemolysin family protein -
  S101267_RS12750 (S101267_02559) - 2467100..2467927 (-) 828 WP_013352875.1 STAS domain-containing protein -
  S101267_RS12760 (S101267_02561) - 2468140..2468520 (-) 381 WP_013352876.1 Spx/MgsR family RNA polymerase-binding regulatory protein -
  S101267_RS12765 (S101267_02562) - 2468756..2469001 (+) 246 WP_003153078.1 DUF2626 domain-containing protein -
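
The displayed location appears to be the comGA coordinates extended by 5000 bp on each side, and adjacent genes in the comG cluster share a few bases at their boundaries. A minimal sketch of both calculations follows; the 5000 bp flank is inferred from the numbers in the table rather than stated by the source, and the helper names `flank_window` and `overlap_bp` are hypothetical.

```python
def flank_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend a 1-based inclusive interval by `flank` bases on each side."""
    return max(1, start - flank), end + flank


def overlap_bp(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of bases shared by two 1-based inclusive intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


comGA = (2463400, 2464470)
comGB = (2462376, 2463413)

print(flank_window(*comGA))      # (2458400, 2469470)
print(overlap_bp(comGB, comGA))  # 14
```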

Sequence


Protein


Length: 356 a.a.        Molecular weight: 40430.67 Da        Isoelectric Point: 9.0345

>NTDB_id=231405 S101267_RS12735 WP_013352872.1 2463400..2464470(-) (comGA) [Bacillus amyloliquefaciens strain SRCM101267]
MENIETYSRNIINEAYMARASDIHIVPRERDAFIHFRIGHALVKKRVLKKEECVRLISHFKFLSAMDIGERRKPQNGSLS
LPLPTETVHLRMSTLPTMNDESLVIRLLPKRQIPPLDKLSLFPGAGAALLSFLKHSHGLLLFTGPTGSGKTTTLYSLVNY
AKRHFNRRIVTLEDPVETRDEDVLQVQVNEKAGVTYSTGLKAILRHDPDMIILGEIRDAETAEIAVRAAMTGHLVLSSLH
TRDAKGAIYRLLEFGVNMTEIEQTVIAIAAQRLVDLTCPFCKTDNCSVYCSLYRQTRRAGIYELLYGKNLTQCFQEAKGE
HANFQYQTLRRLIRKGIALGYVTTDNYDRWVYHEAD

Nucleotide


Length: 1071 bp

>NTDB_id=231405 S101267_RS12735 WP_013352872.1 2463400..2464470(-) (comGA) [Bacillus amyloliquefaciens strain SRCM101267]
ATGGAGAATATTGAAACATACAGCAGGAACATCATCAACGAGGCCTATATGGCAAGGGCTTCAGATATACACATTGTGCC
GAGGGAAAGGGATGCTTTCATTCATTTTCGCATCGGTCATGCTCTTGTGAAAAAAAGAGTCTTAAAAAAAGAGGAATGCG
TAAGGCTTATTTCTCATTTTAAATTTTTATCAGCCATGGATATAGGAGAAAGACGAAAGCCGCAAAACGGATCACTGTCT
CTGCCGCTTCCAACGGAAACCGTTCATTTGAGGATGTCAACCTTGCCGACAATGAATGATGAAAGTCTTGTCATCAGGCT
ATTGCCTAAAAGGCAGATTCCCCCTCTGGATAAACTGTCCTTATTTCCGGGGGCAGGCGCCGCATTATTATCCTTTCTGA
AGCATTCGCACGGCCTGCTTCTTTTTACCGGCCCGACCGGATCTGGAAAAACAACCACCCTTTACTCGCTTGTCAACTAT
GCAAAACGGCATTTCAACCGGCGGATTGTCACGCTGGAAGATCCCGTAGAAACGAGAGATGAAGATGTTTTGCAAGTTCA
GGTGAATGAAAAAGCCGGAGTCACATATTCCACCGGTTTAAAAGCGATTTTAAGACATGACCCCGATATGATCATCTTAG
GGGAAATCAGAGATGCCGAGACGGCTGAAATCGCAGTCCGCGCGGCCATGACAGGACACCTCGTACTGTCAAGCCTGCAT
ACAAGAGATGCTAAGGGCGCCATATACAGGCTGCTGGAGTTCGGGGTGAACATGACGGAAATTGAACAAACCGTTATTGC
CATCGCCGCTCAGCGTCTGGTTGATTTAACGTGTCCGTTTTGCAAAACGGACAACTGTTCCGTGTACTGCAGTCTATACA
GGCAGACACGCCGTGCTGGGATTTATGAACTGCTTTACGGAAAAAATCTCACCCAGTGCTTTCAAGAAGCAAAAGGGGAG
CACGCCAATTTTCAATATCAGACGCTTCGCAGGCTGATCAGAAAAGGCATCGCGCTCGGTTATGTCACGACTGATAATTA
TGATCGGTGGGTGTATCATGAAGCGGATTAA
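
The nucleotide record can be checked against the protein record by translating it. The sketch below uses the standard genetic code written out by hand as an illustration (bacterial translation table 11 assigns the same amino acids to each codon). Only the first and last few codons of the sequence above are used here so that the example stays short; `translate` is an illustrative helper, not a library function.

```python
from itertools import product

# Standard genetic code, codons ordered by first, second, third base in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 24 and last 15 nucleotides of the comGA sequence shown above
print(translate("ATGGAGAATATTGAAACATACAGC"))  # MENIETYS
print(translate("CATGAAGCGGATTAA"))           # HEAD*
```

Both fragments match the start (MENIETYS) and end (HEAD) of the protein sequence, with the final TAA read as the stop codon.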


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comGA   Bacillus subtilis subsp. subtilis str. 168   82.022   100   0.82
  pilB   Glaesserella parasuis strain SC1401   38.506   97.753   0.376
  pilF   Thermus thermophilus HB27   38   98.315   0.374
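
The page does not define the Ha-value, but the three listed values are numerically consistent with the product of identity and coverage expressed as fractions. The sketch below only checks that observation against the rows above; it is an assumption about how the column might be derived, not the database's documented formula, and `identity_coverage_product` is a hypothetical name.

```python
def identity_coverage_product(identity_pct: float, coverage_pct: float) -> float:
    """Product of identity and coverage, each converted from percent to a fraction."""
    return (identity_pct / 100) * (coverage_pct / 100)


# (protein, identity %, coverage %, listed Ha-value) as shown in the table
rows = [
    ("comGA", 82.022, 100.0, 0.82),
    ("pilB", 38.506, 97.753, 0.376),
    ("pilF", 38.0, 98.315, 0.374),
]

for name, identity, coverage, listed in rows:
    computed = identity_coverage_product(identity, coverage)
    print(f"{name}: computed {computed:.3f}, listed {listed}")
```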


Multiple sequence alignment