Detailed information    

In silico: bioinformatically predicted

Overview


Name: comGA
Type: Machinery gene
Locus tag: BSM4216_RS10450
Genome accession: NZ_CP012024
Coordinates: 2226322..2227401 (-)
Length: 359 a.a.
NCBI ID: WP_048623682.1
UniProt ID: -
Organism: Bacillus smithii strain DSM 4216
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake

Genomic Context


Location: 2221322..2232401
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
BSM4216_RS10410 (BSM4216_2540) | - | 2221799..2222584 (+) | 786 | WP_003355322.1 | YqhG family protein | -
BSM4216_RS10415 (BSM4216_2542) | - | 2222880..2223380 (-) | 501 | WP_048623676.1 | shikimate kinase | -
BSM4216_RS10420 (BSM4216_2543) | - | 2223395..2223763 (-) | 369 | WP_048623677.1 | hypothetical protein | -
BSM4216_RS10425 (BSM4216_2544) | comGF | 2223756..2224184 (-) | 429 | WP_048623678.1 | competence type IV pilus minor pilin ComGF | -
BSM4216_RS16760 (BSM4216_2545) | - | 2224195..2224509 (-) | 315 | WP_048623679.1 | prepilin-type N-terminal cleavage/methylation domain-containing protein | -
BSM4216_RS10435 (BSM4216_2546) | comGD | 2224549..2224986 (-) | 438 | WP_048623680.1 | competence type IV pilus minor pilin ComGD | -
BSM4216_RS10440 (BSM4216_2547) | comGC | 2224989..2225303 (-) | 315 | WP_003355330.1 | competence type IV pilus major pilin ComGC | -
BSM4216_RS10445 (BSM4216_2548) | comGB | 2225322..2226389 (-) | 1068 | WP_048623681.1 | competence type IV pilus assembly protein ComGB | -
BSM4216_RS10450 (BSM4216_2549) | comGA | 2226322..2227401 (-) | 1080 | WP_048623682.1 | competence type IV pilus ATPase ComGA | Machinery gene
BSM4216_RS10455 (BSM4216_2550) | - | 2227509..2229083 (-) | 1575 | WP_174521015.1 | bifunctional UDP-sugar hydrolase/5'-nucleotidase | -
BSM4216_RS10460 (BSM4216_2551) | - | 2229920..2230162 (+) | 243 | WP_003355335.1 | DUF2626 domain-containing protein | -
BSM4216_RS10465 (BSM4216_2553) | - | 2230642..2231277 (-) | 636 | WP_003355341.1 | MBL fold metallo-hydrolase | -
BSM4216_RS16375 (BSM4216_2554) | - | 2231464..2231634 (+) | 171 | WP_048623684.1 | DUF2759 domain-containing protein | -
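
The coordinates in this record are 1-based and inclusive, so sizes, the displayed window, and overlaps between neighbouring genes can be checked with simple arithmetic. The following is a minimal Python sketch of those checks. The helper names (`parse_span`, `span_length`, `overlap`) are hypothetical, and the 5000 bp flank is an assumption inferred from the numbers on this page, not something the page states.

```python
import re

def parse_span(text):
    """Parse 'start..end', optionally followed by a strand, into a (start, end) tuple."""
    match = re.match(r"\s*(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"not a coordinate span: {text!r}")
    return int(match.group(1)), int(match.group(2))

def span_length(span):
    """Length in bp of a 1-based inclusive span."""
    start, end = span
    return end - start + 1

def overlap(a, b):
    """Number of positions shared by two inclusive spans (0 if they are disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)

# Values copied from this record.
comGA = parse_span("2226322..2227401 (-)")
comGB = parse_span("2225322..2226389 (-)")
window = parse_span("2221322..2232401")

FLANK = 5000  # assumption: flank size inferred from the Location line

print("comGA size (bp):", span_length(comGA))
# The nucleotide length includes the stop codon, hence the -1.
print("comGA protein length (a.a.):", span_length(comGA) // 3 - 1)
print("window is gene +/- flank:", window == (comGA[0] - FLANK, comGA[1] + FLANK))
print("comGB/comGA overlap (bp):", overlap(comGA, comGB))
```

With the values on this page, the sketch reproduces the listed 1080 bp and 359 a.a. for comGA, and it shows that the annotated comGB and comGA spans share 68 bp.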

Sequence


Protein


Length: 359 a.a.
Molecular weight: 40700.24 Da
Isoelectric point: 9.6956

>NTDB_id=150511 BSM4216_RS10450 WP_048623682.1 2226322..2227401(-) (comGA) [Bacillus smithii strain DSM 4216]
MSYERNVEQLVEDALSLFATDIHFEPKTHGYQVQLRTHGLLTPFLKLTVKEGERLISQFKYMASMDISEKRKPQSGSLLL
ETKKGKIALRVSTLPSVLQRESLAIRLSPQKSFMDVYRLSLFPNSVKKLLALLRYSHGLMIFTGPTGSGKTTTLYSLVSH
RTRHFNQKVITLEDPVERQNDGWLQLQVNEKAGLTYSAGLKAILRHDPDIVIVGEIRDEETARITMRAALTGHLVLTTMH
TKDAKGAIYRLLEFGIEWHDIQQTLVAVSAQRLVRLLCPFCGESCSPYCPSQFRQKRATIYEIISGAVLQEVLKEAVGKE
GRYTYPTLKQLIRKGIGLGFLGNDELDRWVLEEETPKKI

Nucleotide


Length: 1080 bp

>NTDB_id=150511 BSM4216_RS10450 WP_048623682.1 2226322..2227401(-) (comGA) [Bacillus smithii strain DSM 4216]
TTGTCCTATGAAAGAAATGTGGAACAACTAGTGGAAGACGCGCTTTCCTTGTTTGCAACAGATATTCATTTTGAGCCCAA
AACACATGGATATCAAGTTCAATTACGCACTCACGGATTGTTGACGCCTTTTCTGAAGCTGACAGTCAAGGAAGGAGAAC
GGCTGATTTCCCAATTCAAATATATGGCCTCGATGGATATCAGCGAAAAAAGAAAGCCTCAGAGCGGGTCCCTTCTATTA
GAAACGAAGAAAGGAAAAATCGCTCTCCGCGTTTCCACTCTTCCGAGCGTGCTTCAGAGAGAAAGTTTGGCCATTCGGCT
TTCACCGCAAAAATCCTTCATGGATGTTTATCGTCTGTCTCTCTTTCCAAACTCCGTAAAAAAGCTTTTAGCTCTTCTCC
GTTATTCCCACGGTCTAATGATTTTCACCGGTCCAACCGGCAGCGGCAAAACCACTACACTCTATTCTCTTGTTTCCCAT
AGAACTCGTCATTTTAACCAGAAAGTAATTACGTTGGAGGACCCTGTGGAGCGCCAGAATGACGGCTGGCTCCAACTTCA
AGTGAACGAGAAAGCGGGTCTGACATATTCAGCGGGGCTTAAGGCTATTCTCCGCCATGATCCCGATATTGTCATTGTCG
GAGAAATACGTGATGAAGAAACCGCCCGCATCACTATGAGAGCGGCGTTAACCGGTCATTTAGTTTTGACCACGATGCAT
ACAAAGGATGCCAAAGGAGCTATTTATCGGTTGCTTGAATTTGGCATTGAATGGCATGATATTCAACAGACGCTTGTGGC
GGTTTCGGCTCAAAGGCTTGTACGGCTTTTATGCCCTTTTTGCGGGGAATCTTGTTCTCCCTATTGTCCTAGTCAGTTTC
GTCAAAAGCGCGCTACCATTTATGAGATCATCAGCGGTGCAGTACTGCAAGAAGTTTTAAAGGAAGCCGTTGGAAAGGAA
GGACGATACACTTATCCAACGCTGAAACAACTTATACGAAAGGGGATCGGTCTTGGATTTCTTGGGAATGATGAATTGGA
TCGCTGGGTTTTGGAAGAGGAAACACCGAAAAAGATATGA
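
The nucleotide record begins with TTG rather than ATG, while the protein record begins with M. This is consistent with TTG acting as an alternative bacterial start codon that is read as methionine at the first position. The following is a minimal Python sketch that translates a coding sequence under that convention. The function name `translate_cds` and the chosen set of start codons are illustrative assumptions, and the sequence fragments are the first 27 and last 21 bases of the record above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a small set of common bacterial start codons, not an exhaustive list.
START_CODONS = {"ATG", "GTG", "TTG"}

def translate_cds(cds):
    """Translate a coding-strand sequence; the first codon is read as Met if it is a start codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    if residues and cds[:3] in START_CODONS:
        residues[0] = "M"  # initiator codon is read as methionine
    if residues and residues[-1] == "*":
        residues.pop()  # drop the terminal stop codon
    return "".join(residues)

head = "TTGTCCTATGAAAGAAATGTGGAACAA"  # first 9 codons of the record
tail = "GAAACACCGAAAAAGATATGA"        # last 7 codons, including the TGA stop

print(translate_cds(head))
print(translate_cds(tail))
```

The two fragments translate to the start (MSYERNVEQ) and the end (ETPKKI) of the protein sequence listed above. Passing the full 1080 bp sequence with line breaks removed should give the whole 359-residue protein.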


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comGA | Bacillus subtilis subsp. subtilis str. 168 | 53.955 | 98.607 | 0.532
pilB | Glaesserella parasuis strain SC1401 | 38.439 | 96.379 | 0.37
