Detailed information    

In silico: bioinformatically predicted

Overview


Name: comGA
Type: Machinery gene
Locus tag: LS684_RS14375
Genome accession: NZ_CP089997
Coordinates: 2828505..2829572 (-)
Length: 355 a.a.
NCBI ID: WP_233806939.1
Uniprot ID: -
Organism: Cytobacillus spongiae strain CY-G
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
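
The coordinates and the protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes inclusive, 1-based coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length_bp(2828505, 2829572)
print(length_bp)                     # 1068
print(protein_length_aa(length_bp))  # 355
```

Both values agree with the lengths reported for the nucleotide and protein sequences in this record.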

Genomic Context


Location: 2823505..2834572 (comGA plus 5,000 bp of flanking sequence on each side)
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
LS684_RS14325 (LS684_14325)   -   2823652..2824443 (+)   792   WP_233806927.1   YqhG family protein   -
LS684_RS14330 (LS684_14330)   -   2824459..2824641 (-)   183   WP_233806928.1   DUF4083 family protein   -
LS684_RS14335 (LS684_14335)   -   2824868..2825047 (-)   180   WP_233806929.1   YqzE family protein   -
LS684_RS14340 (LS684_14340)   -   2825091..2825588 (-)   498   WP_233806930.1   shikimate kinase   -
LS684_RS14345 (LS684_14345)   comGG   2825606..2825983 (-)   378   WP_233806932.1   competence type IV pilus minor pilin ComGG   -
LS684_RS14350 (LS684_14350)   comGF   2825980..2826390 (-)   411   WP_267930264.1   competence type IV pilus minor pilin ComGF   -
LS684_RS14355 (LS684_14355)   -   2826404..2826730 (-)   327   WP_233806935.1   hypothetical protein   -
LS684_RS14360 (LS684_14360)   comGD   2826711..2827154 (-)   444   WP_233806936.1   competence type IV pilus minor pilin ComGD   -
LS684_RS14365 (LS684_14365)   comGC   2827154..2827468 (-)   315   WP_233806937.1   competence type IV pilus major pilin ComGC   -
LS684_RS14370 (LS684_14370)   comGB   2827484..2828515 (-)   1032   WP_233806938.1   competence type IV pilus assembly protein ComGB   -
LS684_RS14375 (LS684_14375)   comGA   2828505..2829572 (-)   1068   WP_233806939.1   competence type IV pilus ATPase ComGA   Machinery gene
LS684_RS14385 (LS684_14385)   -   2829865..2830245 (-)   381   WP_233806940.1   Spx/MgsR family RNA polymerase-binding regulatory protein   -
LS684_RS14390 (LS684_14390)   -   2830583..2831284 (+)   702   WP_233806942.1   helix-turn-helix domain-containing protein   -
LS684_RS14395 (LS684_14395)   -   2831366..2831608 (+)   243   WP_233806944.1   DUF2626 domain-containing protein   -
LS684_RS14400 (LS684_14400)   -   2831854..2832945 (-)   1092   WP_233806946.1   SAM-dependent methyltransferase   -
LS684_RS14405 (LS684_14405)   -   2833101..2833736 (-)   636   WP_233806948.1   MBL fold metallo-hydrolase   -
LS684_RS14410 (LS684_14410)   -   2833820..2834080 (+)   261   WP_233806950.1   hypothetical protein   -

Sequence


Protein


Length: 355 a.a.   Molecular weight: 39920.34 Da   Isoelectric point: 8.5746

>NTDB_id=639915 LS684_RS14375 WP_233806939.1 2828505..2829572(-) (comGA) [Cytobacillus spongiae strain CY-G]
MSSIKRLAYKVLSDAVQLNASDLHIIPRENDSLIQQRVGNKLVPQMCIPKEECDRLISHLKFTASMDIGERRRPQSGAFT
LTIEGKLIGFRLSTLPAKFSESLVIRILPQQEQIPFFQISLFPNSSQKLLALLKHAHGLIIFTGPTGSGKTSTLYSLLHE
TSHLYNRNVITLEDPIEKNCDTVLQVQVNEKAGMSYTTGLKAILRHDPDIIMVGEIRDAETAEIAVRASLTGHLVLSTMH
ARDAKGAIFRLLEFGVNWLEIEQTLVAVTAQRLVELTCPFCKGECSPFCYSQRKVKRASVFELLTGRTLSAVLREAKGIE
SNYHYPTLKDVIKKGIALGFIKETEYERWVFEHEK

Nucleotide


Length: 1068 bp

>NTDB_id=639915 LS684_RS14375 WP_233806939.1 2828505..2829572(-) (comGA) [Cytobacillus spongiae strain CY-G]
TTGAGCTCGATAAAAAGACTTGCGTATAAGGTTTTATCGGATGCTGTTCAGTTAAATGCATCGGACCTTCATATTATTCC
TCGAGAAAACGATTCACTCATTCAGCAACGAGTTGGCAACAAACTAGTCCCTCAAATGTGTATACCAAAGGAAGAATGTG
ACAGATTGATTTCTCATTTAAAGTTCACTGCTTCGATGGACATTGGTGAACGAAGAAGACCTCAAAGCGGTGCCTTTACG
TTAACCATTGAAGGCAAGCTTATTGGATTTAGACTCTCCACTCTTCCTGCAAAGTTTTCAGAAAGCCTCGTCATTCGAAT
TCTTCCACAGCAAGAACAAATCCCTTTCTTTCAAATTTCCCTTTTTCCAAACTCTTCACAAAAATTATTAGCCTTATTAA
AGCATGCCCATGGACTTATTATCTTTACAGGTCCAACTGGAAGCGGAAAGACCTCGACCCTCTATTCACTGCTACATGAA
ACCTCCCATTTATATAACCGCAATGTGATCACATTAGAAGACCCAATCGAAAAAAATTGCGATACTGTACTTCAGGTTCA
AGTGAATGAGAAAGCAGGGATGTCCTATACAACAGGCCTTAAAGCGATATTAAGACATGATCCTGATATCATTATGGTGG
GGGAAATTCGAGATGCAGAAACAGCAGAGATTGCTGTACGCGCATCTTTAACCGGGCATTTAGTGTTAAGTACAATGCAT
GCGAGAGATGCTAAAGGAGCGATTTTTCGATTGTTGGAGTTCGGTGTTAATTGGTTAGAAATCGAACAAACCCTTGTAGC
GGTAACTGCACAGCGGTTAGTGGAATTAACTTGTCCATTCTGTAAAGGGGAATGTTCACCATTTTGTTATAGCCAGCGGA
AGGTAAAGCGTGCAAGTGTATTTGAACTGCTTACAGGAAGAACATTATCAGCTGTTTTGAGAGAAGCAAAAGGGATAGAA
AGTAACTATCATTATCCTACTCTAAAGGATGTGATAAAAAAGGGGATTGCTTTAGGATTTATTAAAGAAACAGAATATGA
ACGCTGGGTATTTGAACATGAAAAGTGA
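
The nucleotide record above starts with TTG rather than ATG, while the protein record starts with M. A minimal sketch of how the two relate is given below. It assumes the standard codon assignments, that the sequence is already given in coding orientation (as this record appears to be, despite the minus-strand coordinates), and that an initiator codon from a small, illustrative set (ATG, GTG, TTG) is read as methionine. The names `translate_cds` and `START_CODONS` are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a minimal set of bacterial initiator codons, all read as Met.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence given in 5'->3' coding orientation."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    protein = "".join(CODON_TABLE[codon] for codon in codons)
    # An initiator codon is decoded as methionine at position 1.
    if codons and codons[0] in START_CODONS:
        protein = "M" + protein[1:]
    # Drop the terminal stop codon(s), if present.
    return protein.rstrip("*")


# First 18 nt of the comGA record above.
print(translate_cds("TTGAGCTCGATAAAAAGA"))  # MSSIKR
```

Passing the full 1068 bp sequence (with line breaks removed) to the same function should give a 355-residue protein for comparison with the protein record.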


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
comGA   Bacillus subtilis subsp. subtilis str. 168   56.461   100   0.566
pilB   Glaesserella parasuis strain SC1401   39.941   95.211   0.38
pilB   Haemophilus influenzae 86-028NP   38.218   98.028   0.375