Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comGA
Type: Machinery gene
Locus tag: M493_RS11715
Genome accession: NC_022080
Coordinates: 2356537..2357628 (-)
Length: 363 a.a.
NCBI ID: WP_041267964.1
Uniprot ID: -
Organism: Geobacillus genomosp. 3
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
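
The coordinates and protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it matches the 1092 bp nucleotide length given in this record) and that the final codon is a stop codon.

```python
# Values copied from the Overview fields of this record.
start, end = 2356537, 2357628   # assumed 1-based, inclusive coordinates
protein_len = 363               # protein length in amino acids

gene_len_bp = end - start + 1   # inclusive span on the genome
codons = gene_len_bp // 3       # total codons, including the stop codon

print(gene_len_bp)                 # 1092
print(codons - 1 == protein_len)   # True: 364 codons minus one stop codon
```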

Genomic Context


Location: 2351537..2362628
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
M493_RS11675 (M493_12600) | - | 2352176..2352970 (+) | 795 | WP_020960561.1 | YqhG family protein | -
M493_RS11680 (M493_12605) | - | 2353047..2353226 (-) | 180 | WP_020960562.1 | YqzE family protein | -
M493_RS11685 (M493_12610) | comGG | 2353274..2353663 (-) | 390 | WP_020960563.1 | competence type IV pilus minor pilin ComGG | -
M493_RS11690 (M493_12615) | comGF | 2353992..2354432 (-) | 441 | WP_020960564.1 | competence type IV pilus minor pilin ComGF | -
M493_RS11695 (M493_12620) | - | 2354411..2354737 (-) | 327 | WP_020960565.1 | type II secretion system protein | -
M493_RS11700 (M493_12625) | comGD | 2354721..2355158 (-) | 438 | WP_020960566.1 | competence type IV pilus minor pilin ComGD | -
M493_RS11705 (M493_12630) | comGC | 2355148..2355444 (-) | 297 | WP_020960567.1 | competence type IV pilus major pilin ComGC | Machinery gene
M493_RS11710 (M493_12635) | comGB | 2355512..2356540 (-) | 1029 | WP_020960568.1 | competence type IV pilus assembly protein ComGB | -
M493_RS11715 (M493_12640) | comGA | 2356537..2357628 (-) | 1092 | WP_041267964.1 | competence type IV pilus ATPase ComGA | Machinery gene
M493_RS11720 (M493_12645) | - | 2357806..2358504 (+) | 699 | WP_020960570.1 | helix-turn-helix transcriptional regulator | -
M493_RS11725 (M493_12650) | - | 2358669..2358911 (+) | 243 | WP_008879854.1 | DUF2626 family protein | -
M493_RS11730 (M493_12655) | - | 2358942..2360069 (-) | 1128 | WP_020960571.1 | class I SAM-dependent methyltransferase | -
M493_RS11735 (M493_12660) | - | 2360328..2360960 (-) | 633 | WP_020960572.1 | MBL fold metallo-hydrolase | -
M493_RS17760 (M493_12670) | - | 2361140..2361316 (+) | 177 | WP_020960573.1 | DUF2759 domain-containing protein | -
M493_RS11745 (M493_12675) | - | 2361351..2362514 (-) | 1164 | WP_020960574.1 | M14 family metallopeptidase | -
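
The Location window appears to extend 5,000 bp on each side of comGA. The flank size is inferred from the coordinates rather than stated in the record, so treat `FLANK` below as an assumption. The same interval arithmetic also shows whether neighbouring genes overlap. The helper `overlap_bp` is a hypothetical name introduced for this illustration.

```python
FLANK = 5000  # assumed flank size, inferred from the listed coordinates

comGA = (2356537, 2357628)  # inclusive coordinates from the table
comGB = (2355512, 2356540)

# Context window: gene span extended by FLANK on both sides.
window = (comGA[0] - FLANK, comGA[1] + FLANK)
print(window)  # (2351537, 2362628)

def overlap_bp(a, b):
    """Number of bases shared by two inclusive intervals (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)

print(overlap_bp(comGB, comGA))  # 4
```

By these coordinates, the end of comGA and the start of comGB (both on the minus strand) share 4 bp.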

Sequence


Protein


Length: 363 a.a.        Molecular weight: 40584.79 Da        Isoelectric Point: 7.7655

>NTDB_id=61055 M493_RS11715 WP_041267964.1 2356537..2357628(-) (comGA) [Geobacillus genomosp. 3]
MEGREYALDDIEQVANRLLAEAVQRRASDLHLIPRRHDAAIRLRVDGMLIDVGTLPKEIAERVIVHFKFLADMDIGERRR
PQSGAMEVNESGETVYLRLSTLPTLYDESLVIRLLPQRFSLPLRELSLFSHSTARLFSFMQQPQGLILLTGPTGSGKTTT
LYTLLDVCQAERQRNIITLEDPVEKRNDRFLQVQINEKAGITYAASLKAALRHDPDVLMVGEIRDHDTAAIAVRSSLSGH
LVVSTMHAADAVGAVYRLHEFGVPPGDLAETLLAVSAQRLVELCCPLCGDDCHPACRRLQRRRRAAVHELLYGPELMNVI
RSLSDSDGRRPRRHMTLGRLIRKGIALGYLPTRALELAEGGER

Nucleotide


Length: 1092 bp

>NTDB_id=61055 M493_RS11715 WP_041267964.1 2356537..2357628(-) (comGA) [Geobacillus genomosp. 3]
ATGGAAGGGAGGGAATATGCGCTGGATGATATCGAACAAGTAGCGAACCGTCTCCTCGCTGAAGCGGTGCAGCGCCGCGC
CTCTGACCTTCACCTCATTCCGCGCCGCCATGATGCGGCCATCCGCCTCCGTGTTGACGGCATGCTCATCGATGTTGGTA
CGCTCCCGAAAGAGATCGCCGAACGCGTCATCGTGCATTTCAAATTTTTAGCCGACATGGATATCGGTGAACGGCGCCGT
CCGCAAAGCGGGGCGATGGAAGTAAACGAATCCGGGGAAACGGTCTATTTGCGCTTATCGACGCTGCCGACGCTTTACGA
CGAAAGCCTCGTCATCCGCCTCTTGCCGCAGCGCTTTTCGCTGCCGCTTCGCGAACTATCCTTATTTTCCCACTCCACCG
CACGATTGTTTTCCTTTATGCAGCAGCCGCAAGGGCTCATATTGCTGACCGGCCCGACCGGGTCGGGAAAGACGACGACG
TTGTACACCCTTCTTGATGTTTGTCAAGCGGAAAGACAGCGCAACATCATCACATTGGAAGATCCAGTCGAAAAACGAAA
TGACCGGTTTTTGCAAGTGCAGATCAATGAAAAAGCAGGCATCACGTATGCGGCCAGCTTGAAAGCCGCTTTGCGCCATG
ATCCGGATGTGTTAATGGTCGGGGAAATCCGCGATCATGATACGGCGGCCATCGCCGTGCGGTCATCGTTGAGCGGACAC
TTGGTCGTGTCGACCATGCATGCCGCGGATGCCGTCGGTGCGGTGTACCGGCTGCATGAATTCGGTGTGCCACCCGGCGA
CTTGGCTGAAACGCTGCTTGCCGTTTCGGCGCAGCGCCTTGTCGAGCTATGCTGTCCGCTATGTGGCGACGATTGCCATC
CGGCGTGTCGCCGGCTTCAACGCAGACGGCGTGCCGCCGTCCATGAACTGCTTTACGGACCGGAGCTTATGAATGTCATC
CGTTCGCTGTCGGACAGTGATGGGCGACGGCCCCGCCGCCATATGACATTAGGGCGCCTCATCCGCAAGGGAATCGCGCT
TGGTTATTTGCCGACCCGCGCGCTTGAGCTTGCCGAAGGGGGAGAGCGATGA
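
The nucleotide record is given in coding orientation (it begins with ATG even though the gene lies on the minus strand), so it can be translated directly. The following is a minimal sketch that assumes the standard codon assignments apply. `translate` is a hypothetical helper, and only the first and last few codons of the sequence are checked here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

first_codons = "ATGGAAGGGAGGGAATATGCGCTGGATGAT"  # start of the record
last_codons = "GCCGAAGGGGGAGAGCGATGA"            # end of the record, with stop

print(translate(first_codons))  # MEGREYALDD
print(translate(last_codons))   # AEGGER
```

Both fragments agree with the start (MEGREYALDD) and end (AEGGER) of the protein sequence listed above.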


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comGA | Bacillus subtilis subsp. subtilis str. 168 | 50.432 | 95.592 | 0.482


Multiple sequence alignment