Detailed information    

In silico: bioinformatically predicted

Overview


Name: comGA
Type: Machinery gene
Locus tag: IC803_RS04695
Genome accession: NZ_CP061474
Coordinates: 917728..918798 (+)
Length: 356 a.a.
NCBI ID: WP_190304293.1
UniProt ID: -
Organism: Geobacillus sp. 46C-IIa
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
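
The coordinates and the protein length above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the gene is a stop codon; the function name `span_length` is hypothetical and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


# Coordinates of comGA as listed in the overview.
gene_bp = span_length(917728, 918798)

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
codons = gene_bp // 3
protein_aa = codons - 1

print(f"gene length: {gene_bp} bp, codons: {codons}, protein: {protein_aa} a.a.")
```

Under these assumptions the result agrees with the 1071 bp and 356 a.a. reported in this record.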

Genomic Context


Location: 912728..923798
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
IC803_RS04665 (IC803_04665) | - | 912849..914012 (+) | 1164 | WP_081209506.1 | M14 family metallopeptidase | -
IC803_RS04670 (IC803_04670) | - | 914048..914224 (-) | 177 | WP_081209504.1 | DUF2759 domain-containing protein | -
IC803_RS04675 (IC803_04675) | - | 914405..915037 (+) | 633 | WP_081209502.1 | MBL fold metallo-hydrolase | -
IC803_RS04680 (IC803_04680) | - | 915268..916395 (+) | 1128 | WP_081209500.1 | class I SAM-dependent methyltransferase | -
IC803_RS04685 (IC803_04685) | - | 916426..916668 (-) | 243 | WP_008879854.1 | DUF2626 family protein | -
IC803_RS04690 (IC803_04690) | - | 916830..917528 (-) | 699 | WP_081209498.1 | metalloregulator ArsR/SmtB family transcription factor | -
IC803_RS04695 (IC803_04695) | comGA | 917728..918798 (+) | 1071 | WP_190304293.1 | competence type IV pilus ATPase ComGA | Machinery gene
IC803_RS04700 (IC803_04700) | comGB | 918795..919823 (+) | 1029 | WP_081209494.1 | competence type IV pilus assembly protein ComGB | -
IC803_RS04705 (IC803_04705) | comGC | 919887..920183 (+) | 297 | WP_081209492.1 | competence type IV pilus major pilin ComGC | -
IC803_RS04710 (IC803_04710) | comGD | 920167..920610 (+) | 444 | WP_223812022.1 | competence type IV pilus minor pilin ComGD | -
IC803_RS04715 (IC803_04715) | - | 920594..920920 (+) | 327 | WP_081209490.1 | competence protein ComG | -
IC803_RS04720 (IC803_04720) | comGF | 920899..921342 (+) | 444 | WP_081209488.1 | competence type IV pilus minor pilin ComGF | -
IC803_RS04725 (IC803_04725) | comGG | 921473..921862 (+) | 390 | WP_081209486.1 | competence type IV pilus minor pilin ComGG | -
IC803_RS04730 (IC803_04730) | - | 921908..922087 (+) | 180 | WP_081209484.1 | YqzE family protein | -
IC803_RS04735 (IC803_04735) | - | 922163..922957 (-) | 795 | WP_081209482.1 | YqhG family protein | -
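
The genomic context above can be explored numerically. The sketch below, which assumes 1-based inclusive coordinates, computes how far the displayed window extends on each side of comGA and how many base pairs adjacent genes of the comG cluster share. The helper `overlap_bp` and the list name `COM_CLUSTER` are hypothetical names introduced for illustration; the coordinates are copied from the table.

```python
# comGA coordinates and the displayed window ("Location") from this record.
GENE = (917728, 918798)
WINDOW = (912728, 923798)

flank_upstream = GENE[0] - WINDOW[0]
flank_downstream = WINDOW[1] - GENE[1]


def overlap_bp(a, b):
    """Number of positions shared by two 1-based inclusive ranges (0 if none)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


# Plus-strand genes from comGA to comGG, in table order.
COM_CLUSTER = [
    ("comGA", (917728, 918798)),
    ("comGB", (918795, 919823)),
    ("comGC", (919887, 920183)),
    ("comGD", (920167, 920610)),
    ("IC803_RS04715", (920594, 920920)),
    ("comGF", (920899, 921342)),
    ("comGG", (921473, 921862)),
]

# Overlap between each gene and the next one in the cluster.
overlaps = [
    overlap_bp(first[1], second[1])
    for first, second in zip(COM_CLUSTER, COM_CLUSTER[1:])
]

print("flanks:", flank_upstream, flank_downstream)
for (name_a, _), (name_b, _), shared in zip(COM_CLUSTER, COM_CLUSTER[1:], overlaps):
    print(f"{name_a} / {name_b}: {shared} bp shared")
```

Short overlaps between consecutive genes on the same strand, as computed here, are often seen in operons, though the calculation alone does not establish co-transcription.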

Sequence


Protein


Length: 356 a.a.        Molecular weight: 39588.86 Da        Isoelectric point: 9.2159

>NTDB_id=482558 IC803_RS04695 WP_190304293.1 917728..918798(+) (comGA) [Geobacillus sp. 46C-IIa]
MDDIEQVANRLLAEAVQRRASDLHLVPRRHDAAIRLRLDGMLTDVGTLPKETAERVIVHFKFLAGMDIGERRRPQSGAME
VNESGETVYLRLSTLPTLYDESLVIRLLPQRFSLPLRELSLFPQSTARLFSFMQQPQGLVLLTGPTGSGKTTTLYTLLDV
CQAERQRNIITLEDPIEKRNDRFLQVQINEKAGITYAASLKAALRHDPDVLMVGEIRDHDTAAIAVRSALTGHLVVSTMH
AADAVGAVYRLHEFGVPPGDLAETLLAVSAQRLVELCCPLCGDDCHPACRRLRRRRRAAVHELLYGPALLDAIRSLSDGD
GRRLRRHMTLARLIRKGIALGYLPVRALRLVGGEGR

Nucleotide


Length: 1071 bp

>NTDB_id=482558 IC803_RS04695 WP_190304293.1 917728..918798(+) (comGA) [Geobacillus sp. 46C-IIa]
CTGGATGATATTGAACAAGTAGCGAACCGTCTCCTTGCTGAAGCGGTGCAGCGCCGCGCCTCTGACCTTCACCTCGTCCC
GCGCCGCCATGATGCGGCCATTCGCCTCCGTCTTGACGGCATGCTCACCGATGTTGGCACGCTCCCGAAAGAAACCGCTG
AACGCGTCATCGTCCATTTCAAGTTTCTAGCCGGCATGGATATCGGCGAACGCCGCCGACCGCAAAGCGGCGCGATGGAA
GTGAATGAGTCCGGGGAAACGGTGTATTTGCGCTTATCGACGCTGCCGACGCTTTACGATGAAAGCCTCGTCATCCGCCT
TCTGCCGCAGCGCTTTTCGCTGCCGCTTCGCGAACTATCCTTATTTCCTCAATCCACCGCACGGTTGTTTTCCTTTATGC
AGCAGCCGCAAGGGCTCGTACTGTTGACCGGACCGACCGGGTCGGGAAAGACGACGACATTGTATACGCTTCTTGACGTT
TGCCAAGCGGAAAGACAACGCAACATCATCACGCTGGAAGATCCGATCGAAAAACGGAATGACCGGTTTTTGCAAGTGCA
GATCAATGAAAAAGCGGGCATCACGTATGCGGCGAGTTTGAAAGCCGCTTTGCGCCATGACCCTGACGTGTTAATGGTCG
GCGAGATCCGCGACCATGATACGGCGGCGATCGCCGTGCGATCGGCGTTAACTGGACATTTAGTCGTGTCGACCATGCAT
GCTGCCGATGCGGTCGGCGCGGTGTACCGGCTGCATGAGTTCGGGGTGCCGCCCGGCGATTTAGCGGAGACGTTGCTCGC
CGTTTCGGCGCAGCGTCTCGTTGAACTGTGCTGTCCGCTATGCGGCGACGATTGCCATCCGGCGTGTCGCCGGCTTCGAC
GCAGACGGCGCGCTGCCGTCCATGAACTGCTTTACGGCCCGGCGCTTCTTGATGCCATCCGTTCACTGTCGGACGGAGAC
GGGCGACGGCTCCGCCGCCATATGACATTAGCCCGCCTCATCCGTAAAGGAATCGCGCTCGGTTATTTGCCGGTCCGCGC
GCTTAGACTTGTCGGAGGGGAAGGACGATGA
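
The nucleotide record begins with CTG, while the protein record begins with M. A minimal translation sketch is shown below, assuming the standard codon assignments and assuming that the initiating codon is read as methionine even when it is not ATG (CTG is among the alternative start codons of the bacterial translation table, NCBI table 11). The names `translate`, `PREFIX` and `SUFFIX` are hypothetical; the two fragments are the first 30 and last 9 nucleotides of the sequence above.

```python
from itertools import product

# Standard codon assignments in TCAG order (the same amino acid
# assignments apply to NCBI translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str, force_start_met: bool = True) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon.

    If force_start_met is True, the first codon is reported as 'M'
    (assumption: it is the initiating codon of a bacterial gene).
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3)]
    if force_start_met and residues:
        residues[0] = "M"
    return "".join(residues)


PREFIX = "CTGGATGATATTGAACAAGTAGCGAACCGT"  # first 30 nt of the record
SUFFIX = "GGACGATGA"                       # last 9 nt of the record

print(translate(PREFIX))
print(translate(SUFFIX, force_start_met=False))
```

With these assumptions the prefix yields the first ten residues of the protein record, and the suffix yields its last two residues followed by a stop.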


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comGA | Bacillus subtilis subsp. subtilis str. 168 | 51.594 | 96.91 | 0.5
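
The page does not define the Ha-value. The numbers listed for the Bacillus subtilis homolog are, however, consistent with the product of identity and coverage expressed as fractions. The sketch below only checks that arithmetic; treating the Ha-value as this product is an assumption inferred from the listed values, not a definition taken from the source.

```python
# Values listed for comGA of Bacillus subtilis subsp. subtilis str. 168.
identity_pct = 51.594
coverage_pct = 96.91

# Assumption: Ha-value = identity fraction * coverage fraction.
product_value = (identity_pct / 100) * (coverage_pct / 100)

print(round(product_value, 3))
```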