Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   CA592_RS00655 Genome accession   NZ_CP021838
Coordinates   116885..117955 (+) Length   356 a.a.
NCBI ID   WP_004889540.1    Uniprot ID   A0A178T503
Organism   Anoxybacillus flavithermus strain 52-1A     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 111885..122955
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CA592_RS00625 (CA592_00625) - 112428..113297 (+) 870 WP_230456172.1 M14 family metallocarboxypeptidase -
  CA592_RS00630 (CA592_00630) - 113324..113509 (-) 186 WP_004889534.1 DUF2759 domain-containing protein -
  CA592_RS00635 (CA592_00635) - 113642..114274 (+) 633 WP_004889535.1 MBL fold metallo-hydrolase -
  CA592_RS00640 (CA592_00640) - 114389..115492 (+) 1104 WP_088223205.1 class I SAM-dependent methyltransferase -
  CA592_RS00645 (CA592_00645) - 115522..115764 (-) 243 WP_004889537.1 DUF2626 domain-containing protein -
  CA592_RS00650 (CA592_00650) - 115834..116532 (-) 699 WP_004889538.1 helix-turn-helix transcriptional regulator -
  CA592_RS00655 (CA592_00655) comGA 116885..117955 (+) 1071 WP_004889540.1 competence type IV pilus ATPase ComGA Machinery gene
  CA592_RS00660 (CA592_00660) comGB 117939..118970 (+) 1032 WP_004889541.1 competence type IV pilus assembly protein ComGB -
  CA592_RS00665 (CA592_00665) comGC 118979..119275 (+) 297 WP_006317959.1 competence type IV pilus major pilin ComGC Machinery gene
  CA592_RS00670 (CA592_00670) comGD 119268..119705 (+) 438 WP_004889545.1 competence type IV pilus minor pilin ComGD -
  CA592_RS00675 (CA592_00675) comGE 119689..119997 (+) 309 WP_004889546.1 competence type IV pilus minor pilin ComGE -
  CA592_RS00680 (CA592_00680) comGF 119994..120428 (+) 435 WP_004889548.1 competence type IV pilus minor pilin ComGF -
  CA592_RS00685 (CA592_00685) comGG 120388..120804 (+) 417 WP_035018541.1 competence type IV pilus minor pilin ComGG -
  CA592_RS00690 (CA592_00690) - 120819..121325 (+) 507 WP_004889550.1 shikimate kinase -
  CA592_RS00695 (CA592_00695) - 121352..121540 (+) 189 WP_004889551.1 YqzE family protein -
  CA592_RS00700 (CA592_00700) - 121555..122325 (-) 771 WP_004889552.1 YqhG family protein -

Sequence


Protein


Download         Length: 356 a.a.        Molecular weight: 39965.72 Da        Isoelectric Point: 8.4735

>NTDB_id=233683 CA592_RS00655 WP_004889540.1 116885..117955(+) (comGA) [Anoxybacillus flavithermus strain 52-1A]
MQTIEQLADRLVEDAYILQASDIHIVPRKQDALVQFRLGGMLVTKHVLSKQMCERLLAHFKFLADMDIGERRRPQSGAME
MNVAHTTVHLRLSTLPTIYDESLVIRLLPLHSSLPLKQLALFPSTIKKLLALLNYSHGLMIFTGPTGSGKTTTLYSLLNV
CRHHFQRNVITLEDPVEKRAEDVLQVQVNEKAGITYAAGLKAILRHDPDVIMVGEIRDAETARIAVRAALTGHLVLTTMH
TKDAVGAIYRLLEFGVPFQEMAQTLVAVTAQRLVQLKCPFCEAECSPFCRQYRPVRRVGVYELLYGSELAQAMRAAKGEQ
ATYVYTRLKDVIKKGIALGFLHEHVIESWLLDEATS

Nucleotide


Download         Length: 1071 bp        

>NTDB_id=233683 CA592_RS00655 WP_004889540.1 116885..117955(+) (comGA) [Anoxybacillus flavithermus strain 52-1A]
TTGCAAACAATCGAGCAATTGGCGGACCGACTCGTTGAGGATGCGTATATTCTTCAAGCTTCTGACATTCATATCGTTCC
GCGTAAGCAAGATGCGCTTGTTCAATTTCGGTTAGGAGGGATGCTTGTTACAAAGCATGTGCTATCTAAACAAATGTGTG
AACGTTTGCTTGCTCATTTCAAGTTTCTCGCTGATATGGATATTGGTGAACGTCGCCGCCCGCAAAGCGGAGCAATGGAA
ATGAACGTTGCCCACACAACTGTCCATCTTCGTTTATCGACATTGCCTACGATTTACGACGAAAGCTTAGTTATCCGCTT
GCTCCCGTTACATTCATCTCTTCCGCTCAAGCAACTCGCATTATTTCCTTCAACGATAAAAAAGTTGCTCGCGCTATTAA
ACTATTCCCATGGGCTTATGATTTTCACAGGACCAACAGGATCGGGAAAGACAACAACGTTATACAGTTTGTTAAACGTA
TGTCGCCATCATTTTCAACGAAATGTCATTACGTTAGAAGATCCGGTTGAAAAGCGGGCGGAAGATGTGTTGCAAGTACA
AGTAAACGAAAAAGCTGGCATTACGTATGCTGCAGGATTAAAAGCGATTTTGCGACACGATCCCGATGTGATTATGGTAG
GAGAAATTCGTGATGCCGAAACGGCACGAATAGCTGTACGCGCAGCGCTTACAGGACATCTTGTGCTAACAACGATGCAT
ACGAAGGATGCGGTTGGAGCGATTTATCGTCTTCTTGAGTTTGGTGTACCATTTCAAGAAATGGCGCAAACGTTAGTCGC
TGTGACAGCCCAACGTCTTGTGCAATTGAAATGTCCGTTTTGTGAAGCGGAGTGCTCACCTTTTTGCCGACAGTATCGAC
CTGTTCGTCGAGTCGGTGTTTATGAATTATTATACGGAAGTGAACTGGCACAAGCGATGCGTGCAGCGAAAGGGGAGCAA
GCGACATACGTGTATACTCGGTTAAAAGATGTGATTAAAAAAGGCATTGCGCTCGGTTTTTTACACGAGCATGTCATTGA
AAGTTGGTTGTTAGATGAAGCGACGTCGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A178T503

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

59.605

99.438

0.593

  pilB Glaesserella parasuis strain SC1401

40.51

99.157

0.402

  pilB Vibrio cholerae strain A1552

37.607

98.596

0.371


Multiple sequence alignment