Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   ACHZK0_RS20625 Genome accession   NZ_CP172246
Coordinates   4018190..4019233 (-) Length   347 a.a.
NCBI ID   WP_395760848.1    Uniprot ID   -
Organism   Bacillus sp. 3G2     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 4013190..4024233
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACHZK0_RS20580 (ACHZK0_20580) - 4013618..4013818 (-) 201 WP_000106080.1 YqzE family protein -
  ACHZK0_RS20585 (ACHZK0_20585) - 4013857..4014354 (-) 498 WP_395760844.1 shikimate kinase -
  ACHZK0_RS20590 (ACHZK0_20590) - 4014473..4015123 (-) 651 WP_395760845.1 2OG-Fe(II) oxygenase -
  ACHZK0_RS20595 (ACHZK0_20595) comGG 4015300..4015671 (-) 372 WP_395760846.1 competence type IV pilus minor pilin ComGG -
  ACHZK0_RS20600 (ACHZK0_20600) comGF 4015668..4016138 (-) 471 WP_270605880.1 competence type IV pilus minor pilin ComGF -
  ACHZK0_RS20605 (ACHZK0_20605) comGE 4016108..4016410 (-) 303 WP_238955220.1 competence type IV pilus minor pilin ComGE -
  ACHZK0_RS20610 (ACHZK0_20610) comGD 4016403..4016858 (-) 456 WP_262743203.1 comG operon protein ComGD -
  ACHZK0_RS20615 (ACHZK0_20615) comGC 4016855..4017154 (-) 300 WP_061677382.1 competence type IV pilus major pilin ComGC -
  ACHZK0_RS20620 (ACHZK0_20620) comGB 4017166..4018197 (-) 1032 WP_395760847.1 competence type IV pilus assembly protein ComGB -
  ACHZK0_RS20625 (ACHZK0_20625) comGA 4018190..4019233 (-) 1044 WP_395760848.1 competence protein ComGA Machinery gene
  ACHZK0_RS20630 (ACHZK0_20630) - 4019439..4020134 (+) 696 WP_395760849.1 helix-turn-helix transcriptional regulator -
  ACHZK0_RS20635 (ACHZK0_20635) - 4020261..4020503 (+) 243 WP_000440719.1 DUF2626 domain-containing protein -
  ACHZK0_RS20640 (ACHZK0_20640) - 4020611..4022005 (+) 1395 WP_001094339.1 L-cystine transporter -
  ACHZK0_RS20645 (ACHZK0_20645) - 4022205..4022420 (-) 216 WP_001008329.1 DUF3912 family protein -
  ACHZK0_RS20650 (ACHZK0_20650) - 4022674..4022985 (+) 312 WP_001093241.1 hypothetical protein -
  ACHZK0_RS20655 (ACHZK0_20655) - 4023022..4023510 (-) 489 WP_270339949.1 hypothetical protein -
  ACHZK0_RS20660 (ACHZK0_20660) - 4023590..4023964 (-) 375 WP_262740275.1 nucleoside 2-deoxyribosyltransferase -

Sequence


Protein


Download         Length: 347 a.a.        Molecular weight: 39323.86 Da        Isoelectric Point: 9.3643

>NTDB_id=1065909 ACHZK0_RS20625 WP_395760848.1 4018190..4019233(-) (comGA) [Bacillus sp. 3G2]
MNGIESFANTILKEACRVQASDLHIVPRKKDVAVQLRIGKDLMTRHCIEKKFGEKLVSHFKFLASMDIGERRKPQNGSLY
LQMDGQEVYLRLSTLPTVYQESLVIRLHLQASIQPLSHLSLFPSTAKKLLSFLHYSHGLLVFTGPTGSGKTTTMYALLEV
IRKKKTRRIVTLEDPVEKRNDDVLQIQINEKAGITYEAGLKAILRHDPDIILVGEIRDEETAKIAIRASLTGHLVMTTLH
TNDAKGAILRFMDFGITRQEIEQSLLAIAAQRLVELKCPFCKRKCSTLCKSMRQVRQASIYELLYGYELKQAIKEANGEC
VTYKHETLESSIRKGYALGFLEEDVYV

Nucleotide


Download         Length: 1044 bp        

>NTDB_id=1065909 ACHZK0_RS20625 WP_395760848.1 4018190..4019233(-) (comGA) [Bacillus sp. 3G2]
ATGAATGGAATTGAAAGTTTTGCGAATACGATTTTGAAAGAGGCGTGTAGGGTACAAGCTTCGGATTTACATATTGTGCC
CCGGAAGAAGGATGTAGCGGTTCAACTGCGTATAGGAAAAGATTTAATGACGAGACACTGTATCGAAAAAAAATTTGGAG
AAAAACTTGTTTCACACTTTAAATTTTTAGCATCCATGGATATAGGGGAGAGGCGGAAGCCACAAAATGGTTCACTGTAT
TTACAAATGGATGGGCAAGAAGTATATTTACGCCTTTCCACGCTACCAACTGTATATCAAGAAAGTCTCGTTATTCGTCT
TCATTTACAAGCATCTATTCAGCCGTTATCTCATCTTTCGTTATTTCCAAGTACAGCGAAGAAACTACTCTCTTTTTTAC
ATTATTCACATGGGTTACTCGTATTTACTGGACCAACTGGTTCTGGAAAGACAACAACAATGTATGCATTGTTAGAGGTA
ATTAGAAAAAAGAAAACACGTCGCATCGTTACACTGGAGGATCCAGTTGAAAAAAGAAATGACGATGTATTACAAATTCA
AATAAATGAAAAAGCAGGTATCACATATGAGGCGGGATTAAAGGCTATTTTACGTCATGATCCGGATATTATTTTAGTCG
GTGAAATTCGTGATGAAGAAACAGCGAAAATTGCTATAAGAGCAAGTTTGACTGGACATTTAGTAATGACGACACTGCAT
ACGAATGATGCGAAAGGGGCGATACTCAGGTTCATGGATTTTGGTATAACGAGGCAAGAAATTGAACAATCTTTATTGGC
TATAGCTGCACAACGACTTGTCGAATTGAAGTGTCCGTTTTGCAAAAGAAAGTGCTCAACTTTATGTAAATCAATGAGGC
AAGTAAGACAAGCGAGTATTTATGAGTTGTTATATGGATATGAGTTAAAACAAGCAATTAAGGAAGCGAATGGGGAATGT
GTCACATACAAGCACGAAACATTAGAATCTTCAATACGAAAAGGATACGCTTTAGGATTTTTAGAAGAGGATGTGTATGT
TTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

57.637

100

0.576

  pilB Vibrio campbellii strain DS40M4

35.411

100

0.36


Multiple sequence alignment