Detailed information    

In silico: bioinformatically predicted

Overview


Name: comGA
Type: Machinery gene
Locus tag: P5638_RS10670
Genome accession: NZ_CP120599
Coordinates: 2105825..2106895 (+)
Length: 356 a.a.
NCBI ID: WP_025092972.1
UniProt ID: A0A5C0WKU5
Organism: Bacillus safensis strain PRO114
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
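
The coordinates and the protein length in this overview agree with each other if the coordinates are read as 1-based and inclusive and the span includes the stop codon. Both conventions are assumptions here, not something the record states. A minimal Python sketch of that arithmetic, with hypothetical helper names that are not part of the database:

```python
def span_length_bp(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length_aa(cds_bp: int) -> int:
    """Residues encoded by a CDS whose span includes the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


cds_bp = span_length_bp(2105825, 2106895)
print(cds_bp, protein_length_aa(cds_bp))  # 1071 356
```

The 1071 bp span matches the nucleotide record length, and 1071 / 3 - 1 gives the 356 residues listed for the protein.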

Genomic Context


Location: 2100825..2111895
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
P5638_RS10630 (P5638_10630) | - | 2101026..2102279 (+) | 1254 | WP_277721852.1 | MFS transporter | -
P5638_RS10635 (P5638_10635) | - | 2102356..2102670 (+) | 315 | WP_024427984.1 | MTH1187 family thiamine-binding protein | -
P5638_RS10640 (P5638_10640) | - | 2102696..2102869 (-) | 174 | WP_003217333.1 | DUF2759 domain-containing protein | -
P5638_RS10645 (P5638_10645) | - | 2103027..2103665 (+) | 639 | WP_024422981.1 | MBL fold metallo-hydrolase | -
P5638_RS10650 (P5638_10650) | - | 2103713..2103955 (-) | 243 | WP_003217440.1 | DUF2626 domain-containing protein | -
P5638_RS10655 (P5638_10655) | - | 2104168..2104548 (+) | 381 | WP_007501232.1 | Spx/MgsR family RNA polymerase-binding regulatory protein | -
P5638_RS10665 (P5638_10665) | - | 2104772..2105647 (+) | 876 | WP_024422979.1 | STAS domain-containing protein | -
P5638_RS10670 (P5638_10670) | comGA | 2105825..2106895 (+) | 1071 | WP_025092972.1 | competence type IV pilus ATPase ComGA | Machinery gene
P5638_RS10675 (P5638_10675) | comGB | 2106876..2107916 (+) | 1041 | WP_024427986.1 | competence type IV pilus assembly protein ComGB | -
P5638_RS10680 (P5638_10680) | comGC | 2107933..2108226 (+) | 294 | WP_024422977.1 | competence type IV pilus major pilin ComGC | Machinery gene
P5638_RS10685 (P5638_10685) | comGD | 2108219..2108665 (+) | 447 | WP_034621605.1 | competence type IV pilus minor pilin ComGD | -
P5638_RS10690 (P5638_10690) | comGE | 2108649..2108963 (+) | 315 | WP_034621606.1 | competence type IV pilus minor pilin ComGE | -
P5638_RS10695 (P5638_10695) | comGF | 2108950..2109402 (+) | 453 | WP_277721854.1 | competence type IV pilus minor pilin ComGF | -
P5638_RS10700 (P5638_10700) | comGG | 2109399..2109782 (+) | 384 | WP_277721855.1 | competence type IV pilus minor pilin ComGG | -
P5638_RS10705 (P5638_10705) | - | 2109842..2110036 (+) | 195 | WP_056767455.1 | YqzE family protein | -
P5638_RS10710 (P5638_10710) | - | 2110071..2110349 (-) | 279 | WP_171464609.1 | DUF3889 domain-containing protein | -
P5638_RS10715 (P5638_10715) | tapA | 2110649..2111176 (+) | 528 | WP_024422972.1 | amyloid fiber anchoring/assembly protein TapA | -
P5638_RS10720 (P5638_10720) | - | 2111228..2111800 (+) | 573 | WP_024422971.1 | signal peptidase I | -
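
The listed coordinates show how tightly the comGA to comGG genes are packed: several neighbors overlap by a few base pairs. The displayed location range also equals the comGA span padded by 5000 bp on each side. The sketch below reproduces both observations from the coordinates in the table, assuming 1-based inclusive coordinates. The helper names are hypothetical, and the 5000 bp flank is inferred from the numbers rather than stated by the source.

```python
# (gene, start, end) for the comG genes, all on the (+) strand,
# copied from the genomic context table.
COMG_GENES = [
    ("comGA", 2105825, 2106895),
    ("comGB", 2106876, 2107916),
    ("comGC", 2107933, 2108226),
    ("comGD", 2108219, 2108665),
    ("comGE", 2108649, 2108963),
    ("comGF", 2108950, 2109402),
    ("comGG", 2109399, 2109782),
]


def gap_bp(upstream_end: int, downstream_start: int) -> int:
    """Bases between two features; a negative value means an overlap."""
    return downstream_start - upstream_end - 1


def neighbor_gaps(genes):
    """Gap (or negative overlap) between each consecutive pair of genes."""
    return [gap_bp(a[2], b[1]) for a, b in zip(genes, genes[1:])]


def padded_window(start: int, end: int, flank: int = 5000):
    """Coordinate range extended by `flank` bp on both sides."""
    return start - flank, end + flank


for (up, _, _), (down, _, _), gap in zip(
    COMG_GENES, COMG_GENES[1:], neighbor_gaps(COMG_GENES)
):
    kind = "overlap" if gap < 0 else "gap"
    print(f"{up} -> {down}: {kind} of {abs(gap)} bp")

print(padded_window(2105825, 2106895))  # (2100825, 2111895)
```

Only comGB and comGC are separated by a gap (16 bp). Every other adjacent pair overlaps, by between 4 and 20 bp.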

Sequence


Protein


Length: 356 a.a.; Molecular weight: 40412.86 Da; Isoelectric point: 9.5360

>NTDB_id=806509 P5638_RS10670 WP_025092972.1 2105825..2106895(+) (comGA) [Bacillus safensis strain PRO114]
MYGIEYLRQELLEEACRMRASDVHIVPKEKEASVSFRVDSDLIQQRTIDKKSGERLIAHFKFLSSMDIGEKRRPQNGSLA
VMLRKGQVFIRMSTLPTVNDESLVIRILPQDHVPKIKHLSLFPKASARLLSFLNHSHGLILFTGPTNSGKTTTLYSLIQF
AKKNFNRNIITLEDPVETRNEEVLQVQVNEKAGITYAAGLRAILRHDPDVIVLGEIRDAETARTAIRAALTGHLVLSTLH
AKNAKGALYRMLEFGVTMNELEQTMVAIAAQRLIELTCPFCGERCQLYCKLNRPIRRTNVFELLFGKELGECIKEAKGEY
AHSSYETLQRLIRKGVALGYLSKNTYHRWVYEEASL

Nucleotide


Length: 1071 bp

>NTDB_id=806509 P5638_RS10670 WP_025092972.1 2105825..2106895(+) (comGA) [Bacillus safensis strain PRO114]
TTGTATGGGATTGAATATTTAAGACAAGAGCTGCTAGAAGAAGCATGCCGCATGAGGGCTTCTGATGTGCATATTGTGCC
AAAAGAAAAGGAAGCTTCGGTGTCATTTCGTGTTGATTCAGATTTAATCCAGCAGCGAACCATTGATAAAAAAAGCGGAG
AGCGGCTTATTGCACATTTTAAGTTTTTATCTTCTATGGATATCGGGGAAAAAAGGAGGCCGCAAAATGGCTCACTAGCT
GTGATGCTAAGAAAAGGTCAAGTATTTATTCGGATGTCTACCTTGCCGACAGTGAATGATGAGAGCTTAGTGATTAGAAT
TTTACCGCAGGATCATGTTCCGAAAATAAAACATCTATCATTATTTCCGAAAGCCTCAGCCAGATTATTATCCTTTTTAA
ACCACTCGCATGGACTCATTTTATTTACTGGGCCAACAAATTCAGGGAAAACAACAACCCTCTATTCACTGATCCAGTTT
GCAAAAAAGAATTTTAACCGAAATATTATTACCCTTGAAGATCCTGTCGAAACAAGGAATGAAGAAGTGCTGCAGGTTCA
AGTAAATGAAAAAGCCGGCATCACATATGCCGCAGGATTACGCGCTATTTTAAGACATGATCCAGATGTGATTGTGCTAG
GAGAAATAAGAGATGCTGAAACGGCCAGAACGGCGATTAGAGCAGCACTGACAGGCCATTTAGTATTGAGTACGCTCCAT
GCAAAAAATGCAAAAGGAGCCCTTTACCGCATGCTTGAATTTGGTGTCACGATGAATGAACTTGAGCAAACGATGGTTGC
GATTGCTGCGCAGCGATTAATCGAGCTCACCTGTCCTTTTTGCGGAGAAAGATGTCAGCTTTACTGTAAATTAAATCGAC
CGATCAGACGAACAAATGTATTTGAACTGCTGTTCGGAAAAGAGCTTGGTGAGTGTATCAAGGAGGCTAAAGGAGAATAT
GCTCACTCGTCATATGAAACACTGCAAAGATTAATTCGTAAAGGAGTGGCACTCGGCTATTTATCAAAAAACACTTATCA
TCGCTGGGTTTATGAAGAAGCGAGTCTCTAA
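
The nucleotide record begins with TTG rather than ATG, while the protein record begins with M. That is expected for a bacterial coding sequence: TTG is one of the alternative start codons in NCBI translation table 11, and an initiating codon is read as methionine. The sketch below illustrates this on the first and last codons of the record. The function name and the reduced start-codon set are assumptions made for this example, not the database's own pipeline.

```python
BASES = "TCAG"
# Standard genetic code in TCAG order; table 11 uses the same codon
# assignments and differs only in the start codons it allows.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# The most common bacterial start codons; table 11 lists a few more.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame sequence, stopping at the first stop codon.

    If the first codon is a recognized start codon it is read as Met.
    """
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for idx in range(0, len(dna), 3):
        codon = dna[idx:idx + 3]
        aa = CODON_TABLE[codon]
        if idx == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate_cds("TTGTATGGGATTGAATAT"))  # MYGIEY (first six codons)
print(translate_cds("GAAGCGAGTCTCTAA"))     # EASL (last codons, stop dropped)
```

Both fragments match the corresponding ends of the protein record, which starts with MYGIEY and ends with EASL.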


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source | ID
AlphaFold DB | A0A5C0WKU5

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comGA | Bacillus subtilis subsp. subtilis str. 168 | 64.689 | 99.438 | 0.643
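
This page does not define the Ha-value. The listed figure is numerically consistent with the product of the identity and coverage expressed as fractions. Reading it that way is an assumption made for this sketch, not a documented definition, and the function name is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: fractional identity times fractional coverage."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


print(round(ha_value(64.689, 99.438), 3))  # 0.643
```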