Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comGA
Type: Machinery gene
Locus tag: ETK61_RS13650
Genome accession: NZ_CP035406
Coordinates: 2505694..2506764 (-)
Length: 356 a.a.
NCBI ID: WP_024573393.1
UniProt ID: A0A165AJM7
Organism: Bacillus subtilis strain SRCM103612
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
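
The coordinates, nucleotide length, and protein length quoted in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual convention for `start..end` ranges) and that the final codon is a stop codon. The 5000 bp flank is only an observation that matches the Genomic Context window in this record, not a documented rule of the database.

```python
# Coordinates quoted in the record (assumed 1-based, inclusive).
start, end = 2505694, 2506764

gene_length_bp = end - start + 1      # inclusive range
codons = gene_length_bp // 3          # includes the terminal stop codon
protein_length_aa = codons - 1        # the stop codon is not translated

print(gene_length_bp, protein_length_aa)  # 1071 356

# Assumption: the Genomic Context window is the gene plus 5000 bp per side.
flank = 5000
window = (start - flank, end + flank)
print(window)  # (2500694, 2511764)
```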

Genomic Context


Location: 2500694..2511764
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ETK61_RS13605 (ETK61_13610) tapA 2501179..2501934 (-) 756 WP_129137785.1 amyloid fiber anchoring/assembly protein TapA -
  ETK61_RS13610 (ETK61_13615) yqzG 2502206..2502532 (+) 327 WP_003246051.1 YqzG/YhdC family protein -
  ETK61_RS13615 (ETK61_13620) spoIITA 2502574..2502753 (-) 180 WP_003230176.1 YqzE family protein -
  ETK61_RS13620 (ETK61_13625) comGG 2502824..2503198 (-) 375 WP_029726722.1 ComG operon protein ComGG Machinery gene
  ETK61_RS13625 (ETK61_13630) comGF 2503199..2503582 (-) 384 WP_029317913.1 ComG operon protein ComGF Machinery gene
  ETK61_RS13630 (ETK61_13635) comGE 2503608..2503955 (-) 348 WP_014480255.1 ComG operon protein 5 Machinery gene
  ETK61_RS13635 (ETK61_13640) comGD 2503939..2504370 (-) 432 WP_014480256.1 comG operon protein ComGD Machinery gene
  ETK61_RS13640 (ETK61_13645) comGC 2504360..2504656 (-) 297 WP_003230162.1 comG operon protein ComGC Machinery gene
  ETK61_RS13645 (ETK61_13650) comGB 2504670..2505707 (-) 1038 WP_129137786.1 comG operon protein ComGB Machinery gene
  ETK61_RS13650 (ETK61_13655) comGA 2505694..2506764 (-) 1071 WP_024573393.1 competence protein ComGA Machinery gene
  ETK61_RS13660 (ETK61_13665) corA 2507177..2508130 (-) 954 WP_017696189.1 magnesium transporter CorA -
  ETK61_RS13665 (ETK61_13670) yqhB 2508272..2509600 (+) 1329 WP_129137787.1 hemolysin family protein -
  ETK61_RS13670 (ETK61_13675) rsbRD 2509653..2510489 (-) 837 WP_017696187.1 RsbT co-antagonist protein RsbRD -
  ETK61_RS13680 (ETK61_13685) mgsR 2510713..2511093 (-) 381 WP_003230147.1 transcriptional regulator MgsR -
  ETK61_RS13685 (ETK61_13690) yqgY 2511325..2511570 (+) 246 WP_003226300.1 DUF2626 domain-containing protein -

Sequence


Protein


Length: 356 a.a.   Molecular weight: 40472.67 Da   Isoelectric point: 9.0175

>NTDB_id=340569 ETK61_RS13650 WP_024573393.1 2505694..2506764(-) (comGA) [Bacillus subtilis strain SRCM103612]
MDSIEKLSKNLIEEAYLTKASDIHIVPRERDAIIHFRVDHALLKKRDMKKEECVRLISHFKFLSAMDIGERRKPQNGSLT
LKLKEGNVHLRMSTLPTINEESLVIRVMPQYNIPSIDKLSLFPKTGATLLSFLKHSHGMLIFTGPTGSGKTTTLYSLVQY
AKKHFNRNIVTLEDPVETRDEDVLQVQVNEKAGVTYSAGLKAILRHDPDMIILGEIRDAETAEIAVRAAMTGHLVLTSLH
TRDAKGAIYRLLEFGINMNEIEQTVIAIAAQRLVDLACPFCENGCSSVYCRQSRNTRRASVYELLYGKNLQQCIQEAKGN
HANYQYQTLRQIIRKGIALGYLTTNNYDRWVYHEKD

Nucleotide


Length: 1071 bp

>NTDB_id=340569 ETK61_RS13650 WP_024573393.1 2505694..2506764(-) (comGA) [Bacillus subtilis strain SRCM103612]
TTGGATTCAATAGAAAAGTTAAGCAAAAACTTGATTGAAGAGGCATATCTAACAAAGGCTTCTGATATTCACATCGTGCC
GAGGGAGCGGGACGCTATCATTCATTTTCGGGTCGATCATGCCTTGCTGAAAAAAAGGGACATGAAAAAAGAAGAGTGCG
TAAGGCTGATTTCACATTTTAAATTTCTTTCAGCAATGGATATAGGTGAAAGGCGAAAGCCGCAAAACGGTTCGCTTACG
TTAAAGCTGAAAGAGGGAAATGTTCATTTAAGAATGTCAACACTGCCCACAATTAATGAAGAAAGCCTCGTGATCAGAGT
GATGCCCCAATACAATATCCCTTCGATTGATAAATTGTCGCTATTTCCCAAGACAGGAGCCACATTACTCTCGTTTTTAA
AACATTCCCATGGCATGCTCATTTTTACCGGGCCGACTGGTTCAGGGAAGACTACCACATTATACTCTCTCGTTCAATAT
GCAAAAAAACACTTTAATCGAAATATTGTCACATTAGAGGACCCTGTTGAAACAAGGGACGAAGATGTTCTTCAGGTTCA
GGTGAATGAAAAAGCCGGTGTAACTTATTCCGCAGGTCTGAAAGCAATTTTGCGCCATGACCCCGATATGATTATTTTAG
GTGAGATCAGAGACGCGGAAACAGCTGAAATTGCGGTGCGGGCAGCGATGACGGGACATCTGGTACTAACGAGCCTTCAT
ACGAGAGACGCAAAGGGCGCAATTTACAGACTGCTTGAATTCGGTATCAATATGAATGAAATTGAACAGACTGTCATTGC
AATAGCGGCTCAGCGCTTGGTTGATTTGGCTTGCCCGTTTTGTGAAAACGGATGTTCATCAGTGTATTGCCGACAGTCAC
GAAATACCAGGAGAGCTAGCGTTTATGAGCTTCTTTACGGGAAAAATCTTCAGCAATGTATCCAGGAGGCAAAAGGAAAT
CATGCAAATTACCAATATCAAACGCTTCGTCAAATTATCAGGAAAGGAATTGCGCTCGGCTATTTAACGACAAACAACTA
TGACCGGTGGGTTTATCATGAAAAAGATTAG
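
The nucleotide sequence starts with TTG rather than ATG, while the protein starts with M. In bacteria, the alternative start codons TTG and GTG are read as methionine at translation initiation. The sketch below illustrates this on the first 30 and last 15 nucleotides of the record. The `translate` helper and its `is_start` flag are hypothetical names introduced for illustration, and the codon table is the standard one, built in TCAG order.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

START_CODONS = {"ATG", "TTG", "GTG"}  # common bacterial initiators


def translate(cds, is_start=False):
    """Translate a frame-aligned fragment; hypothetical helper.

    If is_start is True and the first codon is a bacterial start codon,
    it is read as methionine regardless of its usual meaning.
    """
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if is_start and codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


head = "TTGGATTCAATAGAAAAGTTAAGCAAAAAC"  # first 30 nt of the record
tail = "CATGAAAAAGATTAG"                  # last 15 nt of the record

print(translate(head, is_start=True))  # MDSIEKLSKN
print(translate(tail))                 # HEKD*
```

Both fragments agree with the protein sequence listed above, which begins with MDSIEKLSKN and ends with HEKD.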


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A165AJM7

Transmembrane helices


Transmembrane helices of the protein were predicted with TMHMM 2.0 and visualized with seqviz and ECharts.



[Plot: predicted probability of transmembrane helices]


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168 99.719 100 0.997
  pilB Glaesserella parasuis strain SC1401 43.878 82.584 0.362
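
The record does not define the Ha-value. In both rows shown, however, the figure equals the identity fraction multiplied by the coverage fraction, rounded to three decimals. The sketch below checks that reading. The `ha_value` function is a hypothetical helper, and the formula is an inference from these two rows rather than a definition taken from the database.

```python
def ha_value(identity_pct, coverage_pct):
    """Assumed formula: identity fraction times coverage fraction."""
    return (identity_pct / 100) * (coverage_pct / 100)


# (identity %, coverage %) as listed in the table above.
rows = {
    "comGA": (99.719, 100.0),
    "pilB": (43.878, 82.584),
}

for name, (identity, coverage) in rows.items():
    print(name, round(ha_value(identity, coverage), 3))
# comGA 0.997
# pilB 0.362
```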


Multiple sequence alignment