Detailed information    

in silico   Bioinformatically predicted

Overview


Name   comGA
Type   Machinery gene
Locus tag   C2H91_RS18490
Genome accession   NZ_CP026030
Coordinates   3684124..3685194 (+)
Length   356 a.a.
NCBI ID   WP_070547533.1
UniProt ID   -
Organism   Bacillus subtilis strain PK3_9
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
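
The coordinates and the protein length listed above can be checked against each other. The following is a minimal sketch, not part of the record: the helper name `parse_location` is hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def parse_location(loc):
    """Split a 'start..end (strand)' string into (start, end, strand)."""
    span, strand = loc.split()
    start, end = (int(x) for x in span.split(".."))
    return start, end, strand.strip("()")


start, end, strand = parse_location("3684124..3685194 (+)")

# Inclusive coordinates: both end points belong to the gene.
nt_len = end - start + 1

# One codon per residue, minus the stop codon (assumption stated above).
aa_len = nt_len // 3 - 1

print(nt_len, aa_len, strand)
```

Under these assumptions the gene spans 1071 bp and encodes 356 residues, in agreement with the lengths given in the Sequence section.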

Genomic Context


Location: 3679124..3690194
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  C2H91_RS18455 (C2H91_18475) yqgY 3679320..3679565 (-) 246 WP_003226300.1 DUF2626 domain-containing protein -
  C2H91_RS18460 (C2H91_18480) mgsR 3679797..3680177 (+) 381 WP_003230147.1 transcriptional regulator MgsR -
  C2H91_RS18470 (C2H91_18490) rsbRD 3680401..3681237 (+) 837 WP_021480027.1 RsbT co-antagonist protein RsbRD -
  C2H91_RS18475 (C2H91_18495) yqhB 3681290..3682618 (-) 1329 WP_249850937.1 hemolysin family protein -
  C2H91_RS18480 (C2H91_18500) corA 3682761..3683714 (+) 954 WP_072173930.1 magnesium transporter CorA -
  C2H91_RS18485 (C2H91_18505) - 3683716..3683913 (+) 198 WP_249850938.1 hypothetical protein -
  C2H91_RS18490 (C2H91_18510) comGA 3684124..3685194 (+) 1071 WP_070547533.1 competence protein ComGA Machinery gene
  C2H91_RS18495 (C2H91_18515) comGB 3685181..3686218 (+) 1038 WP_086344083.1 comG operon protein ComGB Machinery gene
  C2H91_RS18500 (C2H91_18520) comGC 3686232..3686528 (+) 297 WP_003230162.1 comG operon protein ComGC Machinery gene
  C2H91_RS18505 (C2H91_18525) comGD 3686518..3686949 (+) 432 WP_032726159.1 competence type IV pilus minor pilin ComGD Machinery gene
  C2H91_RS18510 (C2H91_18530) comGE 3686933..3687280 (+) 348 WP_063335293.1 ComG operon protein 5 Machinery gene
  C2H91_RS18515 (C2H91_18535) comGF 3687306..3687689 (+) 384 WP_032726158.1 ComG operon protein ComGF Machinery gene
  C2H91_RS18520 (C2H91_18540) comGG 3687690..3688064 (+) 375 WP_064671036.1 ComG operon protein ComGG Machinery gene
  C2H91_RS18525 (C2H91_18545) spoIIT 3688136..3688315 (+) 180 WP_014480252.1 YqzE family protein -
  C2H91_RS18530 (C2H91_18550) yqzG 3688357..3688683 (-) 327 WP_038829733.1 YqzG/YhdC family protein -
  C2H91_RS18535 (C2H91_18555) tapA 3688954..3689715 (+) 762 WP_077671431.1 amyloid fiber anchoring/assembly protein TapA -
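
The table above can be used to examine how tightly the comG genes are packed. The sketch below is illustrative only: the function names `flanking_window` and `spacing` are hypothetical, and the 5,000 bp flank is an inference from the arithmetic (it reproduces the Location line), not something the record states.

```python
# (gene, start, end) for the comG genes, copied from the table above
COMG = [
    ("comGA", 3684124, 3685194),
    ("comGB", 3685181, 3686218),
    ("comGC", 3686232, 3686528),
    ("comGD", 3686518, 3686949),
    ("comGE", 3686933, 3687280),
    ("comGF", 3687306, 3687689),
    ("comGG", 3687690, 3688064),
]


def flanking_window(start, end, flank):
    """Extend a gene's coordinates by `flank` bases on each side."""
    return start - flank, end + flank


def spacing(genes):
    """Bases between consecutive genes; negative values mean an overlap."""
    return {
        f"{a[0]}-{b[0]}": b[1] - a[2] - 1
        for a, b in zip(genes, genes[1:])
    }


window = flanking_window(3684124, 3685194, flank=5000)
gaps = spacing(COMG)

print(window)
for pair, gap in gaps.items():
    print(pair, gap)
```

With these coordinates, comGA and comGB overlap by 14 bp, and no two adjacent comG genes are separated by more than 25 bp.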

Sequence


Protein


Length: 356 a.a.        Molecular weight: 40444.57 Da        Isoelectric Point: 8.8402

>NTDB_id=265304 C2H91_RS18490 WP_070547533.1 3684124..3685194(+) (comGA) [Bacillus subtilis strain PK3_9]
MDSIENVSKNLIEEAYLTKASDIHIVPRERDAIIHFRVDHALLKKRDMKKEECVRLISHFKFLSAMDIGERRKPQNGSLT
LKLKEGNVHLRMSTLPTINEESLVIRVMPQYNIPSIDKLSLFPKTGATLLSFLKHSHGMLIFTGPTGSGKTTTLYSLVQY
AKKHFNRNIVTLEDPVETRDEDVLQVQVNEKAGVTYSAGLKAILRHDPDMIILGEIRDAETAEIAVRAAMTGHLVLTSLH
TRDAKGAIYRLLEFGINMNEIEQTVIAIAAQRLVDLACPFCENGCSSVYCRQSRNTRRASVYELLYGKNLQQCIQEAKGN
HANYQYQTLRQIIRKGIALGYLTTNNYDRWVYHEKD

Nucleotide


Length: 1071 bp

>NTDB_id=265304 C2H91_RS18490 WP_070547533.1 3684124..3685194(+) (comGA) [Bacillus subtilis strain PK3_9]
TTGGATTCAATAGAAAATGTAAGCAAAAACTTGATTGAAGAAGCATATCTAACAAAGGCTTCTGATATTCACATCGTGCC
GAGGGAGCGGGACGCTATCATTCATTTTCGGGTCGATCATGCCTTGCTGAAAAAAAGGGACATGAAAAAAGAAGAGTGCG
TAAGACTGATTTCACATTTTAAATTTCTTTCAGCAATGGATATAGGTGAAAGGCGAAAGCCGCAAAACGGTTCGCTTACG
TTAAAGCTGAAAGAGGGAAATGTTCATTTAAGAATGTCAACGCTGCCCACAATTAATGAAGAAAGCCTCGTGATCAGAGT
GATGCCCCAATACAATATCCCTTCGATTGATAAATTGTCGCTATTTCCGAAGACAGGAGCCACATTACTCTCGTTTTTAA
AACATTCCCATGGCATGCTCATTTTTACCGGGCCGACTGGTTCAGGGAAGACTACCACATTATACTCTCTCGTTCAATAT
GCAAAAAAACACTTTAATCGAAATATTGTCACATTAGAGGACCCTGTTGAAACAAGGGACGAAGATGTTCTTCAGGTTCA
GGTGAATGAAAAAGCCGGTGTAACTTATTCTGCAGGTCTGAAAGCAATTTTGCGCCATGATCCCGATATGATTATTTTAG
GTGAGATCAGAGACGCGGAAACAGCTGAAATTGCGGTGCGGGCAGCGATGACGGGACATCTGGTACTAACGAGCCTTCAT
ACGAGAGACGCAAAGGGCGCCATTTACAGACTGCTTGAATTCGGTATCAATATGAATGAAATCGAACAGACTGTCATTGC
AATAGCGGCTCAGCGCTTGGTTGATTTGGCTTGCCCGTTTTGTGAAAACGGATGTTCATCAGTGTATTGCCGACAGTCAC
GAAATACCAGGAGAGCTAGCGTTTATGAGCTTCTATACGGGAAAAATCTTCAGCAATGTATCCAGGAGGCAAAAGGAAAT
CATGCAAATTACCAATATCAAACGCTTCGTCAAATTATCAGAAAAGGAATTGCGCTCGGCTATTTAACGACAAACAACTA
TGACCGGTGGGTTTATCATGAAAAAGATTAG
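
The nucleotide sequence begins with TTG rather than ATG, while the protein begins with M. In bacteria TTG can serve as an initiation codon, in which case it is read as methionine. The sketch below shows this on two in-frame excerpts of the sequence above (the first 30 nt and the last 33 nt). It is an illustration under stated assumptions, not the method used to produce the record: the function name `translate`, its `starts_at_initiator` flag, and the `START_CODONS` set (a commonly used subset of bacterial start codons) are hypothetical choices.

```python
from itertools import product

# Standard genetic code in TCAG order (also used for sense codons in bacteria).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a subset of the start codons accepted in bacteria.
START_CODONS = {"ATG", "TTG", "GTG"}


def translate(nt, starts_at_initiator=False):
    """Translate in frame 1 and stop at the first stop codon.

    If `starts_at_initiator` is True and the first codon is a start codon,
    it is read as methionine regardless of its usual meaning.
    """
    nt = nt.upper()
    protein = []
    for i in range(0, len(nt) - len(nt) % 3, 3):
        codon = nt[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and starts_at_initiator and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


five_prime = "TTGGATTCAATAGAAAATGTAAGCAAAAAC"       # first 30 nt
three_prime = "TATGACCGGTGGGTTTATCATGAAAAAGATTAG"   # last 33 nt, ends in TAG

print(translate(five_prime, starts_at_initiator=True))  # MDSIENVSKN
print(translate(five_prime))                            # LDSIENVSKN
print(translate(three_prime))                           # YDRWVYHEKD
```

The first and third outputs match the start and end of the protein sequence listed above; the second shows what the same excerpt yields if TTG is read as an ordinary leucine codon.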


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168 99.719 100 0.997
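
The identity value above is what one would expect from a single mismatch over the full protein. The following sketch makes that arithmetic explicit. It assumes the alignment spans all 356 residues without gaps (suggested by the 100% coverage) and that 355 positions match; the record itself does not state the number of matching positions, and the function name `percent_identity` is hypothetical.

```python
def percent_identity(matches, aligned_length):
    """Percentage of identical positions, rounded to three decimals."""
    return round(100 * matches / aligned_length, 3)


# Assumed: 355 identical residues over a 356-residue alignment.
identity = percent_identity(355, 356)
print(identity)  # 99.719
```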


Multiple sequence alignment