Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   DIC78_RS19060 Genome accession   NZ_CP029364
Coordinates   3697263..3698333 (+) Length   356 a.a.
NCBI ID   WP_127696768.1    Uniprot ID   -
Organism   Bacillus halotolerans strain ZB201702     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3692263..3703333
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DIC78_RS19030 (DIC78_19055) mgsR 3692511..3692891 (+) 381 WP_026014741.1 transcriptional regulator MgsR -
  DIC78_RS19040 (DIC78_19065) - 3693124..3693963 (+) 840 WP_127696766.1 STAS domain-containing protein -
  DIC78_RS19045 (DIC78_19070) - 3694043..3695372 (-) 1330 Protein_3681 hemolysin family protein -
  DIC78_RS19050 (DIC78_19075) - 3695516..3696469 (+) 954 WP_095713277.1 magnesium transporter CorA family protein -
  DIC78_RS19055 (DIC78_19080) - 3696532..3696942 (+) 411 WP_106020097.1 CBS domain-containing protein -
  DIC78_RS19060 (DIC78_19085) comGA 3697263..3698333 (+) 1071 WP_127696768.1 competence protein ComGA Machinery gene
  DIC78_RS19065 (DIC78_19090) comGB 3698320..3699357 (+) 1038 WP_127696769.1 competence type IV pilus assembly protein ComGB Machinery gene
  DIC78_RS19070 (DIC78_19095) comGC 3699371..3699667 (+) 297 WP_010334925.1 competence type IV pilus major pilin ComGC Machinery gene
  DIC78_RS19075 (DIC78_19100) comGD 3699657..3700088 (+) 432 WP_044154579.1 competence type IV pilus minor pilin ComGD Machinery gene
  DIC78_RS19080 (DIC78_19105) comGE 3700072..3700419 (+) 348 WP_127696770.1 competence type IV pilus minor pilin ComGE Machinery gene
  DIC78_RS19085 (DIC78_19110) comGF 3700445..3700828 (+) 384 WP_038954226.1 competence type IV pilus minor pilin ComGF Machinery gene
  DIC78_RS19090 (DIC78_19115) comGG 3700829..3701203 (+) 375 WP_106020094.1 competence type IV pilus minor pilin ComGG Machinery gene
  DIC78_RS19095 (DIC78_19120) - 3701275..3701454 (+) 180 WP_003236949.1 YqzE family protein -
  DIC78_RS19100 (DIC78_19125) - 3701497..3701820 (-) 324 WP_024122040.1 YqzG/YhdC family protein -
  DIC78_RS19105 (DIC78_19130) tapA 3702097..3702858 (+) 762 WP_106020093.1 amyloid fiber anchoring/assembly protein TapA -

Sequence


Protein


Download         Length: 356 a.a.        Molecular weight: 40508.74 Da        Isoelectric Point: 9.0197

>NTDB_id=292003 DIC78_RS19060 WP_127696768.1 3697263..3698333(+) (comGA) [Bacillus halotolerans strain ZB201702]
MDSIEKISKTLIEEAYRTKASDIHIVPRERDAIIHFRVDHALLKKRNMKKEECVRLISHFKFLSAMDIGERRKPQNGSLT
LKLKEGDVHLRMSTLPTINEESLVIRVMPQYNIPSIDKLSLFPKTGATLLSFLKHSHGMLIFTGPTGSGKTTTLYSLIQY
AKKHFNRNIVTLEDPVETRDEDVLQVQVNEKAGVTYSTGLKAILRHDPDMIILGEIRDAETAQIAVRAAMTGHLVLTSLH
TRDAKGAIYRLLEFGINMNEIEQTVIAIAAQRLVDLACPFCEDGCSSVYCRLSRNVRRASVYELLYGKNLQRCIQEAKGD
NANYQYQTLRQIIRKGIALGYLTTNNYDRWVYHETD

Nucleotide


Download         Length: 1071 bp        

>NTDB_id=292003 DIC78_RS19060 WP_127696768.1 3697263..3698333(+) (comGA) [Bacillus halotolerans strain ZB201702]
TTGGATTCGATAGAAAAAATAAGCAAAACTTTAATTGAAGAGGCATATAGGACGAAGGCCTCTGACATTCACATTGTGCC
GAGGGAACGGGACGCAATCATTCATTTTCGGGTTGACCATGCATTGCTGAAAAAGCGAAACATGAAAAAAGAAGAGTGTG
TAAGACTCATTTCTCATTTTAAGTTTCTCTCAGCTATGGACATAGGGGAAAGAAGAAAGCCGCAAAATGGATCTCTTACT
CTAAAATTAAAAGAGGGTGACGTTCACTTAAGGATGTCAACGCTCCCGACAATTAACGAAGAAAGCCTCGTCATCAGAGT
GATGCCTCAGTATAATATCCCTTCGATTGACAAACTGTCTTTATTCCCAAAGACAGGAGCCACCTTGCTATCATTTTTAA
AACACTCCCACGGCATGCTGATTTTTACCGGACCAACAGGCTCTGGGAAAACAACAACATTATATTCTCTGATCCAATAT
GCAAAAAAACATTTTAATCGAAACATTGTCACACTTGAAGATCCAGTGGAAACGAGGGATGAAGATGTTCTTCAAGTGCA
GGTGAATGAAAAAGCCGGTGTCACATATTCCACAGGGCTGAAGGCTATTTTGCGCCATGACCCCGATATGATCATCTTAG
GTGAGATTAGAGACGCTGAAACAGCCCAAATTGCGGTGCGGGCAGCGATGACTGGTCATTTAGTGCTGACGAGCCTTCAT
ACGAGAGATGCGAAGGGCGCGATTTACAGGCTGCTTGAATTTGGCATTAATATGAATGAAATCGAGCAAACGGTTATTGC
GATAGCGGCTCAGCGTTTAGTTGATTTGGCTTGCCCGTTTTGTGAAGACGGATGTTCATCAGTGTATTGCCGCCTGTCAC
GAAATGTCAGGAGAGCAAGCGTATACGAGCTGCTTTACGGGAAAAACCTTCAGCGGTGTATCCAGGAAGCGAAAGGGGAC
AATGCAAATTACCAATATCAAACGCTGCGTCAAATTATAAGAAAAGGGATTGCGCTCGGCTATTTAACGACAAACAACTA
TGACCGGTGGGTTTATCATGAAACAGATTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

95.787

100

0.958

  pilB Glaesserella parasuis strain SC1401

38.44

100

0.388


Multiple sequence alignment