Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   L1879_RS13595 Genome accession   NZ_CP091131
Coordinates   2818307..2819422 (+) Length   371 a.a.
NCBI ID   WP_285958217.1    Uniprot ID   A0AAW7CFK1
Organism   Heyndrickxia coagulans strain TM3     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2813307..2824422
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  L1879_RS13560 - 2813328..2814494 (+) 1167 WP_041819143.1 rhomboid family intramembrane serine protease -
  L1879_RS13565 - 2814803..2815024 (+) 222 WP_013859652.1 YqgQ family protein -
  L1879_RS13570 - 2815017..2815985 (+) 969 WP_061574785.1 ROK family glucokinase -
  L1879_RS13575 - 2816084..2816257 (-) 174 WP_019720590.1 DUF2759 domain-containing protein -
  L1879_RS13580 - 2816387..2817016 (+) 630 WP_013859649.1 MBL fold metallo-hydrolase -
  L1879_RS13585 - 2817086..2817328 (-) 243 WP_014098019.1 DUF2626 domain-containing protein -
  L1879_RS13590 - 2817406..2818113 (-) 708 WP_258921440.1 metalloregulator ArsR/SmtB family transcription factor -
  L1879_RS13595 comGA 2818307..2819422 (+) 1116 WP_285958217.1 competence type IV pilus ATPase ComGA Machinery gene
  L1879_RS13600 comGB 2819424..2820386 (+) 963 WP_258921441.1 competence type IV pilus assembly protein ComGB -
  L1879_RS13605 comGC 2820455..2820766 (+) 312 WP_029142066.1 competence type IV pilus major pilin ComGC -
  L1879_RS13610 comGD 2820763..2821206 (+) 444 WP_029142067.1 competence type IV pilus minor pilin ComGD -
  L1879_RS13615 - 2821193..2821501 (+) 309 WP_029142068.1 type II secretion system protein -
  L1879_RS13620 comGF 2821464..2821946 (+) 483 WP_061566238.1 competence type IV pilus minor pilin ComGF -
  L1879_RS13625 comGG 2821943..2822311 (+) 369 WP_017553559.1 competence type IV pilus minor pilin ComGG -
  L1879_RS13630 - 2822308..2822865 (+) 558 WP_029142071.1 shikimate kinase -
  L1879_RS13635 - 2822852..2823028 (+) 177 WP_064485109.1 YqzE family protein -
  L1879_RS13640 - 2823062..2823853 (-) 792 WP_258921442.1 YqhG family protein -

Sequence


Protein


Download         Length: 371 a.a.        Molecular weight: 41891.43 Da        Isoelectric Point: 9.7653

>NTDB_id=647954 L1879_RS13595 WP_285958217.1 2818307..2819422(+) (comGA) [Heyndrickxia coagulans strain TM3]
MSVEKIAELLIGQAVKNNVTDVHIVPKEKHYHVQFRQYGRLYPHRNLSAKAGERLISHLKFMSSMDISEKRKPQSGSFAI
NVLQQAVSLRISTLPTSLSKESLVIRILPHEEQFQLSQISLFPSSTKKLMALLNHSHGMLIFSGPTGSGKSTTMYTLVEH
CAKRFLRNVITLEDPVEKQSDSFLQVQVNEKAGITYSTGLKAILRHDPDIILVGEIRDAETARIAVRASLTGHLVLTTLH
TRDAKGAIYRLMEFGVSIHEMEQTLLAVSAQRLITLRCPVCGGIRCPHDCPGSRKKRQTAVYELLYGKNLQRVLKEAKGE
QVRYHYPTLKEWLRKGIVLGYISEEEYHRWIAEEEEADKPQTTGALSYPGR

Nucleotide


Download         Length: 1116 bp        

>NTDB_id=647954 L1879_RS13595 WP_285958217.1 2818307..2819422(+) (comGA) [Heyndrickxia coagulans strain TM3]
ATGTCGGTTGAAAAAATCGCGGAACTATTGATTGGACAGGCGGTGAAAAACAATGTGACCGATGTCCATATCGTTCCGAA
AGAGAAGCATTACCATGTCCAGTTCCGGCAGTACGGAAGGCTGTACCCCCACCGCAACCTTTCCGCGAAAGCCGGGGAAA
GGCTGATATCCCATTTAAAATTTATGTCTTCCATGGATATCAGTGAAAAACGAAAGCCCCAAAGCGGGTCGTTTGCCATA
AACGTTTTGCAGCAGGCGGTCTCCTTAAGGATTTCGACGCTGCCAACCTCTCTTTCCAAAGAGAGCCTCGTCATCCGGAT
TTTGCCCCACGAAGAACAATTCCAGCTCAGCCAAATTTCACTTTTTCCATCGAGCACGAAAAAATTAATGGCGCTATTAA
ACCATTCGCACGGCATGCTGATTTTCAGCGGTCCGACCGGAAGCGGCAAGTCGACGACGATGTATACGCTTGTTGAGCAC
TGCGCAAAACGGTTTTTACGGAATGTAATTACACTGGAAGACCCTGTTGAAAAACAAAGCGATTCGTTTTTGCAGGTGCA
GGTGAATGAAAAAGCAGGCATTACTTATAGCACAGGGCTGAAAGCAATTTTGCGCCATGATCCGGATATTATCCTGGTGG
GGGAAATCCGCGATGCCGAAACAGCAAGAATTGCGGTCCGCGCATCGCTCACCGGCCATCTTGTCCTGACGACTTTGCAT
ACGCGTGACGCAAAGGGCGCGATTTACAGGCTGATGGAATTCGGCGTGAGCATCCACGAAATGGAGCAGACTTTGCTCGC
AGTCAGCGCACAGCGGCTGATTACATTGCGTTGTCCGGTATGCGGCGGCATCCGCTGCCCGCATGATTGTCCCGGCAGCC
GGAAAAAGCGGCAAACCGCGGTATATGAGCTGCTGTACGGGAAAAATTTGCAGCGGGTGCTGAAAGAAGCAAAGGGGGAA
CAGGTCCGCTACCATTATCCGACGCTGAAAGAATGGCTCAGAAAGGGGATTGTGCTTGGATACATTTCTGAAGAAGAGTA
TCACCGCTGGATTGCCGAAGAGGAAGAAGCGGATAAGCCTCAAACAACAGGGGCGCTTTCTTACCCGGGTCGGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

54.93

95.687

0.526