Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   G4P54_RS12985 Genome accession   NZ_CP048852
Coordinates   2431960..2433030 (-) Length   356 a.a.
NCBI ID   WP_167872872.1    Uniprot ID   -
Organism   Bacillus tequilensis strain EA-CB0015     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2426960..2438030
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  G4P54_RS12940 (G4P54_12895) tapA 2427441..2428202 (-) 762 WP_167872866.1 amyloid fiber anchoring/assembly protein TapA -
  G4P54_RS12945 (G4P54_12900) - 2428471..2428797 (+) 327 WP_167872867.1 YqzG/YhdC family protein -
  G4P54_RS12950 (G4P54_12905) - 2428839..2429018 (-) 180 WP_024715069.1 YqzE family protein -
  G4P54_RS12955 (G4P54_12910) comGG 2429090..2429464 (-) 375 WP_167872868.1 competence type IV pilus minor pilin ComGG Machinery gene
  G4P54_RS12960 (G4P54_12915) comGF 2429465..2429848 (-) 384 WP_167873878.1 competence type IV pilus minor pilin ComGF Machinery gene
  G4P54_RS12965 (G4P54_12920) comGE 2429874..2430221 (-) 348 WP_167872869.1 competence type IV pilus minor pilin ComGE Machinery gene
  G4P54_RS12970 (G4P54_12925) comGD 2430205..2430636 (-) 432 WP_167872870.1 competence type IV pilus minor pilin ComGD Machinery gene
  G4P54_RS12975 (G4P54_12930) comGC 2430626..2430922 (-) 297 WP_024715064.1 comG operon protein ComGC Machinery gene
  G4P54_RS12980 (G4P54_12935) comGB 2430936..2431973 (-) 1038 WP_167872871.1 competence type IV pilus assembly protein ComGB Machinery gene
  G4P54_RS12985 (G4P54_12940) comGA 2431960..2433030 (-) 1071 WP_167872872.1 competence protein ComGA Machinery gene
  G4P54_RS12990 (G4P54_12945) - 2433231..2433437 (-) 207 WP_240939667.1 hypothetical protein -
  G4P54_RS12995 (G4P54_12950) corA 2433641..2434594 (-) 954 WP_024715061.1 magnesium transporter CorA -
  G4P54_RS13000 (G4P54_12955) - 2434737..2436065 (+) 1329 WP_024715060.1 hemolysin family protein -
  G4P54_RS13005 (G4P54_12960) - 2436114..2436953 (-) 840 WP_024715059.1 STAS domain-containing protein -
  G4P54_RS13015 (G4P54_12970) mgsR 2437177..2437554 (-) 378 WP_032682101.1 transcriptional regulator MgsR -
  G4P54_RS13020 (G4P54_12975) - 2437775..2438020 (+) 246 WP_003226300.1 DUF2626 domain-containing protein -

Sequence


Protein


Download         Length: 356 a.a.        Molecular weight: 40480.65 Da        Isoelectric Point: 9.0264

>NTDB_id=423573 G4P54_RS12985 WP_167872872.1 2431960..2433030(-) (comGA) [Bacillus tequilensis strain EA-CB0015]
MDSIEKISRTLIEEAYITKASDIHIVPRERDAIIHFRVDHALLKKRNMKKEECARLISHFKFLSAMDIGERRKPQNGSLT
LKLEEGHVHLRMSTLPTINEESLVIRVMPQYNIPSIDRLSLFPKTGATLLSFLKHSHGMLIFTGPTGSGKTTTLYALVQY
AKKHFNRNIVTLEDPVETRDEDVLQVQVNEKAGVTYSAGLKAILRHDPDMIILGEIRDAETAKIAVRAAMTGHLVLTSLH
TRDAKGAIYRLLEFGINMNEIEQTLIAIAAQRLVDLACPFCENGCSSVYCRQSRNTRRASVYELLYGKNLQQCIQEAKGN
HANYQYQTLRQIIRKGIALGYLTTNNYDRWVYHETD

Nucleotide


Download         Length: 1071 bp        

>NTDB_id=423573 G4P54_RS12985 WP_167872872.1 2431960..2433030(-) (comGA) [Bacillus tequilensis strain EA-CB0015]
TTGGATTCAATAGAAAAGATAAGCAGAACCTTGATTGAAGAGGCATATATAACGAAGGCTTCTGATATTCACATTGTGCC
GAGGGAGCGTGACGCTATCATTCATTTTCGAGTGGATCATGCCTTGCTGAAAAAAAGGAACATGAAAAAAGAAGAGTGTG
CAAGACTGATTTCCCATTTTAAATTTCTCTCAGCAATGGACATAGGAGAAAGACGAAAACCGCAAAATGGCTCCCTTACG
TTAAAGCTTGAAGAGGGACATGTTCATTTAAGAATGTCAACGCTCCCCACAATTAATGAAGAAAGTCTCGTAATCAGAGT
GATGCCTCAATACAATATCCCTTCAATAGACAGACTGTCACTATTTCCAAAGACAGGAGCCACCTTACTCTCTTTCTTAA
AACATTCCCACGGCATGCTCATTTTTACCGGGCCGACTGGTTCGGGCAAGACCACCACATTATATGCTCTCGTTCAATAT
GCAAAAAAACACTTTAATCGAAATATTGTCACACTGGAGGACCCTGTTGAAACAAGGGACGAGGATGTTCTTCAAGTACA
GGTGAATGAAAAAGCAGGCGTGACCTATTCCGCAGGGCTGAAAGCGATTTTGCGCCATGACCCGGATATGATTATTTTAG
GTGAGATCAGAGACGCGGAAACCGCTAAAATTGCTGTGCGGGCAGCGATGACGGGGCATCTGGTGCTGACGAGCCTTCAT
ACGAGAGATGCGAAGGGCGCGATTTACAGGCTGCTTGAATTCGGCATCAATATGAATGAAATCGAGCAGACTCTCATTGC
AATAGCTGCTCAGCGTTTAGTTGATTTAGCTTGTCCGTTTTGTGAAAACGGTTGTTCATCGGTGTATTGCCGCCAGTCAC
GGAACACCAGGAGAGCAAGTGTTTATGAGCTCCTTTACGGAAAAAATCTTCAGCAATGCATTCAGGAGGCAAAAGGAAAT
CATGCAAATTACCAATATCAAACACTCCGTCAAATTATCCGAAAAGGAATTGCGCTTGGCTACTTGACGACCAACAACTA
TGACCGGTGGGTTTATCATGAAACAGATTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

96.348

100

0.963