Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   D9842_RS12500 Genome accession   NZ_CP033043
Coordinates   2504842..2505909 (-) Length   355 a.a.
NCBI ID   WP_121662826.1    Uniprot ID   -
Organism   Metabacillus litoralis strain Bac94     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2499842..2510909
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  D9842_RS12450 - 2500358..2500636 (-) 279 WP_121662816.1 hypothetical protein -
  D9842_RS12455 - 2500711..2501025 (+) 315 WP_121662817.1 DUF3889 domain-containing protein -
  D9842_RS12460 - 2501160..2501354 (-) 195 WP_121662818.1 YqzE family protein -
  D9842_RS12465 - 2501389..2501904 (-) 516 WP_121662819.1 shikimate kinase -
  D9842_RS12470 comGG 2501946..2502326 (-) 381 WP_121662820.1 competence type IV pilus minor pilin ComGG -
  D9842_RS12475 comGF 2502323..2502826 (-) 504 WP_257536038.1 competence type IV pilus minor pilin ComGF -
  D9842_RS12480 comGE 2502750..2503097 (-) 348 WP_121662822.1 competence type IV pilus minor pilin ComGE -
  D9842_RS12485 comGD 2503063..2503518 (-) 456 WP_162987416.1 competence type IV pilus minor pilin ComGD -
  D9842_RS12490 comGC 2503493..2503807 (-) 315 WP_121662824.1 competence type IV pilus major pilin ComGC Machinery gene
  D9842_RS12495 comGB 2503818..2504852 (-) 1035 WP_162987417.1 competence type IV pilus assembly protein ComGB -
  D9842_RS12500 comGA 2504842..2505909 (-) 1068 WP_121662826.1 competence type IV pilus ATPase ComGA Machinery gene
  D9842_RS12510 - 2506529..2506909 (-) 381 WP_121662827.1 Spx/MgsR family RNA polymerase-binding regulatory protein -
  D9842_RS12515 - 2507252..2507500 (+) 249 WP_098799530.1 DUF2626 domain-containing protein -
  D9842_RS12520 - 2507563..2508195 (-) 633 WP_121662828.1 MBL fold metallo-hydrolase -
  D9842_RS12525 - 2508449..2508622 (+) 174 WP_098799532.1 DUF2759 domain-containing protein -
  D9842_RS12530 - 2508681..2508989 (-) 309 WP_121662829.1 MTH1187 family thiamine-binding protein -
  D9842_RS12535 - 2509004..2510134 (-) 1131 WP_121662830.1 hypothetical protein -

Sequence


Protein


Download         Length: 355 a.a.        Molecular weight: 40504.03 Da        Isoelectric Point: 8.6569

>NTDB_id=320607 D9842_RS12500 WP_121662826.1 2504842..2505909(-) (comGA) [Metabacillus litoralis strain Bac94]
MKPIEQISERIIEEACALRASDIHIIPREQDTLIQFRVDDDLIEKHTMKKEICGRIISHLKFLASMDIGERRRPQNGSLS
LYTTNLNVHLRLSTLPTIHEESLVIRILPQEKIPPITHLSLFPATTNKLLSILNHSHGLMIFTGPTGSGKTTTLYSLIHY
AKRHFNRNIITLEDPVETKSEEVLQVQVNEKAGITYAAGLKAILRHDPDIIMVGEIRDRETAQIAIRASLTGHLVLSTMH
TRDAKGAIYRLMEFDVSLSEIEQTLIAVSAQRLVQLKCPFCEDQCSPFCKKLRQKRRLSIYELLYGKNLTAVVKEAKGEL
ASYQYPTLKDVIKKGIALGYLYPNSYDRWVFRYED

Nucleotide


Download         Length: 1068 bp        

>NTDB_id=320607 D9842_RS12500 WP_121662826.1 2504842..2505909(-) (comGA) [Metabacillus litoralis strain Bac94]
GTGAAACCAATTGAACAAATAAGTGAACGCATTATTGAAGAAGCCTGTGCATTAAGAGCCTCAGACATTCACATTATTCC
TAGAGAACAAGACACGCTCATTCAATTTCGAGTTGATGATGATTTAATTGAAAAGCATACTATGAAAAAAGAAATCTGTG
GTCGTATTATCTCTCATCTAAAATTCTTAGCCTCCATGGACATTGGTGAAAGACGAAGACCACAAAATGGGTCACTGTCA
CTTTATACTACTAATCTGAACGTTCATCTTAGATTATCCACCTTACCTACCATTCACGAAGAAAGTCTTGTGATTAGAAT
TCTGCCACAAGAGAAAATTCCACCAATTACACATTTATCGTTATTCCCTGCCACAACTAACAAGCTGCTCTCTATTTTGA
ATCACTCACATGGTTTGATGATTTTTACAGGCCCTACTGGTTCGGGTAAGACAACGACACTTTATTCTCTCATTCATTAT
GCAAAGCGTCATTTTAACAGAAATATTATTACCTTAGAGGACCCGGTTGAAACGAAAAGTGAGGAAGTATTGCAAGTTCA
AGTTAATGAAAAAGCGGGGATTACCTATGCTGCTGGTTTAAAGGCAATTCTGCGTCATGATCCCGATATTATTATGGTTG
GAGAAATTCGTGATCGAGAAACAGCTCAAATTGCCATTCGCGCCAGTCTAACTGGTCACCTTGTTTTGTCAACCATGCAC
ACAAGAGATGCAAAGGGGGCGATTTATCGATTAATGGAATTTGATGTGAGTTTATCAGAAATTGAACAAACCCTCATTGC
TGTCTCAGCTCAGAGACTAGTTCAACTAAAATGTCCATTCTGTGAAGACCAATGCTCTCCTTTTTGTAAGAAACTAAGAC
AAAAAAGAAGGTTAAGTATTTATGAGCTTTTATATGGGAAAAACTTAACAGCTGTCGTAAAGGAGGCGAAAGGGGAATTA
GCTTCCTATCAATACCCAACATTAAAAGATGTTATTAAAAAAGGAATAGCACTAGGATATCTGTATCCAAATTCTTATGA
CCGTTGGGTGTTTCGTTATGAAGATTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

67.697

100

0.679

  pilB Glaesserella parasuis strain SC1401

38.268

100

0.386

  pilB Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539

37.681

97.183

0.366


Multiple sequence alignment