Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   VSY18_RS21590 Genome accession   NZ_CP142670
Coordinates   4158016..4159059 (-) Length   347 a.a.
NCBI ID   WP_048531150.1    Uniprot ID   A0A8T9Z920
Organism   Bacillus albus strain YK87     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 4153016..4164059
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  VSY18_RS21545 (VSY18_21545) - 4153443..4153643 (-) 201 WP_000106081.1 YqzE family protein -
  VSY18_RS21550 (VSY18_21550) aroK 4153682..4154179 (-) 498 WP_048531135.1 shikimate kinase AroK -
  VSY18_RS21555 (VSY18_21555) - 4154298..4154948 (-) 651 WP_166688051.1 2OG-Fe(II) oxygenase -
  VSY18_RS21560 (VSY18_21560) comGG 4155126..4155497 (-) 372 WP_071757287.1 competence type IV pilus minor pilin ComGG -
  VSY18_RS21565 (VSY18_21565) comGF 4155494..4155964 (-) 471 WP_048531140.1 competence type IV pilus minor pilin ComGF -
  VSY18_RS21570 (VSY18_21570) comGE 4155934..4156236 (-) 303 WP_270734886.1 competence type IV pilus minor pilin ComGE -
  VSY18_RS21575 (VSY18_21575) comGD 4156229..4156684 (-) 456 WP_270734873.1 comG operon protein ComGD -
  VSY18_RS21580 (VSY18_21580) comGC 4156681..4156980 (-) 300 WP_016086289.1 comG operon protein ComGC -
  VSY18_RS21585 (VSY18_21585) comGB 4156992..4158023 (-) 1032 WP_270734872.1 competence type IV pilus assembly protein ComGB -
  VSY18_RS21590 (VSY18_21590) comGA 4158016..4159059 (-) 1044 WP_048531150.1 competence protein ComGA Machinery gene
  VSY18_RS21595 (VSY18_21595) - 4159265..4159960 (+) 696 WP_071757285.1 helix-turn-helix transcriptional regulator -
  VSY18_RS21600 (VSY18_21600) - 4160086..4160328 (+) 243 WP_000440714.1 DUF2626 domain-containing protein -
  VSY18_RS21605 (VSY18_21605) - 4160436..4161830 (+) 1395 WP_001094339.1 L-cystine transporter -
  VSY18_RS21610 (VSY18_21610) - 4161987..4162394 (-) 408 WP_048531153.1 hypothetical protein -
  VSY18_RS21615 (VSY18_21615) - 4162482..4162697 (-) 216 WP_270734868.1 DUF3912 family protein -
  VSY18_RS21620 (VSY18_21620) - 4162950..4163261 (+) 312 WP_048531157.1 hypothetical protein -
  VSY18_RS21625 (VSY18_21625) - 4163298..4163786 (-) 489 WP_270734867.1 hypothetical protein -

Sequence


Protein


Download         Length: 347 a.a.        Molecular weight: 39378.84 Da        Isoelectric Point: 8.8395

>NTDB_id=926336 VSY18_RS21590 WP_048531150.1 4158016..4159059(-) (comGA) [Bacillus albus strain YK87]
MNGIESFANMILKEACRVQASDLHIVPRKKDVAVQLRIGKDLMTKQYIEKEFGEKLVSHFKFLASMDIGERRKPQNGSLY
LQMDGQEVYLRLSTLPTVYQESLVIRLHLQASIQPLSHLSLFPSTAKKLLSFLHYSHGLLVFTGPTGSGKTTTMYALLEV
IRKKKTRRIVTLEDPVEKRNDDVLQIQINEKAGITYEAGLKAILRHDPDIILVGEIRDEETAKIAIRASLTGHLVMTTLH
TNDAKGAILRFMDFGITRQEIEQSLLAIAAQRLVELKCPFCKRKCSTLCKSMRQVRQASIYELLYGYELKQAIKEANGEC
VTYEHETLESSLRKGYALGFLEEDVYV

Nucleotide


Download         Length: 1044 bp        

>NTDB_id=926336 VSY18_RS21590 WP_048531150.1 4158016..4159059(-) (comGA) [Bacillus albus strain YK87]
ATGAATGGAATTGAAAGCTTTGCGAATATGATTTTGAAAGAGGCGTGTAGGGTACAAGCTTCGGATTTACACATCGTGCC
CCGGAAGAAGGATGTTGCGGTTCAACTACGTATAGGAAAAGATTTAATGACGAAACAATACATTGAAAAGGAATTTGGAG
AAAAACTTGTTTCACACTTTAAATTTTTAGCATCCATGGATATAGGGGAGAGGCGGAAGCCACAAAATGGTTCATTGTAT
TTACAAATGGATGGACAAGAAGTGTATTTGCGCCTTTCCACGCTTCCAACCGTATATCAAGAAAGTCTCGTTATTCGTCT
TCATTTACAAGCATCTATTCAGCCGTTATCTCATCTTTCGTTATTTCCAAGTACAGCGAAGAAACTTCTTTCCTTTTTAC
ATTATTCGCATGGGTTACTCGTCTTTACGGGACCGACTGGTTCAGGAAAGACAACAACAATGTATGCATTATTAGAGGTA
ATTAGAAAAAAGAAAACACGTCGCATCGTTACACTGGAAGATCCAGTTGAAAAAAGAAATGACGATGTATTACAAATCCA
AATAAATGAAAAAGCAGGTATCACATATGAGGCGGGACTGAAGGCTATTTTACGCCATGATCCAGATATTATTTTAGTCG
GTGAAATTCGTGATGAAGAAACAGCAAAAATTGCTATAAGAGCAAGTTTGACTGGACATTTAGTAATGACGACATTGCAT
ACGAATGATGCGAAAGGAGCGATACTTCGGTTTATGGACTTTGGCATAACAAGGCAAGAGATTGAACAATCTTTATTGGC
TATAGCTGCACAGCGACTTGTCGAATTGAAGTGTCCGTTTTGCAAAAGAAAGTGCTCAACTTTATGTAAATCAATGAGGC
AGGTAAGACAAGCGAGTATTTATGAGTTGTTATATGGATATGAGTTAAAACAAGCGATTAAAGAAGCAAACGGGGAATGT
GTCACATACGAGCATGAGACATTAGAATCTTCGCTACGAAAAGGATACGCTTTAGGATTTTTAGAAGAGGATGTGTATGT
TTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

57.925

100

0.579

  pilB Vibrio campbellii strain DS40M4

35.977

100

0.366