Detailed information    

In silico: bioinformatically predicted

Overview


Name: comGA
Type: Machinery gene
Locus tag: BS1321_RS21685
Genome accession: NZ_CP017704
Coordinates: 4488944..4490029 (+)
Length: 361 a.a.
NCBI ID: WP_411836501.1
UniProt ID: -
Organism: Peribacillus simplex NBRC 15720 = DSM 1321
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
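
The coordinates and the protein length given in this overview are mutually consistent, which a short calculation can confirm. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of any database tool.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of cds_bp nucleotides."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop else codons


cds_bp = span_length(4488944, 4490029)
print(cds_bp)                  # 1086
print(protein_length(cds_bp))  # 361
```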

Genomic Context


Location: 4483944..4495029
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
BS1321_RS21655 (BS1321_21315)   -   4484673..4484846 (-)   174   WP_034308396.1   DUF2759 domain-containing protein   -
BS1321_RS21660 (BS1321_21320)   -   4485080..4485706 (+)   627   WP_063233059.1   MBL fold metallo-hydrolase   -
BS1321_RS21665 (BS1321_21325)   -   4486071..4487171 (+)   1101   WP_063233176.1   class I SAM-dependent methyltransferase   -
BS1321_RS21670 (BS1321_21330)   -   4487224..4487466 (-)   243   WP_034308393.1   DUF2626 domain-containing protein   -
BS1321_RS21675 (BS1321_21335)   -   4487552..4488253 (-)   702   WP_063233060.1   helix-turn-helix transcriptional regulator   -
BS1321_RS21680 (BS1321_21340)   -   4488469..4488849 (+)   381   WP_063233061.1   Spx/MgsR family RNA polymerase-binding regulatory protein   -
BS1321_RS21685 (BS1321_21345)   comGA   4488944..4490029 (+)   1086   WP_411836501.1   competence type IV pilus ATPase ComGA   Machinery gene
BS1321_RS21690 (BS1321_21350)   comGB   4490013..4491047 (+)   1035   WP_063233062.1   competence type IV pilus assembly protein ComGB   -
BS1321_RS21695 (BS1321_21355)   comGC   4491125..4491436 (+)   312   WP_063233178.1   competence type IV pilus major pilin ComGC   -
BS1321_RS21700 (BS1321_21360)   comGD   4491444..4491887 (+)   444   WP_063233063.1   competence type IV pilus minor pilin ComGD   -
BS1321_RS21705 (BS1321_21365)   -   4491874..4492212 (+)   339   WP_063233064.1   hypothetical protein   -
BS1321_RS21710 (BS1321_21370)   comGF   4492160..4492639 (+)   480   WP_081112903.1   competence type IV pilus minor pilin ComGF   -
BS1321_RS21715 (BS1321_21375)   comGG   4492632..4493018 (+)   387   WP_063233066.1   competence type IV pilus minor pilin ComGG   -
BS1321_RS21720 (BS1321_21380)   -   4493130..4493315 (+)   186   WP_063233067.1   YqzE family protein   -
BS1321_RS21725 (BS1321_21385)   -   4493362..4494159 (-)   798   WP_063233068.1   YqhG family protein   -
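
The Location range matches the comGA coordinates extended by 5,000 bp on each side, and some adjacent genes in the table share a few bases (comGA and comGB, for example). A minimal sketch of both calculations follows, assuming 1-based inclusive coordinates. The helper names are hypothetical, and the 5,000 bp flank is inferred from the numbers above rather than taken from a documented setting.

```python
def flanking_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend a gene's range by `flank` bp on each side (never below position 1)."""
    return max(1, start - flank), end + flank


def overlap_bp(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of positions shared by two 1-based inclusive ranges (0 if disjoint)."""
    lo = max(a[0], b[0])
    hi = min(a[1], b[1])
    return max(0, hi - lo + 1)


comGA = (4488944, 4490029)
comGB = (4490013, 4491047)
comGC = (4491125, 4491436)

print(flanking_window(*comGA))   # (4483944, 4495029)
print(overlap_bp(comGA, comGB))  # 17
print(overlap_bp(comGB, comGC))  # 0
```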

Sequence


Protein


Length: 361 a.a.   Molecular weight: 40259.00 Da   Isoelectric point: 9.2521

>NTDB_id=202071 BS1321_RS21685 WP_411836501.1 4488944..4490029(+) (comGA) [Peribacillus simplex NBRC 15720 = DSM 1321]
MNELISIEKTAEKILTRAVQESASDIHIFFRREGPLIQFRIDNKLVPQETLSFFEAERLIAHLKFLASMDIGEKRRPQSG
AITINLANQVVGLRLSTLPTAHLESLVIRLIPQQNILPLKQLSLFPNTVQKLIALLKHSHGMLIFTGPTGSGKTTTLYSL
LHHAHEMINRNIITLEDPIENVSQKVLQVQINEKAGITYSVGLKAVLRHDPDVIMVGEVRDAETAKIAVRAALTGHLILT
TMHTRDAQGAISRLLEFGVSLLEVEQSLIGVTAQRLVELRCLPCKGECALACKMTARNKRASVYELLYGKSLAEVLRIMG
DEKGKATVSYRQLKDEIGKAVAMGYVDSQEYERLVYHETKK

Nucleotide


Length: 1086 bp

>NTDB_id=202071 BS1321_RS21685 WP_411836501.1 4488944..4490029(+) (comGA) [Peribacillus simplex NBRC 15720 = DSM 1321]
ATGAATGAATTGATATCGATTGAAAAGACCGCAGAAAAAATACTAACCCGTGCAGTACAGGAATCGGCATCGGATATCCA
CATTTTTTTTCGCAGAGAGGGACCTCTCATCCAATTTAGGATAGACAATAAGCTTGTTCCCCAGGAAACATTATCATTCT
TTGAAGCTGAAAGGCTGATCGCTCATTTGAAGTTCCTTGCCTCGATGGATATAGGGGAGAAAAGGAGGCCCCAGAGTGGT
GCCATCACCATCAATTTAGCTAACCAGGTGGTCGGACTCCGCCTTTCCACTTTACCCACTGCCCATCTCGAAAGTTTGGT
CATCCGCTTAATACCCCAACAGAATATCCTTCCTCTCAAACAGTTATCCTTATTTCCAAACACCGTTCAAAAATTGATTG
CACTCCTGAAGCATTCCCATGGCATGCTCATATTTACCGGTCCGACCGGCAGTGGAAAAACCACGACACTTTATTCCTTG
CTTCACCATGCCCATGAAATGATCAATCGAAATATTATTACACTTGAAGATCCCATTGAAAATGTATCCCAAAAGGTATT
GCAAGTCCAAATCAATGAAAAGGCGGGCATTACGTATTCTGTCGGTCTAAAAGCTGTCCTAAGGCATGACCCTGACGTCA
TCATGGTTGGGGAAGTCAGGGATGCAGAAACAGCCAAAATCGCAGTGCGTGCCGCATTAACAGGTCATTTGATACTTACA
ACCATGCATACAAGGGACGCCCAGGGCGCCATCTCCAGGTTACTGGAGTTTGGTGTCAGCTTGCTTGAGGTTGAACAGAG
TTTGATTGGCGTGACAGCACAGCGGCTGGTTGAGTTGCGATGTCTTCCATGTAAAGGGGAGTGTGCTTTAGCTTGCAAAA
TGACTGCCAGGAATAAAAGGGCAAGTGTATATGAATTGCTATATGGAAAAAGCCTGGCTGAGGTTCTCCGGATAATGGGA
GATGAAAAGGGAAAGGCAACGGTCAGCTACCGCCAATTGAAGGATGAAATCGGAAAAGCGGTTGCGATGGGCTATGTAGA
TTCTCAGGAATATGAACGGCTGGTATACCATGAAACCAAAAAGTAA
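
The nucleotide record can be checked against the protein record by translating it codon by codon. The sketch below assumes the standard bacterial genetic code (NCBI translation table 11), which is an assumption about how the protein sequence was derived. The `translate` helper is illustrative, and only the first and last codons of the sequence are shown to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; stop codons are returned as '*'."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 nt and last 18 nt of the comGA nucleotide sequence above.
print(translate("ATGAATGAATTGATATCGATTGAAAAGACC"))  # MNELISIEKT
print(translate("CATGAAACCAAAAAGTAA"))              # HETKK*
```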


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
comGA   Bacillus subtilis subsp. subtilis str. 168   55.493   98.338   0.546
pilB   Haemophilus influenzae 86-028NP   39.08   96.399   0.377
pilB   Haemophilus influenzae Rd KW20   37.931   96.399   0.366
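
The Ha-values listed above agree, to three decimal places, with the product of the identity and coverage fractions. The sketch below reproduces that arithmetic. Treating the Ha-value as identity multiplied by coverage is inferred from these three rows only and is an assumption, not a definition stated on this page; the function name is illustrative.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Product of identity and coverage, both given as percentages."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


hits = [
    ("comGA", "Bacillus subtilis subsp. subtilis str. 168", 55.493, 98.338),
    ("pilB", "Haemophilus influenzae 86-028NP", 39.08, 96.399),
    ("pilB", "Haemophilus influenzae Rd KW20", 37.931, 96.399),
]

for protein, organism, identity, coverage in hits:
    print(protein, organism, ha_value(identity, coverage))
```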


Multiple sequence alignment