Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   FI644_RS17180 Genome accession   NZ_CP041305
Coordinates   3437766..3438836 (+) Length   356 a.a.
NCBI ID   WP_053476999.1    Uniprot ID   -
Organism   Cytobacillus solani strain 5L6     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3432766..3443836
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  FI644_RS17150 - 3432832..3433146 (+) 315 WP_053477004.1 MTH1187 family thiamine-binding protein -
  FI644_RS17155 - 3433548..3433721 (-) 174 WP_075209188.1 DUF2759 domain-containing protein -
  FI644_RS17160 - 3434008..3434643 (+) 636 WP_142108654.1 MBL fold metallo-hydrolase -
  FI644_RS17165 - 3435306..3436367 (+) 1062 WP_142108655.1 class I SAM-dependent methyltransferase -
  FI644_RS17170 - 3436429..3436671 (-) 243 WP_053477001.1 DUF2626 domain-containing protein -
  FI644_RS17175 - 3436763..3437464 (-) 702 WP_142108656.1 metalloregulator ArsR/SmtB family transcription factor -
  FI644_RS17180 comGA 3437766..3438836 (+) 1071 WP_053476999.1 competence type IV pilus ATPase ComGA Machinery gene
  FI644_RS17185 comGB 3438823..3439854 (+) 1032 WP_053476998.1 competence type IV pilus assembly protein ComGB -
  FI644_RS17190 comGC 3439854..3440189 (+) 336 WP_142108657.1 competence type IV pilus major pilin ComGC -
  FI644_RS17195 comGD 3440182..3440622 (+) 441 WP_142108658.1 competence type IV pilus minor pilin ComGD -
  FI644_RS17200 - 3440606..3440938 (+) 333 WP_053476995.1 hypothetical protein -
  FI644_RS17205 comGF 3440892..3441374 (+) 483 WP_246066963.1 competence type IV pilus minor pilin ComGF -
  FI644_RS17210 comGG 3441361..3441744 (+) 384 WP_053476993.1 competence type IV pilus minor pilin ComGG -
  FI644_RS17215 - 3441759..3442262 (+) 504 WP_053476992.1 shikimate kinase -
  FI644_RS17220 - 3442330..3442506 (+) 177 WP_075209182.1 YqzE family protein -
  FI644_RS17225 - 3442544..3443356 (-) 813 WP_053476991.1 YqhG family protein -

Sequence


Protein


Download         Length: 356 a.a.        Molecular weight: 40211.60 Da        Isoelectric Point: 8.8741

>NTDB_id=371170 FI644_RS17180 WP_053476999.1 3437766..3438836(+) (comGA) [Cytobacillus solani strain 5L6]
MNTIEKLAERIIKDAVRNHASDIHIVPRRSDTLIQLRLANKLIPRLYLPKEECDRLISHFKFTASMDIGEKRRPQSGANS
YQIDGQMIGFRFSTLPSSHNESLVIRILPQQEQIPFYQISLFPSMTRKLLALLKHAHGLIIFTGPTGSGKTTTLYSLLNE
TSHMFNRNVITLEDPIEKENDMVLQVQVNEKAGVSYATGLKAILRHDPDIIMVGEIRDAETAKIAVRAALTGHLVLSTMH
TRDARGAIYRLSEFGVDAVEIEQTLVAVTAQRLVELSCPFCKGECSPYCYCYGRLKRASIFELLTGRALNASMKVARGEE
AVIKYRNLKDIIKKGIALGFIKETEYERWVLEHDQT

Nucleotide


Download         Length: 1071 bp        

>NTDB_id=371170 FI644_RS17180 WP_053476999.1 3437766..3438836(+) (comGA) [Cytobacillus solani strain 5L6]
TTGAATACAATTGAAAAGCTTGCAGAAAGAATTATTAAAGATGCTGTCAGAAATCATGCATCAGATATCCACATTGTTCC
CCGCAGAAGCGATACCCTCATCCAATTACGTCTTGCTAACAAACTAATTCCCAGACTCTATTTGCCGAAAGAAGAATGCG
ACAGACTGATTTCACACTTTAAATTTACGGCTTCAATGGATATCGGTGAAAAGAGAAGACCGCAAAGCGGTGCAAACTCC
TATCAAATTGATGGTCAAATGATCGGCTTTCGCTTTTCAACGCTCCCCTCTAGTCATAATGAAAGCCTAGTTATTCGAAT
CCTCCCTCAGCAAGAACAAATTCCTTTCTACCAAATTTCTTTATTCCCGAGTATGACCCGAAAACTACTTGCCTTATTAA
AGCATGCCCACGGATTAATCATATTTACTGGTCCGACTGGATCAGGTAAAACTACCACCCTATACTCGCTTCTCAATGAA
ACCTCACATATGTTCAATCGCAATGTCATCACCCTTGAAGACCCAATTGAAAAAGAAAATGATATGGTGCTCCAGGTTCA
AGTGAATGAAAAAGCTGGTGTGTCCTATGCAACTGGATTAAAAGCCATTCTAAGACATGATCCAGATATCATTATGGTGG
GGGAAATCAGAGATGCAGAAACGGCAAAGATAGCAGTAAGAGCTGCACTAACAGGTCATTTAGTTCTTTCAACGATGCAT
ACGAGGGATGCTAGAGGAGCGATTTACAGGTTGAGTGAATTTGGTGTGGATGCAGTAGAAATAGAACAGACCCTTGTTGC
TGTAACCGCGCAGAGACTTGTGGAATTATCCTGTCCGTTTTGTAAGGGGGAATGTTCCCCATACTGCTATTGCTATGGCA
GATTGAAAAGAGCAAGTATTTTTGAGCTTTTAACTGGGAGGGCATTAAATGCCTCTATGAAAGTGGCGAGAGGTGAAGAG
GCTGTTATTAAATACCGAAACTTAAAGGATATCATTAAAAAGGGAATAGCACTTGGTTTTATTAAAGAAACAGAATATGA
GCGGTGGGTGCTTGAGCATGATCAAACGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

55.84

98.596

0.551

  pilB Glaesserella parasuis strain SC1401

39.773

98.876

0.393

  pilB Haemophilus influenzae 86-028NP

37.069

97.753

0.362


Multiple sequence alignment