Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comGA
Type: Machinery gene
Locus tag: NM232_RS02560
Genome accession: NZ_CP101399
Coordinates: 428998..430068 (-)
Length: 356 a.a.
NCBI ID: WP_058214148.1
UniProt ID: -
Organism: Bacillus altitudinis strain HM-7
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
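
The coordinates and the protein length in this overview can be checked against each other. The sketch below assumes the coordinates are 1-based and inclusive and that the span includes the stop codon; `parse_location` is a hypothetical helper written for this illustration, not part of any database tooling.

```python
import re


def parse_location(text):
    """Parse 'start..end (strand)' into (start, end, strand)."""
    match = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)


start, end, strand = parse_location("428998..430068 (-)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span_bp = end - start + 1

# Assumption: the span covers the stop codon, which encodes no residue.
codons = span_bp // 3
residues = codons - 1

print(span_bp, codons, residues, strand)
```

Under these assumptions the span is 1071 bp, or 357 codons, which agrees with the reported 356 a.a. plus one stop codon.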

Genomic Context


Location: 423998..435068
Locus tag      Gene name  Coordinates (strand)  Size (bp)  Protein ID      Product                                                    Description
NM232_RS02510  sipW       424093..424665 (-)    573        WP_255233976.1  signal peptidase I SipW                                    -
NM232_RS02515  tapA       424715..425242 (-)    528        WP_255233977.1  amyloid fiber anchoring/assembly protein TapA              -
NM232_RS02520  -          425495..425821 (+)    327        WP_039166025.1  DUF3889 domain-containing protein                          -
NM232_RS02525  -          425856..426050 (-)    195        WP_008342191.1  YqzE family protein                                        -
NM232_RS02530  comGG      426111..426494 (-)    384        WP_046343972.1  competence type IV pilus minor pilin ComGG                 -
NM232_RS02535  comGF      426491..426988 (-)    498        WP_046343974.1  competence type IV pilus minor pilin ComGF                 -
NM232_RS02540  comGE      426930..427244 (-)    315        WP_007501227.1  competence type IV pilus minor pilin ComGE                 -
NM232_RS02545  comGD      427228..427674 (-)    447        WP_017357985.1  competence type IV pilus minor pilin ComGD                 -
NM232_RS02550  comGC      427667..427960 (-)    294        WP_008342186.1  competence type IV pilus major pilin ComGC                 Machinery gene
NM232_RS02555  comGB      427977..429017 (-)    1041       WP_060781189.1  competence type IV pilus assembly protein ComGB            -
NM232_RS02560  comGA      428998..430068 (-)    1071       WP_058214148.1  competence type IV pilus ATPase ComGA                      Machinery gene
NM232_RS02565  -          430523..431395 (-)    873        WP_035390819.1  STAS domain-containing protein                             -
NM232_RS02575  -          431621..432001 (-)    381        WP_007501232.1  Spx/MgsR family RNA polymerase-binding regulatory protein  -
NM232_RS02580  -          432214..432456 (+)    243        WP_003217440.1  DUF2626 domain-containing protein                          -
NM232_RS02585  -          432505..433143 (-)    639        WP_017357989.1  MBL fold metallo-hydrolase                                 -
NM232_RS02590  -          433304..433477 (+)    174        WP_003217333.1  DUF2759 domain-containing protein                          -
NM232_RS02595  -          433502..433816 (-)    315        WP_017367597.1  MTH1187 family thiamine-binding protein                    -
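
Two things can be read off the coordinates in this table: the displayed location matches the comGA span extended by 5000 bp on each side, and several adjacent comG genes overlap. The sketch below recomputes both. The coordinates are copied from the table; the 5000 bp flank is inferred from the numbers and is an assumption, not a documented setting, and `overlap_bp` is a hypothetical helper.

```python
# Coordinates copied from the genomic context table (1-based, inclusive).
genes = [
    ("comGG", 426111, 426494),
    ("comGF", 426491, 426988),
    ("comGE", 426930, 427244),
    ("comGD", 427228, 427674),
    ("comGC", 427667, 427960),
    ("comGB", 427977, 429017),
    ("comGA", 428998, 430068),
]


def overlap_bp(a, b):
    """Number of bases shared by two inclusive intervals (0 if disjoint)."""
    return max(0, min(a[2], b[2]) - max(a[1], b[1]) + 1)


# Overlap between each pair of neighbouring genes, in table order.
overlaps = {
    f"{a[0]}/{b[0]}": overlap_bp(a, b) for a, b in zip(genes, genes[1:])
}

# Assumption: the displayed window is the target gene plus a fixed flank.
FLANK = 5000
target = genes[-1]  # comGA
window = (target[1] - FLANK, target[2] + FLANK)

print(overlaps)
print(window)
```

With these inputs, comGB and comGA share 20 bp, while comGC and comGB do not overlap.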

Sequence


Protein


Length: 356 a.a.
Molecular weight: 40386.71 Da
Isoelectric point: 9.0449

>NTDB_id=709968 NM232_RS02560 WP_058214148.1 428998..430068(-) (comGA) [Bacillus altitudinis strain HM-7]
MYGIEYLGQELLEEACRMRASDVHIVPREKEASVSFRVDADLIQQRIIDKKSGERLIAHFKFLSSMDIGERRRPQNGSLA
VMLRRGQVFVRLSTLPTVNDESLVIRILPQDHVPKTKYLSLFPKASTRLLSFLNHSHGLILFTGPTNSGKTTTLYSLIQF
AKKHFNRNIITLEDPVETRNEEVLQVQVNEKAGITYAAGLRAILRHDPDMIVLGEIRDAETARTAIRAALTGHLVMSTLH
AKNAKGALYRMLEFGVTMNELEQTMVAIAAQRLIELACPFCGETCELYCKLNRSVRRTNVFELLYGKELGECIKEAKGEY
AHSSYETLQRLIRKGVALGYLSKSTYHRWVYEEANL
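
As a quick sanity check on this record, the sketch below confirms the sequence length and scans for a Walker A (P-loop) pattern, G-x(4)-G-K-[S/T], a motif commonly found in nucleotide-binding proteins. The motif scan is an illustration added here and is not part of the database record; a regular-expression hit alone does not establish function.

```python
import re

# Protein sequence copied from the FASTA record above.
protein = (
    "MYGIEYLGQELLEEACRMRASDVHIVPREKEASVSFRVDADLIQQRIIDKKSGERLIAHFKFLSSMDIGERRRPQNGSLA"
    "VMLRRGQVFVRLSTLPTVNDESLVIRILPQDHVPKTKYLSLFPKASTRLLSFLNHSHGLILFTGPTNSGKTTTLYSLIQF"
    "AKKHFNRNIITLEDPVETRNEEVLQVQVNEKAGITYAAGLRAILRHDPDMIVLGEIRDAETARTAIRAALTGHLVMSTLH"
    "AKNAKGALYRMLEFGVTMNELEQTMVAIAAQRLIELACPFCGETCELYCKLNRSVRRTNVFELLYGKELGECIKEAKGEY"
    "AHSSYETLQRLIRKGVALGYLSKSTYHRWVYEEANL"
)

# Walker A consensus: G, any four residues, G, K, then S or T.
walker_a = re.search(r"G.{4}GK[ST]", protein)

print("length:", len(protein))
if walker_a is not None:
    # Report the 1-based start position of the first match.
    print("Walker A-like motif:", walker_a.group(), "at", walker_a.start() + 1)
```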

Nucleotide


Length: 1071 bp

>NTDB_id=709968 NM232_RS02560 WP_058214148.1 428998..430068(-) (comGA) [Bacillus altitudinis strain HM-7]
TTGTATGGTATTGAATATTTGGGGCAAGAGCTGTTAGAAGAGGCGTGCCGAATGAGAGCATCTGATGTGCATATTGTTCC
GAGAGAAAAGGAAGCTTCAGTCTCTTTTCGTGTAGACGCTGACCTCATTCAGCAGCGGATCATTGACAAAAAGAGCGGAG
AGCGGCTCATTGCTCATTTCAAATTTTTATCTTCTATGGATATTGGTGAAAGAAGGAGACCACAAAATGGATCATTGGCT
GTGATGCTAAGACGTGGTCAAGTGTTTGTTCGGCTCTCTACACTACCAACTGTAAATGATGAAAGTCTAGTGATTCGGAT
TTTACCGCAGGATCATGTGCCTAAAACAAAGTATTTGTCACTGTTCCCGAAAGCTTCAACGAGATTATTATCGTTTCTGA
ATCATTCGCATGGGCTCATTTTATTTACAGGACCAACAAATTCAGGGAAAACGACAACACTTTATTCATTGATCCAGTTT
GCCAAAAAACATTTTAATCGAAATATTATTACCCTTGAAGATCCTGTGGAAACAAGAAATGAAGAAGTGCTGCAGGTTCA
AGTAAATGAAAAAGCTGGTATCACGTATGCGGCAGGTTTACGCGCTATTTTAAGACATGATCCAGATATGATTGTGCTTG
GAGAAATAAGAGATGCGGAAACAGCGAGAACAGCTATTAGAGCAGCTTTGACAGGTCATTTAGTCATGAGCACACTGCAC
GCAAAGAATGCAAAAGGAGCTCTTTATCGAATGCTCGAATTTGGTGTGACGATGAATGAACTCGAACAAACAATGGTTGC
CATTGCTGCTCAGCGGTTAATCGAACTCGCATGTCCATTTTGCGGCGAAACGTGTGAGCTTTATTGCAAACTAAATCGGT
CAGTCAGACGAACAAATGTATTTGAATTGCTTTACGGAAAGGAGCTTGGTGAGTGTATCAAAGAGGCAAAAGGAGAGTAC
GCTCATTCGTCATACGAAACACTGCAAAGATTAATTCGGAAAGGAGTGGCACTCGGCTATTTATCAAAAAGCACATATCA
TCGCTGGGTTTATGAAGAAGCGAACCTCTAA
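
The nucleotide record starts with TTG rather than ATG, while the protein starts with M. In bacteria, alternative start codons such as TTG are read as methionine at initiation. The sketch below translates the first and last codons of the record with the standard codon table to illustrate this. `translate_cds` and its `initiator` flag are hypothetical names for this example; only two short fragments of the record are used.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(cds, initiator=True):
    """Translate an in-frame DNA string; '*' marks a stop codon.

    If initiator is True, the first codon is reported as M, because the
    initiating codon is read as methionine even when it is TTG or GTG.
    """
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    if initiator and residues:
        residues[0] = "M"
    return "".join(residues)


# First 30 and last 15 nucleotides of the record above.
head = translate_cds("TTGTATGGTATTGAATATTTGGGGCAAGAG")
tail = translate_cds("GAAGCGAACCTCTAA", initiator=False)

print(head)
print(tail)
```

The two fragments translate to the first ten residues and the last four residues (plus the stop) of the protein record.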


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein  Organism                                    Identities (%)  Coverage (%)  Ha-value
comGA    Bacillus subtilis subsp. subtilis str. 168  64.607          100           0.646
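
In this single row, the Ha-value is numerically equal to the fractional identity multiplied by the fractional coverage, rounded to three decimals. The sketch below reproduces that arithmetic. This is an observation from one row only; the source does not define the Ha-value, so the formula should be treated as an assumption.

```python
identity_pct = 64.607
coverage_pct = 100.0

# Assumption: Ha-value = (identity / 100) * (coverage / 100), rounded to 3 places.
ha_candidate = round((identity_pct / 100) * (coverage_pct / 100), 3)

print(ha_candidate)
```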