Detailed information    

insolico Bioinformatically predicted

Overview


Name   comC   Type   Machinery gene
Locus tag   S101267_RS14200 Genome accession   NZ_CP021505
Coordinates   2726745..2727497 (-) Length   250 a.a.
NCBI ID   WP_013353066.1    Uniprot ID   A0A9P1JIQ0
Organism   Bacillus amyloliquefaciens strain SRCM101267     
Function   processing and translocation of ComGC; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2721745..2732497
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  S101267_RS14175 (S101267_02845) mreC 2722180..2723043 (-) 864 WP_013353061.1 rod shape-determining protein MreC -
  S101267_RS14180 (S101267_02846) mreB 2723074..2724087 (-) 1014 WP_013353062.1 cell shape-determining protein MreB -
  S101267_RS14185 (S101267_02847) radC 2724179..2724874 (-) 696 WP_013353063.1 RadC family protein -
  S101267_RS14190 (S101267_02848) - 2724906..2725475 (-) 570 WP_013353064.1 Maf family protein -
  S101267_RS14195 (S101267_02849) - 2725617..2726618 (-) 1002 WP_013353065.1 SPOR domain-containing protein -
  S101267_RS14200 (S101267_02850) comC 2726745..2727497 (-) 753 WP_013353066.1 prepilin peptidase Machinery gene
  S101267_RS14205 (S101267_02851) - 2727637..2728929 (-) 1293 WP_013353067.1 bifunctional folylpolyglutamate synthase/dihydrofolate synthase -
  S101267_RS14210 (S101267_02852) - 2728988..2731630 (-) 2643 WP_013353068.1 valine--tRNA ligase -
  S101267_RS14215 (S101267_02853) - 2732081..2732272 (+) 192 WP_013353069.1 hypothetical protein -

Sequence


Protein


Download         Length: 250 a.a.        Molecular weight: 26869.51 Da        Isoelectric Point: 9.0703

>NTDB_id=231413 S101267_RS14200 WP_013353066.1 2726745..2727497(-) (comC) [Bacillus amyloliquefaciens strain SRCM101267]
MLLILFLLGLIFGSFFYTAACRIPLRISVISPRSSCSFCGLPLSWGELVPVVSYVLQKGRCRNCRGKLSIMYPAAECWTA
CLFTAAGIHFGFSRELLVALLFLSLLMIVTVTDLQYMLIPDKVLLFFLPLLIVGRIFSPLDSWYAGFAGAVCGFFLLIFI
MFVSKGGIGAGDVKLFGVIGLALGVKLVLIAFFLSVIIGAVYGMCAAAGGRLGKKQPFPFAPAIAAGSAVSYLYGEELFS
FYIKLASGGA

Nucleotide


Download         Length: 753 bp        

>NTDB_id=231413 S101267_RS14200 WP_013353066.1 2726745..2727497(-) (comC) [Bacillus amyloliquefaciens strain SRCM101267]
GTGCTTTTGATTCTGTTTTTGCTCGGTTTGATTTTCGGTTCTTTTTTTTATACAGCAGCGTGCCGTATCCCGCTTCGGAT
CTCGGTTATTTCGCCGCGCTCATCCTGTTCCTTTTGCGGCCTGCCGCTCTCCTGGGGTGAGCTGGTGCCCGTCGTTTCTT
ATGTCCTGCAAAAAGGGAGATGCAGAAACTGCCGTGGGAAGCTGTCGATTATGTATCCGGCGGCGGAATGCTGGACGGCA
TGTTTATTTACGGCTGCCGGTATTCATTTCGGTTTCTCAAGAGAACTGTTAGTCGCGCTGTTATTTCTGTCTCTATTAAT
GATTGTTACCGTGACAGATCTGCAATATATGCTGATTCCTGACAAGGTTCTGCTGTTTTTTCTGCCGCTTCTCATCGTCG
GCCGTATTTTTTCTCCGCTGGATTCATGGTATGCGGGGTTTGCCGGAGCCGTTTGCGGATTTTTTCTGCTTATTTTTATT
ATGTTTGTCAGTAAAGGAGGCATCGGTGCAGGTGATGTGAAACTGTTTGGGGTCATCGGCCTTGCGCTCGGCGTGAAGCT
CGTACTCATTGCATTCTTTCTTTCCGTTATAATCGGAGCCGTATACGGGATGTGCGCCGCAGCGGGCGGCAGGCTTGGTA
AAAAGCAGCCTTTCCCGTTTGCGCCCGCCATCGCGGCCGGGAGCGCTGTGAGTTATTTATACGGTGAAGAACTGTTTTCG
TTTTACATCAAACTCGCTTCCGGAGGAGCTTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comC Bacillus subtilis subsp. subtilis str. 168

59.677

99.2

0.592


Multiple sequence alignment