Detailed information    

In silico (bioinformatically predicted)

Overview


Name   comC
Type   Machinery gene
Locus tag   S100333_RS14450
Genome accession   NZ_CP021892
Coordinates   2683386..2684132 (-)
Length   248 a.a.
NCBI ID   WP_072592551.1
Uniprot ID   -
Organism   Bacillus subtilis subsp. subtilis strain SRCM100333
Function   processing and translocation of ComGC; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake

Genomic Context


Location: 2678386..2689132
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  S100333_RS14425 (S100333_02946) mreC 2678792..2679664 (-) 873 WP_003222609.1 rod shape-determining protein MreC -
  S100333_RS14430 (S100333_02947) mreB 2679695..2680708 (-) 1014 WP_003229650.1 cell shape-determining protein MreB -
  S100333_RS14435 (S100333_02948) radC 2680800..2681495 (-) 696 WP_014480455.1 DNA repair protein RadC -
  S100333_RS14440 (S100333_02949) maf 2681532..2682101 (-) 570 WP_014480456.1 Maf family nucleotide pyrophosphatase -
  S100333_RS14445 (S100333_02950) spoIIB 2682254..2683252 (-) 999 WP_014480457.1 stage II sporulation protein SpoIIB -
  S100333_RS14450 (S100333_02951) comC 2683386..2684132 (-) 747 WP_072592551.1 A24 family peptidase Machinery gene
  S100333_RS14455 (S100333_02952) folC 2684272..2685564 (-) 1293 WP_014480460.1 folylpolyglutamate synthase/dihydrofolate synthase family protein -
  S100333_RS14460 (S100333_02953) valS 2685624..2688266 (-) 2643 WP_088272417.1 valine--tRNA ligase -
  S100333_RS14465 (S100333_02954) - 2688714..2688905 (+) 192 WP_003222590.1 hypothetical protein -
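
The coordinates in this table are 1-based and inclusive, so a feature's size in bp is end - start + 1. The sketch below checks that arithmetic for comC and shows that the Location window above equals the gene extended by 5,000 bp on each side. The 5,000 bp flank is inferred from the numbers on this page rather than from a documented setting, and the function names are illustrative only.

```python
def feature_size(start: int, end: int) -> int:
    """Size in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def context_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend a feature by `flank` bp on each side (flank value is an assumption)."""
    return max(1, start - flank), end + flank


# comC coordinates as listed in the record.
COMC_START, COMC_END = 2683386, 2684132

size_bp = feature_size(COMC_START, COMC_END)
# A coding sequence includes the stop codon, which is not translated.
protein_len = size_bp // 3 - 1
window = context_window(COMC_START, COMC_END)

print(size_bp)      # 747
print(protein_len)  # 248
print(window)       # (2678386, 2689132)
```

The same formula reproduces the Size (bp) column for the neighbouring genes, for example 2679664 - 2678792 + 1 = 873 for mreC.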

Sequence


Protein


Length: 248 a.a.        Molecular weight: 26422.99 Da        Isoelectric Point: 9.7944

>NTDB_id=234550 S100333_RS14450 WP_072592551.1 2683386..2684132(-) (comC) [Bacillus subtilis subsp. subtilis strain SRCM100333]
MLSILFIFGLILGSFYYTAGCRIPLHLSIIAPRSSCPFCRRTLTPAELIPILSFLFQKGKCKSCGHRISFMYPAAELVTA
CLFAAAGIRFGTSLELFPAVVFISLLIIVAVTDIHFMLIPNRILIFFLPFLAAARLISPLDSWYAGLLGAAAGFLFLAVI
AAITHGGVGGGDIKLFAVIGFVLGVKMLAAAFFFSVLIGALYGAAAVLTGRLAKRQPLPFAPAIAAGSILAYLYGDSIIS
FYIKMALG

Nucleotide


Length: 747 bp

>NTDB_id=234550 S100333_RS14450 WP_072592551.1 2683386..2684132(-) (comC) [Bacillus subtilis subsp. subtilis strain SRCM100333]
ATGCTATCCATTCTTTTTATCTTCGGGCTTATCCTTGGTTCTTTTTACTATACGGCCGGGTGCCGTATCCCCTTACATCT
ATCTATTATTGCGCCCCGTTCATCATGCCCGTTTTGCCGGAGGACATTAACTCCTGCAGAATTAATTCCCATCCTGTCAT
TCCTATTTCAAAAAGGTAAATGTAAAAGCTGCGGGCATAGGATTTCTTTTATGTATCCCGCAGCAGAGCTTGTGACAGCG
TGTTTATTTGCCGCCGCAGGAATACGCTTTGGCACGTCGCTGGAACTGTTTCCCGCTGTGGTGTTTATCTCTCTTCTCAT
TATTGTTGCAGTGACAGATATTCATTTTATGCTGATTCCAAATCGTATATTGATTTTCTTTCTTCCCTTTTTGGCGGCCG
CGAGATTGATTTCTCCGCTTGATTCCTGGTATGCAGGCCTGTTAGGTGCGGCAGCCGGATTTCTATTTCTGGCTGTAATT
GCCGCAATCACCCATGGGGGAGTAGGGGGAGGAGATATTAAATTATTTGCGGTGATTGGCTTTGTGCTTGGTGTGAAAAT
GCTGGCAGCTGCCTTTTTCTTTTCTGTTTTGATAGGTGCATTATATGGAGCGGCAGCCGTTCTGACTGGTAGACTCGCTA
AAAGGCAGCCGCTTCCCTTTGCCCCCGCTATAGCCGCAGGGAGCATTTTAGCCTATTTATACGGTGACTCTATCATTTCT
TTTTATATCAAAATGGCATTGGGCTGA
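
The nucleotide record is given on the coding strand (it starts with ATG and ends with TGA), so it can be translated directly to obtain the protein sequence shown above. The following is a minimal sketch using the standard codon assignments, which bacterial translation table 11 shares for elongation codons. The `translate` helper and the two excerpt constants are illustrative names, and only the first 30 nt and the last line of the record are used to keep the example short.

```python
from itertools import product

# Standard codon table in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the nucleotide record; both are in frame.
CDS_START = "ATGCTATCCATTCTTTTTATCTTCGGGCTT"   # first 30 nt
CDS_END = "TTTTATATCAAAATGGCATTGGGCTGA"        # last 27 nt, including the stop codon

print(translate(CDS_START))  # MLSILFIFGL
print(translate(CDS_END))    # FYIKMALG
```

Both outputs match the start and end of the 248 a.a. protein sequence listed in the Protein section.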


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
comC   Bacillus subtilis subsp. subtilis str. 168   99.597   100   0.996


Multiple sequence alignment