Detailed information    

In silico: bioinformatically predicted

Overview


Name: comC
Type: Machinery gene
Locus tag: GO004_RS20785
Genome accession: NZ_CP046591
Coordinates: 4022172..4022918 (+)
Length: 248 a.a.
NCBI ID: WP_088467305.1
UniProt ID: -
Organism: Bacillus subtilis strain R31
Function: processing and translocation of ComGC; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
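
The coordinates and protein length in this overview agree with each other if the coordinates are 1-based and inclusive and the coding sequence ends in a single stop codon. A minimal sketch of that arithmetic follows; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate span."""
    return end - start + 1


def protein_length(start: int, end: int) -> int:
    """Residue count for a CDS span, assuming it ends in one stop codon."""
    nt = span_length(start, end)
    if nt % 3 != 0:
        raise ValueError(f"span of {nt} bp is not a whole number of codons")
    return nt // 3 - 1  # the final codon is the stop and encodes no residue


print(span_length(4022172, 4022918))     # 747
print(protein_length(4022172, 4022918))  # 248
```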

Genomic Context


Location: 4017172..4027918
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
GO004_RS20770 (GO004_20785) | - | 4017399..4017590 (-) | 192 | WP_003222590.1 | hypothetical protein | -
GO004_RS20775 (GO004_20790) | valS | 4018038..4020680 (+) | 2643 | WP_195727758.1 | valine--tRNA ligase | -
GO004_RS20780 (GO004_20795) | folC | 4020740..4022032 (+) | 1293 | WP_041339279.1 | folylpolyglutamate synthase/dihydrofolate synthase family protein | -
GO004_RS20785 (GO004_20800) | comC | 4022172..4022918 (+) | 747 | WP_088467305.1 | A24 family peptidase | Machinery gene
GO004_RS20790 (GO004_20805) | spoIIB | 4023052..4024050 (+) | 999 | WP_088467304.1 | stage II sporulation protein SpoIIB | -
GO004_RS20795 (GO004_20810) | maf | 4024203..4024772 (+) | 570 | WP_088467303.1 | Maf family nucleotide pyrophosphatase | -
GO004_RS20800 (GO004_20815) | radC | 4024809..4025210 (+) | 402 | Protein_4069 | DNA repair protein RadC | -
GO004_RS20805 (GO004_20820) | - | 4025229..4026647 (-) | 1419 | WP_195727934.1 | recombinase family protein | -
GO004_RS20810 (GO004_20825) | - | 4026696..4027142 (-) | 447 | WP_195727760.1 | ImmA/IrrE family metallo-endopeptidase | -
GO004_RS20815 (GO004_20830) | - | 4027156..4027590 (-) | 435 | WP_103330184.1 | helix-turn-helix transcriptional regulator | -
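
The listed location appears to be the comC span extended by 5000 bp on each side, and every Size (bp) value equals end minus start plus one. The sketch below reproduces both observations. The 5000 bp flank is inferred from this record rather than documented, and the helper names are hypothetical.

```python
FLANK_BP = 5000  # assumption: inferred from 4022172 - 4017172 in this record

# (gene name or locus tag, start, end, strand, size in bp), copied from the table above
GENES = [
    ("GO004_RS20770", 4017399, 4017590, "-", 192),
    ("valS", 4018038, 4020680, "+", 2643),
    ("folC", 4020740, 4022032, "+", 1293),
    ("comC", 4022172, 4022918, "+", 747),
    ("spoIIB", 4023052, 4024050, "+", 999),
    ("maf", 4024203, 4024772, "+", 570),
    ("radC", 4024809, 4025210, "+", 402),
    ("GO004_RS20805", 4025229, 4026647, "-", 1419),
    ("GO004_RS20810", 4026696, 4027142, "-", 447),
    ("GO004_RS20815", 4027156, 4027590, "-", 435),
]


def context_window(start, end, flank=FLANK_BP):
    """Extend a 1-based inclusive span by `flank` bp on both sides."""
    return max(1, start - flank), end + flank


def intergenic_gap(upstream_end, downstream_start):
    """Number of bases strictly between two non-overlapping features."""
    return downstream_start - upstream_end - 1


window = context_window(4022172, 4022918)
# Keep only genes that lie entirely inside the window.
inside = [g for g in GENES if window[0] <= g[1] and g[2] <= window[1]]
# Check each tabulated size against its coordinates.
sizes_ok = all(end - start + 1 == size for _, start, end, _, size in GENES)

print(window)                             # (4017172, 4027918)
print(len(inside), sizes_ok)              # 10 True
print(intergenic_gap(4022032, 4022172))   # 139 bp between folC and comC
```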

Sequence


Protein


Length: 248 a.a.; Molecular weight: 26455.05 Da; Isoelectric point: 9.7944

>NTDB_id=405832 GO004_RS20785 WP_088467305.1 4022172..4022918(+) (comC) [Bacillus subtilis strain R31]
MLSILFIFGLILGSFYYTAGCRIPLHLSIIAPRSSCPFCRRTLTPAELIPILSFLFQKGKCKSCGHRISFMYPAAELVTA
CLFAAAGIRFGTSLELFPAVVFISLLIIVAVTDIHFMLIPNRILIFFLPFLAAARLISPLDSWYAGLLGAAAGFLFLAVI
AAITHGGVGGGDIKLFAVIGFMLGVKMLAAAFFFSVLIGALYGAAAVLTGRLAKRQPLPFAPAIAAGSILAYLYGDSIIS
FYIKMALG
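
The FASTA header of this record packs several fields into one line. The sketch below splits it with a regular expression. The field layout is inferred from this single header and is an assumption, not a documented format, and `parse_header` is a hypothetical helper.

```python
import re

# Assumed layout: >NTDB_id=<id> <locus tag> <protein ID> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+(?P<locus_tag>\S+)\s+(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Return the header fields as a dict, with coordinates as integers."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not follow the assumed layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (">NTDB_id=405832 GO004_RS20785 WP_088467305.1 "
          "4022172..4022918(+) (comC) [Bacillus subtilis strain R31]")
record = parse_header(header)
print(record["gene"], record["start"], record["end"], record["strand"])
```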

Nucleotide


Length: 747 bp

>NTDB_id=405832 GO004_RS20785 WP_088467305.1 4022172..4022918(+) (comC) [Bacillus subtilis strain R31]
ATGCTATCCATTCTTTTTATCTTCGGACTTATCCTTGGTTCTTTTTACTATACGGCCGGGTGCCGTATCCCCTTACATCT
ATCTATTATTGCGCCCCGTTCATCATGCCCGTTTTGCCGGCGGACATTAACTCCTGCAGAATTAATTCCCATCCTGTCAT
TCCTATTTCAAAAAGGTAAATGTAAAAGCTGCGGGCATAGGATTTCTTTTATGTATCCCGCAGCAGAGCTTGTGACAGCG
TGTTTATTTGCTGCCGCAGGAATACGCTTTGGCACATCGCTGGAACTGTTTCCCGCTGTGGTGTTTATCTCTCTTCTCAT
TATTGTTGCAGTGACAGATATTCATTTTATGCTGATTCCAAATCGAATATTGATTTTCTTTCTTCCCTTTTTGGCGGCCG
CGAGATTGATTTCTCCACTTGATTCCTGGTATGCAGGCCTGCTAGGTGCGGCAGCCGGATTTCTATTTCTGGCTGTAATT
GCCGCAATCACCCATGGGGGAGTAGGGGGAGGAGATATTAAATTATTTGCGGTGATTGGCTTTATGCTTGGTGTGAAAAT
GCTGGCAGCTGCCTTTTTCTTTTCAGTTTTGATAGGTGCATTATATGGAGCGGCAGCTGTTCTGACTGGTAGACTCGCTA
AAAGGCAGCCGCTTCCCTTCGCCCCCGCTATAGCCGCAGGGAGCATTTTAGCCTATTTATACGGTGACTCTATCATTTCT
TTTTATATCAAAATGGCATTGGGCTGA
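
The nucleotide record should translate to the protein record. The sketch below is one way to check this with the standard genetic code. The bacterial table 11 assigns the same amino acids and differs only in the start codons it permits. The function name is illustrative, and sequences with ambiguous bases are not handled.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, dropping one trailing stop codon if present."""
    cds = "".join(cds.split()).upper()  # remove line breaks from FASTA wrapping
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First five and last four codons of the nucleotide record above.
print(translate("ATGCTATCCATTCTT"))  # MLSIL
print(translate("GCATTGGGCTGA"))     # ALG
```

Passing the full 747 bp sequence to `translate` and comparing the result with the 248-residue protein record completes the check.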


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comC | Bacillus subtilis subsp. subtilis str. 168 | 99.194 | 100 | 0.992
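
The page does not define the Ha-value. One reading that fits the row above is the product of fractional identity and fractional coverage. That definition is an assumption here, as is the 246-of-248 identical-residue count used to illustrate the identity percentage.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: fractional identity times fractional coverage."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


print(round(ha_value(99.194, 100), 3))  # 0.992

# Hypothetical illustration: 246 identical residues out of 248 aligned
# gives the listed identity percentage.
print(round(246 / 248 * 100, 3))        # 99.194
```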


Multiple sequence alignment