Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   ACFMPA_RS12955 Genome accession   NZ_CP172605
Coordinates   2527870..2530200 (-) Length   776 a.a.
NCBI ID   WP_326225400.1    Uniprot ID   -
Organism   Bacillus sp. SG20032     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2522870..2535200
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACFMPA_RS12920 (ACFMPA_12920) - 2523209..2523547 (-) 339 WP_004399162.1 YqxA family protein -
  ACFMPA_RS12925 (ACFMPA_12925) spoIIP 2523564..2524769 (-) 1206 WP_003229993.1 spore autolysin SpoIIP -
  ACFMPA_RS12930 (ACFMPA_12930) gpr 2524832..2525938 (-) 1107 WP_003229991.1 GPR endopeptidase -
  ACFMPA_RS12935 (ACFMPA_12935) rpsT 2526142..2526408 (+) 267 WP_003229989.1 30S ribosomal protein S20 -
  ACFMPA_RS12940 (ACFMPA_12940) holA 2526423..2527466 (-) 1044 WP_003229987.1 DNA polymerase III subunit delta -
  ACFMPA_RS12945 (ACFMPA_12945) - 2527506..2527655 (-) 150 WP_003229985.1 hypothetical protein -
  ACFMPA_RS12950 (ACFMPA_12950) - 2527696..2527830 (+) 135 WP_003229983.1 YqzM family protein -
  ACFMPA_RS12955 (ACFMPA_12955) comEC 2527870..2530200 (-) 2331 WP_326225400.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  ACFMPA_RS12960 (ACFMPA_12960) - 2530204..2530773 (-) 570 WP_003229978.1 ComE operon protein 2 -
  ACFMPA_RS12965 (ACFMPA_12965) comEA 2530840..2531457 (-) 618 WP_032722140.1 competence protein ComEA Machinery gene
  ACFMPA_RS12970 (ACFMPA_12970) comER 2531541..2532362 (+) 822 WP_015714289.1 late competence protein ComER -
  ACFMPA_RS12975 (ACFMPA_12975) - 2532428..2533171 (-) 744 WP_069322765.1 class I SAM-dependent methyltransferase -
  ACFMPA_RS12980 (ACFMPA_12980) rsfS 2533168..2533524 (-) 357 WP_041333759.1 ribosome silencing factor -
  ACFMPA_RS12985 (ACFMPA_12985) yqeK 2533542..2534102 (-) 561 WP_014480316.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  ACFMPA_RS12990 (ACFMPA_12990) - 2534092..2534661 (-) 570 WP_004398676.1 nicotinate-nucleotide adenylyltransferase -
  ACFMPA_RS12995 (ACFMPA_12995) yhbY 2534673..2534963 (-) 291 WP_003226133.1 ribosome assembly RNA-binding protein YhbY -

Sequence


Protein


Download         Length: 776 a.a.        Molecular weight: 86742.37 Da        Isoelectric Point: 7.1433

>NTDB_id=1068234 ACFMPA_RS12955 WP_326225400.1 2527870..2530200(-) (comEC) [Bacillus sp. SG20032]
MRNSRLLLPMAAASATAGITAAAYFPAIFLFILFLLIILIKTRHAFLIIVCFFSFILFFVLYAVTDSQNVSSYRQGTYQF
KAVIDTIPKIDGDRMSMMVETPDKEKWAAAYRIQSAGEKEQLLYIEPGMSCELTGTLEEPNHATVLGAFDYNEYLYRQHI
HWNYSVTSIQNCSEPENFKYKVLSLRKHIISFTNSLLPPDSAGIVQALTVGDRFYVEDEVLTAYQKLGVVHLLAISGLHV
GILTAGLFYIMIRLGITREKASILLLLFLPLYVMLTGAAPSVLRAALMSGVYLAGSLVKWRVHSATAICLSYIVLLLFNP
YHLFEAGFQLSFAVSFSLILSSSIFHQVKTSLGQLTIVSLIAQLGSLPILLYHFHQFSIISVPMNMLMVPFYTFCILPGA
VAGVLLLSLSASFGRLFFSWFDLLISWTNRLITNIADVEVFTIMIAHPAPVLLFLFTVTIILLLMAIEKRSLSQLMVTGG
ICCTVLFLLFIYPCLSSEGEVDMIDIGQGDSMFVGAPHQRGRVLIDTGGTLSYSSEPWREKQHPFSLGEKVLIPFLTAKG
IKQLDALILTHADQDHIGEAEILLKHHKVKRLVIPKGFVSEPKDEKVLQAAREEGVAIEEVKRGDVLQIKDLQFHVLSPE
TPDPASKNNSSLVLWMETGVLSWILTGDLEKEGEQEVMDVFPNIKADVLKVGHHGSKGSTGEEFIQQLQPKTAIISAGKN
NRYYHPHQKVLQLLQRHSIRVLRTDQNGTIQYRYKNRVGTFSVYPPYDTSDITETN

Nucleotide


Download         Length: 2331 bp        

>NTDB_id=1068234 ACFMPA_RS12955 WP_326225400.1 2527870..2530200(-) (comEC) [Bacillus sp. SG20032]
ATGCGTAATTCGCGCTTATTATTGCCTATGGCGGCAGCTTCGGCAACGGCTGGAATTACTGCCGCCGCTTATTTCCCCGC
TATTTTTCTTTTCATCCTCTTTCTCCTCATCATTTTAATCAAAACGAGGCACGCTTTTCTTATTATTGTTTGTTTCTTCT
CTTTTATATTGTTTTTTGTACTGTATGCAGTCACAGATTCTCAGAATGTCTCTTCCTATCGGCAGGGAACCTATCAATTC
AAGGCAGTGATTGACACTATTCCCAAAATTGACGGCGACCGTATGTCTATGATGGTTGAGACACCTGATAAGGAAAAATG
GGCTGCTGCGTATCGCATTCAGTCTGCTGGTGAAAAAGAACAGCTGTTATACATAGAACCAGGAATGTCATGTGAGTTGA
CTGGTACATTGGAAGAACCGAATCACGCAACTGTGCTGGGTGCATTTGATTATAACGAGTATCTTTATCGGCAGCATATT
CATTGGAACTACTCTGTCACGTCTATTCAAAACTGCAGCGAACCTGAAAATTTTAAGTACAAGGTGCTCAGCTTGAGAAA
ACATATCATATCATTCACAAACAGCCTTTTGCCTCCTGATTCGGCAGGGATTGTACAGGCACTTACAGTTGGTGACAGAT
TTTATGTGGAGGATGAAGTGCTTACCGCGTATCAAAAGCTTGGTGTTGTCCATCTCTTGGCAATATCAGGACTCCACGTG
GGGATTTTGACAGCAGGTTTGTTTTATATCATGATTCGCCTTGGTATAACTAGAGAAAAGGCGTCAATTCTGTTGCTGTT
ATTTCTGCCGCTCTATGTGATGTTGACCGGCGCTGCTCCTTCAGTGCTACGCGCCGCTCTCATGTCGGGTGTTTACTTAG
CTGGAAGCCTTGTCAAATGGCGTGTCCACTCTGCAACTGCAATTTGTCTTTCATACATCGTCCTTCTGCTCTTCAATCCT
TATCATCTCTTTGAAGCCGGTTTTCAGCTATCGTTCGCCGTCAGTTTTTCTTTAATTCTATCCTCTTCTATTTTTCATCA
GGTTAAAACCTCCTTGGGGCAGCTGACAATTGTATCACTCATCGCTCAGCTGGGCTCGCTTCCGATTCTCCTATATCATT
TTCATCAGTTTTCTATCATCAGCGTACCGATGAATATGTTGATGGTACCATTTTATACCTTCTGTATTTTGCCGGGAGCT
GTAGCAGGTGTTCTTCTATTAAGTCTTTCCGCTTCGTTTGGGAGATTGTTTTTCAGCTGGTTTGATTTATTGATAAGCTG
GACCAATAGGCTAATCACAAACATTGCAGATGTTGAAGTGTTCACGATTATGATCGCACATCCTGCACCTGTTTTGCTTT
TTTTATTCACGGTCACGATCATCCTATTGCTTATGGCGATTGAAAAACGCTCCTTGTCGCAGTTGATGGTAACCGGCGGC
ATTTGCTGCACGGTGCTGTTTCTGCTCTTTATATATCCGTGTCTTAGTTCCGAAGGAGAAGTGGATATGATAGATATTGG
ACAGGGTGACAGCATGTTTGTAGGTGCTCCGCATCAGCGGGGTCGTGTCTTAATTGATACCGGCGGCACTTTGTCTTACT
CGTCAGAGCCTTGGCGCGAAAAACAGCATCCGTTTTCACTGGGGGAAAAGGTGCTGATTCCGTTTTTAACTGCTAAGGGA
ATCAAGCAGCTTGATGCTTTAATTCTGACGCACGCTGACCAAGATCATATCGGAGAGGCGGAGATTCTGCTGAAGCATCA
TAAAGTAAAGCGCCTCGTGATTCCGAAAGGGTTCGTTTCTGAACCTAAAGATGAGAAAGTGCTGCAGGCAGCCAGGGAAG
AGGGAGTGGCAATTGAAGAGGTGAAGCGAGGCGATGTGTTGCAAATAAAGGATTTGCAGTTCCATGTACTGTCACCTGAA
ACACCTGATCCGGCAAGCAAAAATAATTCCTCTCTCGTTCTGTGGATGGAGACGGGCGTTCTGAGCTGGATTTTGACGGG
TGATCTGGAGAAAGAAGGGGAACAGGAGGTGATGGACGTGTTTCCAAATATTAAAGCAGATGTCTTAAAGGTGGGGCACC
ATGGGAGCAAAGGCTCTACCGGTGAAGAATTCATCCAACAGCTTCAGCCGAAAACGGCCATTATCTCAGCCGGGAAAAAC
AATCGGTACTATCATCCTCATCAAAAAGTTCTGCAACTATTACAGAGACATTCTATCCGCGTGCTGCGAACAGATCAAAA
CGGAACGATCCAATATAGATACAAAAACAGAGTTGGAACCTTTTCTGTCTATCCTCCATATGATACATCAGATATAACAG
AGACGAACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168

98.454

100

0.985


Multiple sequence alignment