Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   U471_RS11870 Genome accession   NC_022653
Coordinates   2504107..2506458 (-) Length   783 a.a.
NCBI ID   WP_012118025.1    Uniprot ID   A7Z6X1
Organism   Bacillus amyloliquefaciens CC178     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2499107..2511458
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  U471_RS11840 (U471_24640) - 2499473..2499811 (-) 339 WP_007408270.1 YqxA family protein -
  U471_RS11845 (U471_24650) spoIIP 2499828..2501021 (-) 1194 WP_007612673.1 stage II sporulation protein P -
  U471_RS11850 (U471_24660) gpr 2501089..2502195 (-) 1107 WP_007408268.1 GPR endopeptidase -
  U471_RS11855 (U471_24670) rpsT 2502398..2502664 (+) 267 WP_003152876.1 30S ribosomal protein S20 -
  U471_RS11860 (U471_24680) holA 2502681..2503722 (-) 1042 Protein_2347 DNA polymerase III subunit delta -
  U471_RS19785 - 2503762..2503913 (-) 152 Protein_2348 hypothetical protein -
  U471_RS19410 (U471_24690) - 2503954..2504088 (+) 135 WP_003152870.1 YqzM family protein -
  U471_RS11870 (U471_24700) comEC 2504107..2506458 (-) 2352 WP_012118025.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  U471_RS11875 (U471_24710) - 2506459..2507028 (-) 570 WP_003152868.1 ComE operon protein 2 -
  U471_RS11880 (U471_24720) comEA 2507095..2507709 (-) 615 WP_012118026.1 helix-hairpin-helix domain-containing protein Machinery gene
  U471_RS11885 (U471_24730) comER 2507768..2508589 (+) 822 WP_012118027.1 late competence protein ComER -
  U471_RS11890 (U471_24740) - 2508658..2509395 (-) 738 WP_012118028.1 class I SAM-dependent DNA methyltransferase -
  U471_RS11895 (U471_24750) rsfS 2509392..2509748 (-) 357 WP_007408260.1 ribosome silencing factor -
  U471_RS11900 (U471_24760) yqeK 2509765..2510325 (-) 561 WP_012118029.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  U471_RS11905 (U471_24770) - 2510315..2510884 (-) 570 WP_012118030.1 nicotinate-nucleotide adenylyltransferase -
  U471_RS11910 (U471_24780) yhbY 2510895..2511185 (-) 291 WP_007408257.1 ribosome assembly RNA-binding protein YhbY -

Sequence


Protein


Download         Length: 783 a.a.        Molecular weight: 86682.54 Da        Isoelectric Point: 9.0968

>NTDB_id=63177 U471_RS11870 WP_012118025.1 2504107..2506458(-) (comEC) [Bacillus amyloliquefaciens CC178]
MKYKYLLLPLAAVSATAGIAAAHVFWVLLLFLLYLLFIIVKTKQYAPVIVCLVSFCVYFFLYTVCDAANVTRYQAGSYTE
QAVITNIPKVDGAKMSAVIRTHDKEKWAASYKIRSLEEKRLIEQLEPGMRCTFTGSLEQPAHATVPGGFDYKEYLYSQQI
HWLFSVTSIQQCEKSKQPLFKLLNIRKNLISIIRNHVPESSAGIVEALTLGERFSIEDDILSAYQNLGVVHLMAISGMHV
GLITAGLFYALIRIGLTREKAGILLLLFLPVYTLLSGAAPSVLRASLMLGFYIAGTLVKRGIHSSAALSLSYLLLLLFNP
YFLWQAGFQLSFAVSASLILSSSILKKAGKSRLAGLAMASFIAELSSLPFLLYHFQQISLVSFPMNMVMVPFYTLFVIPV
SVIGFLLLLLSRQMGECLFGMFDLVMKPVHDFITYAASVDLFTMIVSKPDFLSLLLLAVSVFTLFAALEKGGFLKLRKSA
LFFCAVLAYLICRPYFSPWGEADMLDIGQGDSLFISAPHRKGTVMVDTGGVIAYPGESWKEKRHPYSIGEKVLIPFLNGK
GVKKLDALILTHADQDHIGEAGVLIKNHRVKRLIVPVGFVKEPKDQNILNMAKENNIPVAEAKRGDTITAGDLQFQVLSP
ESSDGRSKNDSSLVLWTVFGGVSWLLTGDLESDGETEVLKTYPNLKADILKAGHHGSKSSTSEAFLKQLQPEAALISAGK
ENRYHHPHEEVLDRLKAYSVNVLRTDISGTIQYRFKKGAGTFSVFPPYDIEETRAQEVKKTAD

Nucleotide


Download         Length: 2352 bp        

>NTDB_id=63177 U471_RS11870 WP_012118025.1 2504107..2506458(-) (comEC) [Bacillus amyloliquefaciens CC178]
ATGAAATATAAATACCTTCTTCTGCCTCTGGCGGCGGTTTCTGCAACTGCGGGAATTGCCGCCGCTCATGTCTTCTGGGT
TCTGCTCCTTTTTCTTCTGTATCTTCTCTTTATTATTGTAAAAACAAAGCAGTATGCTCCGGTTATCGTCTGCCTCGTTT
CTTTTTGTGTTTATTTCTTTCTTTATACGGTTTGTGACGCTGCGAATGTAACGCGCTATCAGGCCGGCAGTTATACTGAA
CAGGCCGTCATCACTAATATTCCGAAGGTTGACGGAGCGAAAATGTCAGCCGTTATCCGTACACATGACAAGGAAAAATG
GGCGGCTTCGTACAAAATCCGGTCTCTTGAGGAAAAGAGACTCATTGAACAGCTTGAACCGGGGATGCGCTGCACATTTA
CAGGCTCTCTGGAACAGCCTGCACATGCGACGGTTCCCGGAGGTTTTGATTATAAGGAATATCTTTACTCTCAGCAGATT
CACTGGTTATTTTCCGTGACTTCCATTCAGCAGTGTGAAAAATCCAAACAGCCGCTGTTTAAACTGCTGAACATCAGAAA
AAATTTGATTTCGATCATTCGGAATCACGTGCCTGAATCTTCCGCCGGAATTGTTGAAGCGCTGACCTTAGGTGAAAGAT
TTTCTATAGAGGACGATATACTGAGTGCATATCAAAATTTGGGAGTCGTTCATTTAATGGCGATTTCCGGAATGCATGTC
GGTCTTATTACGGCGGGATTATTTTATGCTCTGATCAGAATCGGGCTGACAAGAGAAAAAGCAGGAATTTTGCTGCTGCT
GTTTTTGCCGGTGTATACGCTGCTGAGCGGTGCCGCCCCATCCGTATTGCGCGCATCCCTCATGCTGGGATTTTATATCG
CCGGAACTCTTGTTAAACGCGGCATTCATTCCTCTGCTGCATTGTCCCTGTCTTATCTGCTGCTCCTGCTGTTTAATCCT
TACTTCCTTTGGCAGGCGGGCTTCCAGCTTTCCTTTGCGGTAAGCGCCTCTTTAATTCTTTCATCCTCCATTTTAAAGAA
AGCAGGGAAAAGCAGACTTGCCGGGCTTGCGATGGCCTCATTCATTGCAGAGCTCAGCTCACTTCCGTTTCTTCTCTATC
ATTTTCAGCAGATTTCACTTGTCAGTTTTCCGATGAATATGGTGATGGTGCCATTTTATACGTTATTTGTCATTCCGGTT
TCTGTCATCGGTTTCCTTCTTCTTTTACTCTCAAGGCAGATGGGAGAATGTTTGTTTGGTATGTTTGACCTTGTGATGAA
GCCTGTGCATGATTTCATTACATATGCGGCATCCGTTGATTTATTTACTATGATTGTGTCAAAGCCTGACTTTCTTTCCC
TTCTTCTGCTTGCGGTTTCCGTTTTTACGCTTTTTGCGGCTTTGGAAAAGGGAGGTTTTTTAAAACTCAGGAAATCGGCT
CTTTTTTTCTGCGCGGTTTTGGCTTATTTAATATGCCGTCCGTATTTCAGTCCATGGGGAGAAGCGGATATGCTTGATAT
CGGGCAGGGAGACTCACTGTTTATAAGCGCGCCGCACCGCAAAGGGACCGTAATGGTTGATACAGGGGGAGTGATTGCTT
ATCCCGGAGAATCATGGAAAGAAAAACGCCACCCGTATTCTATCGGCGAGAAGGTTTTGATTCCATTTTTAAACGGAAAA
GGGGTGAAAAAGCTGGATGCACTGATTTTAACCCATGCGGATCAAGATCACATCGGGGAAGCCGGAGTGTTAATCAAAAA
TCATAGAGTCAAACGGTTAATTGTCCCCGTGGGATTCGTAAAAGAACCGAAGGATCAGAACATATTAAATATGGCGAAAG
AAAACAACATTCCCGTTGCCGAAGCAAAGCGAGGCGACACCATTACAGCCGGTGATCTTCAGTTTCAGGTGCTGTCTCCG
GAGTCGTCTGACGGAAGGAGTAAAAATGATTCGTCACTGGTGCTTTGGACGGTTTTTGGCGGAGTGAGCTGGCTTTTGAC
GGGAGATTTAGAATCGGACGGCGAAACAGAAGTGCTGAAAACGTATCCGAATCTGAAGGCTGATATATTGAAGGCGGGTC
ATCACGGCAGCAAAAGCTCAACGAGTGAAGCCTTTTTGAAGCAGCTTCAGCCGGAAGCGGCGCTGATTTCAGCAGGAAAA
GAGAATCGATACCATCATCCGCATGAAGAAGTGCTGGATCGTTTGAAGGCGTACTCTGTAAATGTGCTTCGCACCGATAT
CAGCGGAACGATTCAATACAGATTTAAAAAAGGCGCCGGAACGTTTTCCGTCTTCCCTCCATATGATATAGAAGAAACCA
GGGCGCAAGAAGTAAAAAAGACTGCCGATTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A7Z6X1

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168

56.995

98.595

0.562


Multiple sequence alignment