Detailed information    

In silico: bioinformatically predicted

Overview


Name: comEC
Type: Machinery gene
Locus tag: MF598_RS07470
Genome accession: NZ_CP092185
Coordinates: 1439603..1441954 (+)
Length: 783 a.a.
NCBI ID: WP_045208284.1
UniProt ID: -
Organism: Bacillus velezensis strain K203
Function: ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
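
The coordinates and the protein length listed above agree with each other, and the arithmetic is easy to reproduce. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends with one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(1439603, 1441954)
print(cds_bp, protein_length(cds_bp))  # 2352 783
```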

Genomic Context


Location: 1434603..1446954
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
MF598_RS07430 (MF598_07430) | yhbY | 1434875..1435165 (+) | 291 | WP_045208288.1 | ribosome assembly RNA-binding protein YhbY | -
MF598_RS07435 (MF598_07435) | - | 1435176..1435745 (+) | 570 | WP_007408258.1 | nicotinate-nucleotide adenylyltransferase | -
MF598_RS07440 (MF598_07440) | yqeK | 1435735..1436295 (+) | 561 | WP_014418422.1 | bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK | -
MF598_RS07445 (MF598_07445) | rsfS | 1436313..1436669 (+) | 357 | WP_007408260.1 | ribosome silencing factor | -
MF598_RS07450 (MF598_07450) | - | 1436666..1437403 (+) | 738 | WP_015417856.1 | class I SAM-dependent methyltransferase | -
MF598_RS07455 (MF598_07455) | comER | 1437472..1438293 (-) | 822 | WP_012118027.1 | late competence protein ComER | -
MF598_RS07460 (MF598_07460) | comEA | 1438352..1438966 (+) | 615 | WP_045208286.1 | helix-hairpin-helix domain-containing protein | Machinery gene
MF598_RS07465 (MF598_07465) | - | 1439033..1439602 (+) | 570 | WP_003152868.1 | ComE operon protein 2 | -
MF598_RS07470 (MF598_07470) | comEC | 1439603..1441954 (+) | 2352 | WP_045208284.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene
MF598_RS07475 (MF598_07475) | - | 1441973..1442107 (-) | 135 | WP_003152870.1 | YqzM family protein | -
MF598_RS07480 (MF598_07480) | - | 1442148..1442299 (+) | 152 | Protein_1471 | hypothetical protein | -
MF598_RS07485 (MF598_07485) | holA | 1442339..1443380 (+) | 1042 | Protein_1472 | DNA polymerase III subunit delta | -
MF598_RS07490 (MF598_07490) | rpsT | 1443397..1443663 (-) | 267 | WP_003152876.1 | 30S ribosomal protein S20 | -
MF598_RS07495 (MF598_07495) | gpr | 1443866..1444972 (+) | 1107 | WP_014418417.1 | GPR endopeptidase | -
MF598_RS07500 (MF598_07500) | - | 1445040..1446233 (+) | 1194 | WP_082998763.1 | stage II sporulation protein P | -
MF598_RS07505 (MF598_07505) | - | 1446250..1446588 (+) | 339 | WP_007408270.1 | YqxA family protein | -
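
Two properties of the listing above can be checked numerically: the Location window matches the comEC coordinates extended by 5000 bp on each side, and each Size value equals end - start + 1. The Python sketch below does this for a few rows copied from the listing. The 5000 bp flank is inferred from the numbers rather than stated on the page, and the helper name is hypothetical. It also flags sizes that are not a multiple of 3, which here are the two entries with placeholder protein IDs (Protein_1471 and Protein_1472); the page does not say why.

```python
# (locus_tag, start, end, size_bp) copied from rows of the listing above.
ROWS = [
    ("MF598_RS07465", 1439033, 1439602, 570),
    ("MF598_RS07470", 1439603, 1441954, 2352),
    ("MF598_RS07480", 1442148, 1442299, 152),
    ("MF598_RS07485", 1442339, 1443380, 1042),
]

FLANK = 5000  # assumption: inferred from the Location line, not documented


def size_matches(start: int, end: int, size_bp: int) -> bool:
    """True if the listed size equals the inclusive coordinate span."""
    return end - start + 1 == size_bp


window = (1439603 - FLANK, 1441954 + FLANK)
sizes_ok = all(size_matches(s, e, n) for _, s, e, n in ROWS)
not_in_frame = [tag for tag, _, _, n in ROWS if n % 3 != 0]

print(window)        # (1434603, 1446954)
print(sizes_ok)      # True
print(not_in_frame)  # ['MF598_RS07480', 'MF598_RS07485']
```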

Sequence


Protein


Length: 783 a.a.; Molecular weight: 86603.52 Da; Isoelectric point: 9.2699

>NTDB_id=656246 MF598_RS07470 WP_045208284.1 1439603..1441954(+) (comEC) [Bacillus velezensis strain K203]
MKYKYLLLPLAAVSATAGIAAAHVFWVLLLFLLYLLFIIVKTKQHAPVIVCLVSFCLYFFLYTVCDAANVTRYQAGSYTE
QAVITNIPKVDGAKISAVIRTHDKEKWAASYKIRSLEEKRLIEQLEPGMRCTFTGSLEQPAHATVPGGFDYKEYLYTQQI
HWLFSVTSIQQCEKSKQPLFKLLNIRKNLISIIRNHVPESSAGIVEALTLGERFSIGDDILSAYQNLGVVHLMAISGMHV
GLITAGLFYALIRIGLTREKAGILLLLFLPVYTLLSGAAPSVLRASLMLGFYIAGTLVKRGIHSSAALSLSYLLLLLFNP
YFLWQAGFQLSFAVSASLILSSSILKKAGKSRLAGLAMASFIAELSSLPFLLYHFQQISLVSFPMNMVMVPFYTLFVIPV
SVIGFLLLLLSRQMGECLFDMFNLVMKPVHDFITYAASVDLFTMIVSKPDFVSLLLLAVSVFTLFAALEKGGFLKLRKSA
LFFCAVLAYLICRPYFSPWGEADMLDIGQGDSLFISAPHRKGTVMVDTGGVIAYPGESWKEKRHPYSIGEKVLIPFLNGK
GVKKLDALILTHADQDHIGEAGVLIKNHRVKRLIVPVGFVKEPKDQNILNMAKENNIPVAEAKRGDTITAGDLQFQVLSP
ESSDGRSKNDSSLVLWTVLGGVSWLLTGDLESDGETEVLKTYPKLKADILKAGHHGSKSSTSEAFLKQLQPEAALISAGK
ENRYHHPHEEVLDRLKAYSVNVLRTDVSGTIQYRFKKGAGTFSVFPPYDIEETRAQEVKKTAD
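
The FASTA header of this record packs several Overview fields into one line. The Python sketch below splits it into named fields with a regular expression. The field layout is inferred only from the headers shown in this record, so the pattern and the group names are assumptions and may not hold for other entries.

```python
import re

# Assumed layout: >NTDB_id=<id> <locus_tag> <protein_id> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) \[(?P<organism>.+)\]$"
)


def parse_header(line: str) -> dict:
    """Return the header fields as a dict; raise ValueError if the layout differs."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not follow the assumed layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (">NTDB_id=656246 MF598_RS07470 WP_045208284.1 "
          "1439603..1441954(+) (comEC) [Bacillus velezensis strain K203]")
record = parse_header(header)
print(record["gene"], record["start"], record["end"], record["strand"])
```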

Nucleotide


Length: 2352 bp

>NTDB_id=656246 MF598_RS07470 WP_045208284.1 1439603..1441954(+) (comEC) [Bacillus velezensis strain K203]
ATGAAATATAAATACCTTCTTCTGCCTCTGGCGGCGGTTTCTGCAACTGCGGGAATTGCCGCCGCTCATGTCTTCTGGGT
TCTGCTCCTTTTTCTTCTGTATCTTCTCTTTATTATTGTAAAAACAAAGCAGCATGCTCCGGTTATCGTCTGCCTCGTTT
CTTTTTGTCTTTATTTCTTTCTTTATACGGTTTGTGACGCTGCGAATGTAACGCGTTATCAGGCCGGCAGTTATACTGAA
CAGGCCGTCATCACTAATATTCCGAAAGTTGACGGAGCGAAAATATCAGCTGTTATCCGTACACATGACAAGGAAAAATG
GGCGGCTTCGTACAAAATCCGGTCACTTGAGGAAAAGAGACTCATTGAACAGCTCGAACCGGGGATGCGCTGCACGTTTA
CAGGCTCTCTGGAACAGCCTGCACATGCGACGGTTCCCGGAGGTTTTGATTATAAGGAATATCTTTACACTCAGCAAATT
CACTGGTTATTTTCCGTGACTTCCATTCAGCAGTGTGAAAAATCCAAACAGCCGCTGTTTAAACTGCTGAACATCAGAAA
AAATTTGATTTCGATCATTCGGAATCATGTGCCTGAATCTTCCGCCGGAATTGTTGAAGCGCTGACCTTAGGTGAAAGAT
TTTCTATAGGGGACGATATACTGAGTGCATATCAAAATTTGGGAGTCGTTCATTTAATGGCGATTTCCGGAATGCATGTC
GGTCTTATTACGGCGGGATTATTTTATGCTCTGATCAGAATCGGGCTGACAAGAGAAAAAGCGGGAATTTTGCTGCTGCT
GTTTTTGCCGGTGTATACGCTGCTGAGCGGTGCCGCCCCATCCGTATTGCGCGCATCCCTCATGCTGGGATTTTATATCG
CCGGAACTCTTGTTAAACGCGGCATTCATTCCTCTGCTGCATTGTCCCTGTCTTATCTGCTGCTCCTGCTGTTTAATCCT
TACTTCCTTTGGCAGGCGGGCTTCCAGCTTTCCTTTGCGGTAAGCGCTTCTTTAATTCTGTCATCCTCCATTTTAAAGAA
AGCGGGGAAAAGCAGACTTGCCGGGCTTGCGATGGCTTCATTTATTGCAGAACTCAGCTCACTTCCGTTTCTTCTCTATC
ATTTTCAGCAGATTTCACTTGTCAGTTTTCCGATGAATATGGTGATGGTGCCATTTTATACGTTATTTGTCATTCCGGTT
TCTGTCATCGGTTTCCTTCTTCTTTTACTTTCGAGGCAGATGGGAGAATGTTTGTTTGATATGTTCAACCTTGTGATGAA
GCCTGTGCATGATTTCATTACATATGCGGCATCCGTTGATTTATTTACTATGATTGTGTCAAAGCCTGACTTTGTTTCCC
TTCTTCTGCTTGCGGTTTCCGTTTTTACGCTTTTTGCGGCTTTGGAAAAGGGAGGTTTTTTAAAACTCAGGAAATCGGCT
CTTTTTTTCTGCGCGGTTTTGGCTTATTTAATATGCCGTCCGTATTTCAGTCCATGGGGAGAAGCGGATATGCTTGATAT
CGGGCAGGGAGACTCACTGTTTATAAGCGCGCCGCACCGCAAAGGGACTGTAATGGTTGATACAGGGGGAGTGATTGCTT
ATCCCGGAGAATCATGGAAAGAAAAACGCCACCCGTATTCTATCGGCGAGAAGGTTTTGATTCCGTTTTTAAACGGAAAA
GGGGTGAAAAAGCTGGATGCGCTGATTTTAACCCATGCGGATCAAGATCACATCGGAGAAGCCGGAGTGTTAATCAAAAA
TCATAGAGTCAAACGGTTAATTGTCCCCGTGGGATTCGTAAAAGAACCGAAAGATCAGAACATATTAAATATGGCGAAAG
AAAACAACATTCCCGTTGCCGAAGCAAAGCGGGGCGACACCATTACAGCCGGTGATCTTCAGTTTCAGGTGCTGTCTCCG
GAGTCGTCTGACGGAAGGAGTAAAAATGATTCGTCACTGGTGCTTTGGACGGTTTTAGGCGGAGTGAGCTGGCTTTTGAC
GGGAGATTTAGAATCGGACGGCGAAACGGAAGTGCTGAAAACGTATCCGAAACTGAAGGCTGATATATTGAAGGCGGGTC
ATCACGGCAGCAAAAGCTCTACGAGTGAAGCCTTTTTGAAACAGCTTCAGCCGGAAGCAGCGCTGATTTCAGCAGGAAAA
GAGAATCGATACCATCATCCGCATGAAGAAGTGCTGGATCGTTTGAAGGCGTACTCTGTCAATGTGCTTCGCACCGATGT
CAGCGGAACGATTCAATACAGATTTAAAAAAGGCGCCGGAACGTTTTCCGTCTTCCCTCCATATGATATAGAAGAAACCA
GGGCGCAAGAAGTAAAAAAGACTGCCGATTGA
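
The nucleotide sequence above encodes the protein sequence listed earlier. As a quick sanity check, the Python sketch below translates the first 30 and the last 12 nucleotides with the standard codon assignments and compares them to the start (MKYKYLLLPL) and end (TAD) of the protein. It is a minimal illustration, not the database's own pipeline, and the function and variable names are hypothetical.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon; trailing bases are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_30 = "ATGAAATATAAATACCTTCTTCTGCCTCTG"  # start of the sequence above
last_12 = "ACTGCCGATTGA"                     # end of the sequence above

print(translate(first_30))  # MKYKYLLLPL
print(translate(last_12))   # TAD*
```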


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



[Figure: predicted probability of transmembrane helices]


Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comEC | Bacillus subtilis subsp. subtilis str. 168 | 56.736 | 98.595 | 0.559
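
The page does not define the Ha-value. The listed figure is numerically consistent with the product of identity and coverage expressed as fractions, so the Python sketch below uses that formula as an assumption rather than as the database's documented definition.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed formula: identity fraction multiplied by coverage fraction."""
    return (identity_pct / 100) * (coverage_pct / 100)


# Values from the comEC hit against Bacillus subtilis subsp. subtilis str. 168.
print(round(ha_value(56.736, 98.595), 3))  # 0.559
```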