Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   C0W45_RS05840 Genome accession   NZ_CP029966
Coordinates   1127358..1129472 (-) Length   704 a.a.
NCBI ID   WP_235805391.1    Uniprot ID   -
Organism   Latilactobacillus curvatus strain ZJUNIT8     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1110879..1130101 1127358..1129472 within 0


Gene organization within MGE regions


Location: 1110879..1130101
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  C0W45_RS10535 (C0W45_05760) - 1110879..1111181 (+) 303 Protein_1117 helix-turn-helix domain-containing protein -
  C0W45_RS05765 (C0W45_05765) - 1111302..1112192 (-) 891 WP_338121227.1 IS3 family transposase -
  C0W45_RS05770 (C0W45_05770) - 1112198..1112449 (-) 252 WP_002816285.1 IS3 family transposase -
  C0W45_RS05775 (C0W45_05775) uvrC 1112626..1114404 (-) 1779 WP_065825717.1 excinuclease ABC subunit UvrC -
  C0W45_RS05780 (C0W45_05780) - 1114474..1115205 (-) 732 WP_056966004.1 amino acid ABC transporter ATP-binding protein -
  C0W45_RS05785 (C0W45_05785) - 1115198..1116619 (-) 1422 WP_375163234.1 ABC transporter substrate-binding protein/permease -
  C0W45_RS10305 - 1116797..1116949 (+) 153 WP_004271218.1 SPJ_0845 family protein -
  C0W45_RS05790 (C0W45_05790) - 1117061..1117354 (-) 294 WP_004271114.1 nucleoside triphosphate pyrophosphohydrolase family protein -
  C0W45_RS05795 (C0W45_05795) yihA 1117354..1117953 (-) 600 WP_004271111.1 ribosome biogenesis GTP-binding protein YihA/YsxC -
  C0W45_RS05800 (C0W45_05800) clpX 1118151..1119404 (-) 1254 WP_039099649.1 ATP-dependent Clp protease ATP-binding subunit ClpX -
  C0W45_RS05805 (C0W45_05805) tig 1119612..1120907 (-) 1296 WP_004271115.1 trigger factor -
  C0W45_RS05810 (C0W45_05810) tuf 1121116..1122306 (-) 1191 WP_039099648.1 elongation factor Tu -
  C0W45_RS05815 (C0W45_05815) - 1122526..1123470 (-) 945 WP_111447700.1 hypothetical protein -
  C0W45_RS05820 (C0W45_05820) - 1123553..1125271 (-) 1719 WP_035186941.1 ribonuclease J -
  C0W45_RS05825 (C0W45_05825) rpsO 1125474..1125743 (-) 270 WP_004271112.1 30S ribosomal protein S15 -
  C0W45_RS05830 (C0W45_05830) rpsT 1126020..1126274 (+) 255 WP_004271117.1 30S ribosomal protein S20 -
  C0W45_RS05835 (C0W45_05835) holA 1126329..1127357 (-) 1029 WP_004270758.1 DNA polymerase III subunit delta -
  C0W45_RS05840 (C0W45_05840) comEC 1127358..1129472 (-) 2115 WP_235805391.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  C0W45_RS05845 (C0W45_05845) - 1129613..1130101 (-) 489 WP_111447701.1 ComE operon protein 2 -

Sequence


Protein


Download         Length: 704 a.a.        Molecular weight: 79788.15 Da        Isoelectric Point: 9.3226

>NTDB_id=297965 C0W45_RS05840 WP_235805391.1 1127358..1129472(-) (comEC) [Latilactobacillus curvatus strain ZJUNIT8]
MVSGLVLCLGGCHFWLARRTYEQPVRPNNQVYKIQPDTIKVAGDQIQLMAQGQNDDQRVVCYYRCQSSAEKRRWQTTNQP
LLLAGDIELERIQGATNRNEFDYARFMGQQKHCFYQANIIDGTVFRRIPPQGWLDWFHQRRQQGLIYLRQLPPALSFHAE
ALLCGVRESDGLAYQNVLGQLGIIHLLSLSGLHVFYFVTIIRRLATLCRVPREWVNVSLFCLLPLYALFVGGGTSITRAI
GLILLRLICEVSHFQQSRLDSWSWVLLVNLLWQPYLLISMGGLLSYLMAFGLIYQRRHGTFTTAFWLGLLSLPVCLRFNY
RWHILTILINGIVAPIFLPVMLGLIMVAICLWPVSHMIVWLEEGLLTTLYKGLAWIATFPYATITFGKIQLIPLFLIVIG
TLWLMASTGRRAKWGWGAVISLYVISFIGIHFNPVGRVIMFDIGQGDSLLIQQPFNRHNLLIDTGGRLALPQEKWQRRQI
VSRAEKVTVNYLYSCGIDHLEAIALSHQDADHIGDLNEIMQHIRVKRLICAAGLPQNRQFQRQVRPHLTKVAIEPYLADQ
HFMIGHQQINVLAPVVAGPGGNADSLVLQTQIGGASWLFTGDLEKAGEVAIVDRYPQLRVDYLKVGHHGSQTASDPQSIA
TWQVKGALISAGRHNRYGHPHAATLQTLKRANVPFWNTADCGMLEWRYGFGQVPMIKTTLKDCD

Nucleotide


Download         Length: 2115 bp        

>NTDB_id=297965 C0W45_RS05840 WP_235805391.1 1127358..1129472(-) (comEC) [Latilactobacillus curvatus strain ZJUNIT8]
ATGGTCAGTGGACTGGTACTGTGTTTAGGCGGTTGCCATTTTTGGCTGGCACGGCGCACCTACGAACAGCCGGTTAGACC
AAATAATCAAGTATATAAGATTCAACCCGATACGATCAAAGTGGCTGGCGATCAGATTCAATTAATGGCGCAGGGGCAGA
ATGATGACCAACGTGTTGTTTGCTACTATCGCTGCCAGAGTTCAGCGGAAAAACGCCGTTGGCAAACGACGAATCAGCCG
TTATTATTAGCGGGTGATATTGAATTAGAGCGTATCCAAGGTGCTACTAACCGGAATGAGTTCGACTATGCGCGCTTTAT
GGGGCAGCAAAAGCATTGTTTTTATCAAGCAAATATAATTGACGGAACGGTCTTTAGACGCATTCCACCACAAGGTTGGC
TAGATTGGTTCCACCAACGGCGCCAACAGGGGCTGATTTATCTCCGACAATTGCCACCAGCGTTGAGTTTTCATGCAGAG
GCGTTGTTATGTGGTGTTCGAGAGTCAGATGGCCTGGCCTACCAGAATGTGTTAGGTCAATTGGGGATTATTCATTTATT
GAGTTTATCCGGATTGCATGTATTCTACTTTGTGACAATCATTCGTCGATTGGCAACGCTGTGTCGAGTCCCGCGTGAAT
GGGTGAATGTTAGCTTGTTTTGTTTGCTACCCCTTTATGCGTTGTTTGTGGGGGGAGGCACGAGTATTACACGCGCTATC
GGGCTGATCTTACTACGATTAATCTGCGAAGTAAGTCATTTTCAGCAATCGCGTTTAGATAGTTGGAGTTGGGTATTGCT
GGTCAATCTTTTATGGCAGCCGTATTTATTAATCAGTATGGGGGGATTGCTGAGCTATCTGATGGCTTTTGGATTGATTT
ATCAGCGGCGCCATGGTACTTTTACAACGGCATTTTGGCTAGGCTTGTTAAGTTTACCGGTGTGCTTGCGGTTTAATTAT
CGTTGGCATATCTTGACGATTTTGATTAATGGGATTGTCGCGCCGATTTTCTTACCAGTCATGTTGGGTTTGATTATGGT
AGCGATTTGCTTATGGCCTGTTAGTCATATGATAGTTTGGTTAGAAGAAGGCCTGCTAACGACGCTGTATAAAGGATTAG
CTTGGATTGCAACGTTCCCATATGCGACAATTACTTTCGGTAAAATCCAGCTAATACCACTCTTTTTAATTGTGATAGGC
ACACTATGGTTGATGGCAAGTACTGGTCGGCGAGCAAAGTGGGGCTGGGGGGCTGTTATCAGCCTGTATGTCATTAGTTT
TATTGGTATCCACTTTAATCCGGTTGGCCGCGTGATTATGTTTGATATCGGGCAAGGCGATAGCCTGTTGATTCAACAAC
CCTTCAATCGACATAACCTGTTAATTGACACCGGCGGCCGACTAGCTTTACCGCAAGAAAAGTGGCAACGGCGGCAAATT
GTGAGTCGGGCGGAAAAAGTAACAGTTAATTACCTCTACAGCTGTGGCATTGATCATTTAGAGGCAATCGCATTATCGCA
TCAGGATGCTGATCATATAGGGGACTTAAATGAAATTATGCAACATATCCGAGTTAAACGGTTGATTTGTGCTGCGGGCT
TACCGCAAAACCGGCAGTTTCAACGGCAAGTGCGGCCGCATCTTACAAAAGTTGCAATTGAGCCTTATTTGGCGGACCAA
CATTTTATGATTGGTCACCAACAAATCAATGTGTTAGCGCCAGTGGTGGCCGGACCAGGCGGGAATGCTGACTCACTCGT
TTTGCAAACGCAAATTGGTGGTGCGAGTTGGTTATTTACTGGTGATTTAGAAAAAGCGGGTGAAGTGGCAATCGTTGATC
GCTACCCACAATTACGCGTTGATTACCTAAAGGTGGGTCACCATGGCAGTCAAACGGCGAGTGATCCCCAGTCAATTGCG
ACGTGGCAGGTAAAAGGGGCGTTGATTTCTGCGGGACGCCATAATCGTTATGGTCATCCACATGCCGCAACGTTGCAGAC
TTTAAAACGTGCGAACGTCCCATTTTGGAATACGGCTGACTGTGGGATGCTGGAATGGCGCTATGGTTTTGGCCAAGTGC
CAATGATTAAAACAACACTAAAGGATTGTGACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Latilactobacillus sakei subsp. sakei 23K

62.801

100

0.631


Multiple sequence alignment