Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   CU052_RS00040 Genome accession   NZ_CP025537
Coordinates   5938..8193 (-) Length   751 a.a.
NCBI ID   WP_101904055.1    Uniprot ID   -
Organism   Vibrio harveyi strain 345     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 938..13193
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CU052_RS00015 (CU052_00415) - 1663..1842 (-) 180 WP_005378451.1 Trm112 family protein -
  CU052_RS00020 (CU052_00420) lpxK 1823..2830 (-) 1008 WP_101904052.1 tetraacyldisaccharide 4'-kinase -
  CU052_RS00025 (CU052_00425) msbA 2836..4584 (-) 1749 WP_050936084.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
  CU052_RS29695 (CU052_00430) - 4910..5194 (+) 285 WP_101904053.1 glycosyltransferase -
  CU052_RS00035 (CU052_00435) - 5214..5900 (+) 687 WP_265093699.1 glycosyltransferase -
  CU052_RS00040 (CU052_00440) comEC 5938..8193 (-) 2256 WP_101904055.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  CU052_RS00045 (CU052_00445) - 8202..8711 (+) 510 WP_005446692.1 DUF2062 domain-containing protein -
  CU052_RS00050 (CU052_00450) lolE 8909..10153 (-) 1245 WP_009698315.1 lipoprotein-releasing ABC transporter permease subunit LolE -
  CU052_RS00055 (CU052_00455) lolD 10156..10863 (-) 708 WP_005446689.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  CU052_RS00060 (CU052_00460) lolC 10856..12064 (-) 1209 WP_026000130.1 lipoprotein-releasing ABC transporter permease subunit LolC -
  CU052_RS00065 (CU052_00465) - 12329..12898 (+) 570 WP_009698318.1 PilZ domain-containing protein -

Sequence


Protein


Download         Length: 751 a.a.        Molecular weight: 84396.89 Da        Isoelectric Point: 9.6792

>NTDB_id=262378 CU052_RS00040 WP_101904055.1 5938..8193(-) (comEC) [Vibrio harveyi strain 345]
MTLLEKSLTLALFVASVISSAWWPTIPDWRWLLLGIIATGSIIKLRRGLLSIGVISGFMVVIIHGNVMEHQRQALFQAGV
NITINGKVDSPFTQISHGYEGIARVHQVNSQNLLPFFKPKIRLITPFPLAVNSEFTTEVTIKPIFGLLNEAGHDAEKQAV
GKGIVARATVSKDSAWLIRERSSLRTQIIAVVHKHIVQLEHFALIRALAFSDRTLLSRYDWQLLRDSGLLHLVSISGLHI
GMAFAFGMSFGVVVRYALPKFVFLPSLFGLATAFLYSWLADFSLPTTRAFSVCLIYLLLKSALIYWSAWRVLLLAVAIQL
CIEPFSALTMSFWLSYLSVIAVLFAVNCVQHSRGNWIRKLGTLFKIQLVLTVLIIPISGLFFAGTSLSSILYNLIFIPWF
GFVVVPLMFVALIITPFSVHLANMLWQWLDWMLVPLTWSLPFALGSWQSLSSQATLWVLALGVCVLSMRFLNRETSGVLF
LVITSLALWYERKSDGWRIDVLDVGHGLAVLLEKEGEVLLYDTGKTWAYGSIAEQVIAPILYRRGFGSIDMFVVSHADSD
HAGGRAYIERHFAPVRKFSSQNYANYQPCIAGERWKWQALEFEVLWPPKLVKRAYNPHSCVIRVVDTKTDFKLLLTGDIE
AVSEWILVRNPDQLKSDVVIVPHHGSKSSSNPKFVEAIAPKLAIASLAKGNQWGMPANNVVLAYENANAKWLDTGNGGQI
SVLIEQENWYFETKRSETFDPWYRQMLRNGN

Nucleotide


Download         Length: 2256 bp        

>NTDB_id=262378 CU052_RS00040 WP_101904055.1 5938..8193(-) (comEC) [Vibrio harveyi strain 345]
ATGACTCTCTTAGAAAAAAGTTTGACCTTGGCTTTATTTGTAGCGAGCGTTATTTCGTCTGCATGGTGGCCGACGATACC
AGATTGGCGTTGGTTGCTGCTGGGAATAATTGCCACTGGCTCAATAATAAAATTACGACGTGGCTTATTGAGCATAGGCG
TAATTTCGGGCTTTATGGTTGTCATTATCCACGGCAATGTTATGGAGCATCAAAGACAAGCCCTGTTTCAAGCAGGGGTG
AATATTACCATAAATGGCAAAGTTGACAGCCCTTTTACGCAAATAAGTCACGGATATGAAGGAATTGCGCGTGTCCATCA
GGTGAATTCTCAAAACTTGTTACCTTTTTTTAAACCGAAAATCCGGTTGATAACGCCTTTCCCACTCGCCGTTAACAGTG
AGTTCACTACCGAAGTGACAATTAAACCCATCTTTGGGCTTTTAAACGAAGCAGGTCATGACGCCGAAAAGCAGGCAGTA
GGAAAGGGCATTGTCGCAAGGGCGACGGTTTCAAAGGATTCTGCGTGGCTTATTCGTGAGCGATCATCATTAAGAACGCA
AATTATCGCCGTCGTTCATAAGCACATTGTTCAGCTTGAACATTTCGCTCTTATTCGTGCTTTGGCGTTTAGTGACCGTA
CGCTTCTGTCTCGATATGACTGGCAACTCCTACGTGATAGTGGCTTACTGCATTTGGTTTCGATTTCGGGTTTACATATA
GGAATGGCGTTTGCTTTTGGTATGAGCTTTGGGGTTGTTGTTAGGTACGCTTTGCCAAAATTTGTCTTTTTACCTTCCTT
GTTTGGGCTTGCTACTGCTTTTTTGTATTCATGGTTAGCAGACTTCTCTTTGCCTACGACTCGAGCTTTTTCAGTATGTT
TGATTTACTTGTTATTAAAGTCTGCTTTGATTTATTGGAGCGCTTGGCGCGTGCTGCTCCTTGCTGTAGCAATACAGTTG
TGCATCGAGCCGTTTTCTGCACTCACTATGAGCTTTTGGCTGTCTTATCTCTCTGTTATCGCCGTTTTATTTGCAGTGAA
TTGTGTACAACATAGCCGTGGCAATTGGATTAGGAAACTGGGTACCTTATTCAAAATTCAGTTAGTACTCACTGTGTTGA
TCATTCCGATTAGTGGGTTGTTTTTTGCCGGGACGAGCCTCTCCTCTATTTTATATAACCTCATTTTTATCCCTTGGTTC
GGTTTTGTTGTCGTTCCACTGATGTTTGTTGCACTTATCATTACCCCGTTTTCAGTGCACTTGGCGAACATGCTGTGGCA
GTGGTTAGATTGGATGCTTGTACCCCTTACTTGGTCTTTGCCATTTGCTTTGGGGAGTTGGCAATCTCTTAGCTCACAGG
CAACGTTGTGGGTGCTGGCATTGGGCGTTTGTGTGCTTTCGATGCGATTTTTAAATCGAGAAACCTCAGGCGTTTTGTTC
TTGGTCATCACAAGTCTGGCGTTGTGGTATGAACGAAAATCCGATGGTTGGCGTATTGATGTGTTAGATGTTGGGCATGG
ACTTGCAGTGCTCCTAGAAAAAGAAGGTGAAGTGCTTTTATATGACACGGGTAAAACATGGGCTTATGGAAGTATTGCTG
AACAAGTTATTGCTCCTATCTTGTATCGCAGAGGGTTCGGTTCTATCGACATGTTTGTCGTCAGTCACGCCGACTCAGAT
CATGCTGGAGGACGTGCATATATCGAAAGGCACTTCGCCCCAGTTCGTAAGTTTAGTAGCCAAAACTACGCTAACTACCA
ACCTTGTATTGCAGGGGAACGATGGAAATGGCAAGCGCTAGAATTTGAAGTTCTTTGGCCTCCGAAGTTGGTTAAACGTG
CATATAACCCACATTCATGCGTGATTCGTGTCGTCGATACCAAAACGGATTTTAAGTTGCTATTAACGGGCGATATTGAA
GCCGTTAGTGAATGGATCTTGGTGAGAAACCCCGATCAGTTAAAAAGTGATGTTGTGATTGTCCCGCATCACGGAAGTAA
AAGCTCTTCTAACCCCAAGTTTGTTGAAGCGATTGCCCCAAAACTGGCGATCGCATCTCTGGCAAAAGGGAATCAGTGGG
GAATGCCTGCAAATAACGTGGTTCTAGCGTACGAGAACGCCAATGCGAAATGGCTGGATACGGGGAATGGCGGTCAAATA
AGCGTCCTTATTGAGCAAGAGAATTGGTATTTTGAAACGAAACGAAGTGAGACATTTGACCCTTGGTATAGGCAGATGCT
GCGTAACGGAAATTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Vibrio campbellii strain DS40M4

75.434

99.734

0.752

  comEC Vibrio parahaemolyticus RIMD 2210633

65.154

99.734

0.65

  comEC Vibrio cholerae strain A1552

40.559

100

0.406


Multiple sequence alignment