Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   HPY04_RS05010 Genome accession   NZ_CP053744
Coordinates   1081550..1083793 (+) Length   747 a.a.
NCBI ID   WP_260607233.1    Uniprot ID   -
Organism   Vibrio cholerae strain SA3G     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1076550..1088793
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  HPY04_RS04985 (HPY04_04985) - 1077013..1077588 (-) 576 WP_000999601.1 PilZ domain-containing protein -
  HPY04_RS04990 (HPY04_04990) lolC 1077764..1078972 (+) 1209 WP_000468893.1 lipoprotein-releasing ABC transporter permease subunit LolC -
  HPY04_RS04995 (HPY04_04995) lolD 1078965..1079651 (+) 687 WP_001061290.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  HPY04_RS05000 (HPY04_05000) lolE 1079652..1080896 (+) 1245 WP_000493013.1 lipoprotein-releasing ABC transporter permease subunit LolE -
  HPY04_RS05005 (HPY04_05005) - 1081011..1081541 (-) 531 WP_001881633.1 DUF2062 domain-containing protein -
  HPY04_RS05010 (HPY04_05010) comEC 1081550..1083793 (+) 2244 WP_260607233.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  HPY04_RS05015 (HPY04_05015) msbA 1083824..1085572 (+) 1749 WP_000052157.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
  HPY04_RS05020 (HPY04_05020) lpxK 1085575..1086582 (+) 1008 WP_032467952.1 tetraacyldisaccharide 4'-kinase -
  HPY04_RS05025 (HPY04_05025) - 1086563..1086742 (+) 180 WP_000350068.1 Trm112 family protein -
  HPY04_RS05030 (HPY04_05030) kdsB 1086742..1087500 (+) 759 WP_000011327.1 3-deoxy-manno-octulosonate cytidylyltransferase -

Sequence


Protein


Download         Length: 747 a.a.        Molecular weight: 84419.57 Da        Isoelectric Point: 7.7612

>NTDB_id=446781 HPY04_RS05010 WP_260607233.1 1081550..1083793(+) (comEC) [Vibrio cholerae strain SA3G]
MTLLSNYWSLISFSITVLSAPYWPWMPSWGWAWLCLIVMVLLGYHRVGRQFLGFVAAILTIVLQGNLIRDQSNVLYQAGP
DIIIKGRVDSFFTQTRYAYEGFVLIHEVNGQTLNKMTRPRIRLSAPLLLQPNDRVEFSVTLKPIVGRLNQTGFDLEAHYM
AQSVVARAVVKPDTAYQIVQESGIRSSLFFELEQLTHTSPYQGLILALTFGERKGIDEQEWQALRNSGLIHLVAISGLHI
GIAFSVGYFLGLGMMRLHAQLLWSPFVCGALLAVLYAWLAGFTLPTQRALIMCLLNVALIMLAFPLSALKRILLTLGAVL
LWSPFASLSNSFWMSFLAVAIVLYQLASQSQRQVWWKALLWAQVFLVCLMAPVTAYFFGGLSVTAVLYNLVFIPWFSLVI
VPALFLGLLLMVVWPSVAAAYWPWVDWTFLPLDWALQFADVGWWVVPSKVQGVVAASVAILLLYRFMSLKACSLLLGMMG
LWWWFPSLTPLWRMDVLDVGHGLAIVIEQDERAIVYDTGSSWPGGSYVQSVIEPILQQRGLRQVDGVILSHLDNDHAGDW
QGLAERWQPNWIRASQLGTEFMPCIRGESWQWQSLHFTVLWPPQAVSRAYNQHSCVIRMTDTQSNHSVLLSGDVTAMGEW
LLARDGAQLQSEVMIVPHHGSKTSSTAEFIAQVNPKLAIASVAKDNRWNLPNPQVVARYQAQQVEWLDTGHAGQISLFFY
PDQLDWFTQRSLGWQPWYRQMLRKGVE

Nucleotide


Download         Length: 2244 bp        

>NTDB_id=446781 HPY04_RS05010 WP_260607233.1 1081550..1083793(+) (comEC) [Vibrio cholerae strain SA3G]
ATGACTCTCTTGTCGAATTACTGGTCTCTCATTTCTTTTTCGATCACCGTGCTGTCTGCCCCTTATTGGCCTTGGATGCC
GAGTTGGGGTTGGGCTTGGTTATGCCTTATTGTTATGGTTTTGCTCGGTTATCACCGAGTTGGCCGTCAATTCCTTGGCT
TCGTGGCTGCCATACTAACCATTGTGCTACAGGGCAACCTTATACGAGATCAATCCAATGTGCTCTATCAAGCAGGGCCG
GATATTATCATAAAAGGCCGTGTTGACAGCTTTTTTACGCAAACTCGTTACGCTTATGAGGGTTTTGTCCTCATTCATGA
AGTGAATGGACAAACCTTAAACAAAATGACTCGCCCTCGCATACGTTTAAGTGCCCCTTTACTGTTACAACCCAATGACC
GCGTCGAATTTTCGGTAACTCTCAAGCCGATAGTAGGTCGACTCAACCAAACCGGCTTTGATTTAGAAGCGCATTACATG
GCGCAATCTGTCGTCGCACGGGCGGTCGTAAAACCTGACACCGCTTATCAAATTGTGCAAGAGAGTGGCATAAGGTCAAG
TTTGTTTTTTGAGCTGGAACAATTAACGCATACCAGCCCATACCAAGGATTGATCTTAGCCCTGACGTTTGGCGAGCGAA
AAGGTATTGATGAGCAAGAGTGGCAAGCCTTACGCAATAGTGGCTTAATTCATTTAGTGGCGATTTCGGGGCTGCACATT
GGTATCGCTTTTAGCGTGGGTTATTTTCTCGGGCTCGGCATGATGCGTCTTCATGCTCAGTTATTGTGGTCCCCTTTTGT
GTGTGGGGCTTTACTGGCGGTGCTCTACGCTTGGCTGGCCGGATTTACGTTGCCTACTCAGCGTGCATTAATTATGTGCT
TACTCAATGTGGCGTTGATCATGTTGGCTTTTCCTCTTTCCGCGCTCAAGCGGATTCTACTCACCTTAGGCGCGGTCTTG
CTTTGGTCGCCATTCGCCTCACTTTCAAACAGTTTCTGGATGTCGTTTTTGGCGGTCGCGATTGTTCTCTACCAATTAGC
CAGTCAAAGCCAGCGTCAGGTGTGGTGGAAAGCTCTGCTTTGGGCGCAGGTGTTCCTCGTCTGTTTAATGGCGCCGGTCA
CGGCCTATTTTTTCGGTGGCTTAAGCGTAACGGCAGTTCTGTACAATTTGGTGTTTATTCCTTGGTTTTCGTTGGTGATT
GTCCCAGCTTTGTTTTTGGGTCTATTACTCATGGTGGTATGGCCTAGTGTGGCCGCCGCTTACTGGCCTTGGGTGGATTG
GACGTTTTTACCGCTCGATTGGGCTTTGCAGTTTGCCGATGTAGGCTGGTGGGTGGTCCCCAGCAAAGTACAAGGTGTGG
TCGCAGCGAGTGTAGCCATCCTCTTGCTTTATCGATTTATGAGCCTAAAAGCCTGCAGCTTATTATTGGGTATGATGGGC
TTATGGTGGTGGTTTCCCTCTCTCACTCCACTTTGGCGAATGGATGTGCTGGATGTTGGACATGGCTTGGCGATTGTGAT
TGAGCAAGATGAGCGAGCAATTGTCTACGATACAGGCAGCAGTTGGCCGGGAGGCAGTTATGTGCAAAGCGTGATTGAGC
CTATTCTCCAACAGCGAGGGCTACGCCAAGTCGATGGAGTGATTTTAAGTCATCTTGATAATGATCATGCGGGTGATTGG
CAAGGTTTAGCTGAGCGCTGGCAACCCAATTGGATTCGAGCCAGCCAACTCGGGACAGAGTTTATGCCTTGTATCCGTGG
TGAAAGCTGGCAGTGGCAATCTCTCCATTTTACGGTGTTATGGCCACCACAAGCGGTTAGCCGAGCGTACAACCAGCATT
CGTGTGTGATTCGTATGACCGATACTCAGTCTAACCATTCTGTACTGCTCTCCGGGGATGTCACAGCCATGGGGGAGTGG
CTGCTTGCTCGCGACGGAGCGCAACTGCAAAGTGAGGTGATGATCGTGCCGCATCACGGCAGTAAAACGTCGTCCACCGC
AGAGTTTATTGCCCAAGTGAATCCCAAACTTGCGATTGCTTCTGTGGCGAAAGATAACCGCTGGAATTTGCCTAATCCGC
AAGTCGTGGCACGTTATCAAGCTCAGCAAGTTGAGTGGCTAGATACTGGACACGCTGGGCAAATTAGCCTCTTTTTCTAT
CCAGATCAGCTGGATTGGTTTACCCAGCGTAGCCTTGGCTGGCAGCCTTGGTATAGGCAGATGCTGCGTAAAGGAGTAGA
ATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Vibrio cholerae strain A1552

99.331

100

0.993

  comEC Vibrio parahaemolyticus RIMD 2210633

41.347

100

0.419

  comEC Vibrio campbellii strain DS40M4

41.09

100

0.414