Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   ACG4I4_RS05165 Genome accession   NZ_CP171676
Coordinates   1089602..1091845 (+) Length   747 a.a.
NCBI ID   WP_000173776.1    Uniprot ID   -
Organism   Vibrio cholerae strain CRC1106     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1084602..1096845
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACG4I4_RS05140 - 1085065..1085640 (-) 576 WP_000999601.1 PilZ domain-containing protein -
  ACG4I4_RS05145 lolC 1085816..1087024 (+) 1209 WP_000468900.1 lipoprotein-releasing ABC transporter permease subunit LolC -
  ACG4I4_RS05150 lolD 1087017..1087703 (+) 687 WP_001061290.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  ACG4I4_RS05155 lolE 1087704..1088948 (+) 1245 WP_000493010.1 lipoprotein-releasing ABC transporter permease subunit LolE -
  ACG4I4_RS05160 - 1089063..1089593 (-) 531 WP_001881633.1 DUF2062 domain-containing protein -
  ACG4I4_RS05165 comEC 1089602..1091845 (+) 2244 WP_000173776.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  ACG4I4_RS05170 msbA 1091876..1093624 (+) 1749 WP_000052153.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
  ACG4I4_RS05175 lpxK 1093627..1094634 (+) 1008 WP_001994134.1 tetraacyldisaccharide 4'-kinase -
  ACG4I4_RS05180 - 1094615..1094794 (+) 180 WP_000350068.1 Trm112 family protein -
  ACG4I4_RS05185 kdsB 1094794..1095552 (+) 759 WP_000011329.1 3-deoxy-manno-octulosonate cytidylyltransferase -

Sequence


Protein


Download         Length: 747 a.a.        Molecular weight: 84511.71 Da        Isoelectric Point: 7.7612

>NTDB_id=1063013 ACG4I4_RS05165 WP_000173776.1 1089602..1091845(+) (comEC) [Vibrio cholerae strain CRC1106]
MTLLSNYWSLISFSITVLSAPYWPWMPSWGWAWLCLIVMVLLGYHRVGRQFLGFVAAILTIVLQGNLIRDQSNVLYQAGP
DIIIKGRVDSFFTQTRYAYEGFVLIHEVNGQTLNKMTRPRIRLSAPLLLQPNDRVEFSVTLKPIVGRLNQTGFDLEAHYM
AQSVVARAVVKPDTAYQIVQESGIRSSLFFELEQLTHTSPYQGLILALTFGERKGIDEQEWQALRNSGLIHLVAISGLHI
GIAFSVGYFLGLGMMRFHAQLLWSPFVCGALLAVLYAWLAGFTLPTQRALIMCLLNVALIMLAFPLSALKRILLTLVAVL
LWSPFASLSNSFWMSFLAVAIVLYQLASQSQRQVWWKALLWAQVFLVCLMAPVTAYFFGGLSVTAVLYNLVFIPWFSLVI
VPALFLGLLLMVVWPSVAAAYWPWVDWTFLPLDWALQFADVGWWVVPSKVQGVVAASVAILLLYRFMSLKACSLLLGMIG
LWWWFPSLTPLWRMDVLDVGHGLAIVIEQDERAIVYDTGSSWPGGSYVQSVIEPMLQQRGLRQVDGVILSHLDNDHAGDW
QGLAERWQPNWIRASQLGTEFMPCIRGESWQWQSLHFTVLWPPQAVSRAYNQHSCVIRMTDTQSNHSVLLSGDVTAMGEW
LLARDGAQLQSEVMIVPHHGSKTSSTAEFIAQVNPKLAIASVAKDNRWNLPNPQVVARYQAQQVEWLDTGHAGQISLFFY
LDQLDWFTQRSLGWQPWYRQMLRKGVE

Nucleotide


Download         Length: 2244 bp        

>NTDB_id=1063013 ACG4I4_RS05165 WP_000173776.1 1089602..1091845(+) (comEC) [Vibrio cholerae strain CRC1106]
ATGACTCTCTTGTCGAATTACTGGTCTCTCATTTCTTTTTCGATCACCGTGCTGTCTGCCCCTTATTGGCCTTGGATGCC
GAGTTGGGGTTGGGCTTGGTTATGCCTTATTGTTATGGTTTTGCTCGGTTATCACCGAGTTGGCCGTCAATTCCTTGGCT
TCGTGGCTGCCATACTAACCATTGTGCTACAGGGCAACCTTATACGAGATCAATCCAATGTGCTCTATCAAGCAGGGCCG
GATATTATCATAAAAGGCCGTGTTGACAGCTTTTTTACGCAAACTCGTTACGCTTATGAGGGTTTTGTCCTCATTCATGA
AGTGAATGGACAAACCTTAAACAAAATGACTCGCCCTCGCATACGTTTAAGTGCCCCTTTACTGTTACAACCCAATGATC
GCGTCGAATTTTCGGTAACTCTCAAGCCGATAGTGGGTCGACTCAACCAAACCGGCTTTGATTTAGAAGCGCATTACATG
GCGCAATCTGTCGTCGCACGAGCGGTCGTAAAACCTGACACTGCTTATCAAATTGTGCAAGAGAGTGGCATAAGGTCAAG
TTTGTTTTTTGAGCTAGAGCAATTAACGCATACCAGCCCATACCAAGGATTGATCTTAGCCCTGACGTTTGGCGAGCGAA
AAGGTATTGATGAGCAAGAGTGGCAAGCCTTACGCAATAGTGGCTTAATTCATTTAGTGGCCATTTCGGGGCTGCACATT
GGTATCGCTTTTAGCGTGGGGTATTTTCTCGGGCTCGGCATGATGCGTTTTCATGCTCAGTTATTGTGGTCCCCTTTTGT
GTGTGGGGCTTTACTGGCGGTGCTCTACGCTTGGCTGGCCGGATTTACGTTGCCTACTCAGCGTGCATTGATTATGTGCT
TACTCAATGTGGCGTTGATCATGTTGGCTTTTCCTCTTTCCGCGCTCAAGCGGATTCTACTCACCTTAGTCGCGGTCTTG
CTTTGGTCGCCATTCGCCTCACTTTCAAACAGTTTCTGGATGTCGTTTTTGGCGGTCGCGATTGTTCTCTACCAATTAGC
CAGTCAAAGCCAGCGTCAGGTGTGGTGGAAAGCTCTTCTTTGGGCGCAGGTGTTCCTCGTCTGTTTAATGGCACCGGTCA
CGGCCTATTTTTTCGGTGGCTTAAGCGTAACGGCAGTTCTGTACAATTTGGTGTTTATTCCTTGGTTTTCGTTGGTGATT
GTCCCAGCTTTGTTTTTGGGTCTATTACTCATGGTGGTATGGCCTAGTGTGGCCGCCGCTTACTGGCCTTGGGTGGATTG
GACGTTTTTACCGCTCGATTGGGCTTTGCAGTTTGCCGATGTAGGCTGGTGGGTGGTCCCCAGCAAAGTACAAGGTGTGG
TCGCAGCGAGTGTGGCCATCCTCTTGCTTTATCGATTTATGAGCCTAAAAGCCTGCAGCTTATTATTGGGTATGATTGGC
TTATGGTGGTGGTTTCCCTCTCTCACTCCACTTTGGCGAATGGATGTGCTGGATGTTGGACATGGCTTGGCGATTGTGAT
TGAGCAAGATGAGCGAGCAATTGTCTACGATACAGGCAGCAGTTGGCCGGGAGGCAGCTATGTGCAAAGCGTGATTGAGC
CTATGCTCCAACAGCGGGGGCTACGCCAAGTGGATGGAGTGATTTTAAGTCATCTTGATAATGATCATGCGGGTGATTGG
CAAGGTTTAGCTGAGCGCTGGCAACCCAATTGGATTCGTGCCAGCCAACTCGGGACAGAGTTTATGCCTTGTATCCGTGG
TGAAAGCTGGCAGTGGCAATCTCTCCATTTTACGGTGTTATGGCCACCACAAGCGGTTAGCCGAGCGTACAACCAGCATT
CGTGTGTGATTCGTATGACCGATACTCAGTCTAACCATTCTGTACTGCTCTCCGGGGATGTCACAGCCATGGGGGAGTGG
CTGCTTGCTCGCGACGGAGCGCAACTGCAAAGTGAGGTGATGATCGTGCCGCACCACGGCAGTAAAACATCGTCCACCGC
AGAGTTTATTGCCCAAGTGAATCCCAAACTTGCGATTGCTTCTGTGGCGAAAGATAACCGCTGGAATTTGCCTAATCCGC
AAGTCGTGGCACGTTATCAAGCTCAGCAAGTTGAGTGGCTAGATACTGGACACGCTGGGCAAATTAGCCTCTTTTTCTAT
CTAGATCAGCTGGATTGGTTTACCCAGCGTAGCCTTGGCTGGCAGCCTTGGTATAGGCAGATGCTGCGTAAAGGAGTAGA
ATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Vibrio cholerae strain A1552

100

100

1

  comEC Vibrio parahaemolyticus RIMD 2210633

41.215

100

0.418

  comEC Vibrio campbellii strain DS40M4

40.957

100

0.412


Multiple sequence alignment