Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   ACG4I5_RS04965 Genome accession   NZ_CP172325
Coordinates   1126994..1129237 (-) Length   747 a.a.
NCBI ID   WP_000173776.1    Uniprot ID   -
Organism   Vibrio cholerae strain M812     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1121994..1134237
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACG4I5_RS04945 kdsB 1123287..1124045 (-) 759 WP_000011329.1 3-deoxy-manno-octulosonate cytidylyltransferase -
  ACG4I5_RS04950 - 1124045..1124224 (-) 180 WP_000350068.1 Trm112 family protein -
  ACG4I5_RS04955 lpxK 1124205..1125212 (-) 1008 WP_001994134.1 tetraacyldisaccharide 4'-kinase -
  ACG4I5_RS04960 msbA 1125215..1126963 (-) 1749 WP_000052153.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
  ACG4I5_RS04965 comEC 1126994..1129237 (-) 2244 WP_000173776.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  ACG4I5_RS04970 - 1129246..1129776 (+) 531 WP_001881633.1 DUF2062 domain-containing protein -
  ACG4I5_RS04975 lolE 1129891..1131135 (-) 1245 WP_000493010.1 lipoprotein-releasing ABC transporter permease subunit LolE -
  ACG4I5_RS04980 lolD 1131136..1131822 (-) 687 WP_001061290.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  ACG4I5_RS04985 lolC 1131815..1133023 (-) 1209 WP_000468900.1 lipoprotein-releasing ABC transporter permease subunit LolC -
  ACG4I5_RS04990 - 1133199..1133774 (+) 576 WP_000999601.1 PilZ domain-containing protein -

Sequence


Protein


Download         Length: 747 a.a.        Molecular weight: 84511.71 Da        Isoelectric Point: 7.7612

>NTDB_id=1066348 ACG4I5_RS04965 WP_000173776.1 1126994..1129237(-) (comEC) [Vibrio cholerae strain M812]
MTLLSNYWSLISFSITVLSAPYWPWMPSWGWAWLCLIVMVLLGYHRVGRQFLGFVAAILTIVLQGNLIRDQSNVLYQAGP
DIIIKGRVDSFFTQTRYAYEGFVLIHEVNGQTLNKMTRPRIRLSAPLLLQPNDRVEFSVTLKPIVGRLNQTGFDLEAHYM
AQSVVARAVVKPDTAYQIVQESGIRSSLFFELEQLTHTSPYQGLILALTFGERKGIDEQEWQALRNSGLIHLVAISGLHI
GIAFSVGYFLGLGMMRFHAQLLWSPFVCGALLAVLYAWLAGFTLPTQRALIMCLLNVALIMLAFPLSALKRILLTLVAVL
LWSPFASLSNSFWMSFLAVAIVLYQLASQSQRQVWWKALLWAQVFLVCLMAPVTAYFFGGLSVTAVLYNLVFIPWFSLVI
VPALFLGLLLMVVWPSVAAAYWPWVDWTFLPLDWALQFADVGWWVVPSKVQGVVAASVAILLLYRFMSLKACSLLLGMIG
LWWWFPSLTPLWRMDVLDVGHGLAIVIEQDERAIVYDTGSSWPGGSYVQSVIEPMLQQRGLRQVDGVILSHLDNDHAGDW
QGLAERWQPNWIRASQLGTEFMPCIRGESWQWQSLHFTVLWPPQAVSRAYNQHSCVIRMTDTQSNHSVLLSGDVTAMGEW
LLARDGAQLQSEVMIVPHHGSKTSSTAEFIAQVNPKLAIASVAKDNRWNLPNPQVVARYQAQQVEWLDTGHAGQISLFFY
LDQLDWFTQRSLGWQPWYRQMLRKGVE

Nucleotide


Download         Length: 2244 bp        

>NTDB_id=1066348 ACG4I5_RS04965 WP_000173776.1 1126994..1129237(-) (comEC) [Vibrio cholerae strain M812]
ATGACTCTCTTGTCGAATTACTGGTCTCTCATTTCTTTTTCGATCACCGTGCTGTCTGCCCCTTATTGGCCTTGGATGCC
GAGTTGGGGTTGGGCTTGGTTATGCCTTATTGTTATGGTTTTGCTCGGTTATCACCGAGTTGGCCGTCAATTCCTTGGCT
TCGTGGCTGCCATACTAACCATTGTGCTACAGGGCAACCTTATACGAGATCAATCCAATGTGCTCTATCAAGCAGGGCCG
GATATTATCATAAAAGGCCGTGTTGACAGCTTTTTTACGCAAACTCGTTACGCTTATGAGGGTTTTGTCCTCATTCATGA
AGTGAATGGACAAACCTTAAACAAAATGACTCGCCCTCGCATACGTTTAAGTGCCCCTTTACTGTTACAACCCAATGATC
GCGTCGAATTTTCGGTAACTCTCAAGCCGATAGTGGGTCGACTCAACCAAACCGGCTTTGATTTAGAAGCGCATTACATG
GCGCAATCTGTCGTCGCACGAGCGGTCGTAAAACCTGACACTGCTTATCAAATTGTGCAAGAGAGTGGCATAAGGTCAAG
TTTGTTTTTTGAGCTAGAGCAATTAACGCATACCAGCCCATACCAAGGATTGATCTTAGCCCTGACGTTTGGCGAGCGAA
AAGGTATTGATGAGCAAGAGTGGCAAGCCTTACGCAATAGTGGCTTAATTCATTTAGTGGCCATTTCGGGGCTGCACATT
GGTATCGCTTTTAGCGTGGGGTATTTTCTCGGGCTCGGCATGATGCGTTTTCATGCTCAGTTATTGTGGTCCCCTTTTGT
GTGTGGGGCTTTACTGGCGGTGCTCTACGCTTGGCTGGCCGGATTTACGTTGCCTACTCAGCGTGCATTGATTATGTGCT
TACTCAATGTGGCGTTGATCATGTTGGCTTTTCCTCTTTCCGCGCTCAAGCGGATTCTACTCACCTTAGTCGCGGTCTTG
CTTTGGTCGCCATTCGCCTCACTTTCAAACAGTTTCTGGATGTCGTTTTTGGCGGTCGCGATTGTTCTCTACCAATTAGC
CAGTCAAAGCCAGCGTCAGGTGTGGTGGAAAGCTCTTCTTTGGGCGCAGGTGTTCCTCGTCTGTTTAATGGCACCGGTCA
CGGCCTATTTTTTCGGTGGCTTAAGCGTAACGGCAGTTCTGTACAATTTGGTGTTTATTCCTTGGTTTTCGTTGGTGATT
GTCCCAGCTTTGTTTTTGGGTCTATTACTCATGGTGGTATGGCCTAGTGTGGCCGCCGCTTACTGGCCTTGGGTGGATTG
GACGTTTTTACCGCTCGATTGGGCTTTGCAGTTTGCCGATGTAGGCTGGTGGGTGGTCCCCAGCAAAGTACAAGGTGTGG
TCGCAGCGAGTGTGGCCATCCTCTTGCTTTATCGATTTATGAGCCTAAAAGCCTGCAGCTTATTATTGGGTATGATTGGC
TTATGGTGGTGGTTTCCCTCTCTCACTCCACTTTGGCGAATGGATGTGCTGGATGTTGGACATGGCTTGGCGATTGTGAT
TGAGCAAGATGAGCGAGCAATTGTCTACGATACAGGCAGCAGTTGGCCGGGAGGCAGCTATGTGCAAAGCGTGATTGAGC
CTATGCTCCAACAGCGGGGGCTACGCCAAGTGGATGGAGTGATTTTAAGTCATCTTGATAATGATCATGCGGGTGATTGG
CAAGGTTTAGCTGAGCGCTGGCAACCCAATTGGATTCGTGCCAGCCAACTCGGGACAGAGTTTATGCCTTGTATCCGTGG
TGAAAGCTGGCAGTGGCAATCTCTCCATTTTACGGTGTTATGGCCACCACAAGCGGTTAGCCGAGCGTACAACCAGCATT
CGTGTGTGATTCGTATGACCGATACTCAGTCTAACCATTCTGTACTGCTCTCCGGGGATGTCACAGCCATGGGGGAGTGG
CTGCTTGCTCGCGACGGAGCGCAACTGCAAAGTGAGGTGATGATCGTGCCGCACCACGGCAGTAAAACATCGTCCACCGC
AGAGTTTATTGCCCAAGTGAATCCCAAACTTGCGATTGCTTCTGTGGCGAAAGATAACCGCTGGAATTTGCCTAATCCGC
AAGTCGTGGCACGTTATCAAGCTCAGCAAGTTGAGTGGCTAGATACTGGACACGCTGGGCAAATTAGCCTCTTTTTCTAT
CTAGATCAGCTGGATTGGTTTACCCAGCGTAGCCTTGGCTGGCAGCCTTGGTATAGGCAGATGCTGCGTAAAGGAGTAGA
ATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Vibrio cholerae strain A1552

100

100

1

  comEC Vibrio parahaemolyticus RIMD 2210633

41.215

100

0.418

  comEC Vibrio campbellii strain DS40M4

40.957

100

0.412


Multiple sequence alignment