Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   G5S32_RS04705 Genome accession   NZ_CP049331
Coordinates   1046603..1048846 (+) Length   747 a.a.
NCBI ID   WP_165310835.1    Uniprot ID   A0A6G7CGV0
Organism   Vibrio ziniensis strain ZWAL4003     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1041603..1053846
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  G5S32_RS04680 (G5S32_04680) - 1041821..1042390 (-) 570 WP_165310825.1 PilZ domain-containing protein -
  G5S32_RS04685 (G5S32_04685) lolC 1042620..1043828 (+) 1209 WP_165310827.1 lipoprotein-releasing ABC transporter permease subunit LolC -
  G5S32_RS04690 (G5S32_04690) lolD 1043821..1044507 (+) 687 WP_165310829.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  G5S32_RS04695 (G5S32_04695) lolE 1044507..1045751 (+) 1245 WP_165310831.1 lipoprotein-releasing ABC transporter permease subunit LolE -
  G5S32_RS04700 (G5S32_04700) - 1046079..1046594 (-) 516 WP_165310833.1 DUF2062 domain-containing protein -
  G5S32_RS04705 (G5S32_04705) comEC 1046603..1048846 (+) 2244 WP_165310835.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  G5S32_RS04710 (G5S32_04710) msbA 1048879..1050627 (+) 1749 WP_165310837.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
  G5S32_RS04715 (G5S32_04715) lpxK 1050637..1051644 (+) 1008 WP_165310839.1 tetraacyldisaccharide 4'-kinase -
  G5S32_RS04720 (G5S32_04720) - 1051625..1051804 (+) 180 WP_165310841.1 Trm112 family protein -
  G5S32_RS04725 (G5S32_04725) kdsB 1051804..1052559 (+) 756 WP_165310843.1 3-deoxy-manno-octulosonate cytidylyltransferase -

Sequence


Protein


Download         Length: 747 a.a.        Molecular weight: 84091.83 Da        Isoelectric Point: 9.3925

>NTDB_id=426154 G5S32_RS04705 WP_165310835.1 1046603..1048846(+) (comEC) [Vibrio ziniensis strain ZWAL4003]
MTLLSKNWTLASFSLTVLSSPYWPSMPVLGFALFCPLLFILSARFTKLRNWGGIVLGLLVIITHGNAVKTQSSSIFQAGQ
DITIKGKVDSFFKQISFGYEGSVVVHQINDHFITTWFPARIRLFSPIPLEIGDHFEFSVKVKPIIGRLNEVGFDAEKYSL
SQGWVARASVNKNARFQVTSRMNLRSWIYAKVESYITNSRHKGMITALVFGERSGLSQTDWMQLRNSGLIHLVAISGLHI
GIAFGIGYTLGVGLSRLNSNLLWFPFLLGAALAVSYAWLAGFTVPTQRALIMCLLNVAMISLRVQVSISRRIWLTLSAVL
IVDPFASLASSFWLSFTAVCIVIYLYTMMSQWKCWWVKLILGQVILVLLMAPISAYFFGGVSWISILFNMVFIPWFSFIV
VPLLFIAVIGSCFTVPHINYLWRIADSVFEPVAWALQYSSSGWFAINETNLYVIVTLALIWLSRFVISSFSQVLISSILI
GVFIFREPSYDWRMDVLDVGHGLAVLIEKGGKTLLYDTGSSWENGSYVRSVIAPLLTKRGTETIDTVIYSHLDDDHAGGR
FDVNQWFLPKHVYSSQTIENSSACIRGEAWEWQGLSFTVLWPPQRVVRAYNQHSCVIRIVDTQFGHSVLLSGDVTAVGEW
LLTRDKAVVQSDVMLVPHHGSQTSSTEDFIERVSPEIAIASLDKGNRWKLPHPKVINRYSSLGVVWYDTGDAGQITLSYR
AESRHLSTLRQEGYIPWYRQMLRKGVE

Nucleotide


Download         Length: 2244 bp        

>NTDB_id=426154 G5S32_RS04705 WP_165310835.1 1046603..1048846(+) (comEC) [Vibrio ziniensis strain ZWAL4003]
ATGACTCTCTTATCGAAAAATTGGACGCTTGCTTCGTTTTCGCTCACTGTTTTGTCATCGCCCTATTGGCCGAGTATGCC
AGTTTTGGGTTTTGCGCTATTTTGCCCACTTTTATTCATCCTTAGTGCTCGATTCACTAAGTTAAGAAACTGGGGCGGGA
TCGTACTAGGATTGCTAGTGATAATAACACATGGCAACGCAGTAAAAACTCAATCCAGTAGCATTTTTCAAGCAGGGCAG
GATATTACCATAAAAGGGAAAGTTGACAGCTTTTTTAAACAAATTAGTTTTGGATACGAAGGTTCCGTTGTTGTGCATCA
AATAAATGACCATTTTATCACTACATGGTTCCCTGCTCGTATTCGTTTATTTTCACCTATCCCTTTAGAAATAGGTGACC
ACTTTGAATTTTCAGTGAAAGTGAAACCTATTATTGGTCGATTAAATGAGGTTGGGTTCGACGCTGAGAAATATTCTCTA
AGCCAAGGGTGGGTAGCGAGAGCATCAGTTAATAAAAATGCTCGTTTTCAAGTTACGTCAAGAATGAACTTGCGATCATG
GATTTATGCCAAAGTAGAAAGTTACATCACCAATAGTCGACATAAAGGGATGATTACCGCTTTGGTATTTGGCGAGCGTT
CAGGTCTTAGTCAGACTGATTGGATGCAATTGAGAAATAGTGGATTGATTCATTTGGTCGCCATTTCAGGTTTACATATC
GGTATAGCTTTTGGTATCGGTTATACGTTGGGTGTTGGATTATCAAGACTGAATTCGAACTTACTTTGGTTTCCCTTTCT
GTTAGGCGCAGCTCTAGCTGTCAGCTATGCGTGGTTAGCCGGCTTTACTGTCCCTACTCAACGTGCTCTTATCATGTGTC
TGTTAAACGTAGCGATGATTTCATTACGTGTACAAGTGTCTATTTCTCGTCGAATTTGGCTTACGTTGAGTGCAGTGCTT
ATCGTTGATCCCTTTGCTTCTTTAGCTAGCAGCTTTTGGCTTTCTTTTACTGCTGTTTGTATTGTTATTTATCTCTACAC
TATGATGTCACAATGGAAATGCTGGTGGGTGAAGTTGATCTTGGGGCAGGTGATTTTAGTACTGCTTATGGCACCGATAA
GCGCTTACTTCTTTGGCGGTGTAAGTTGGATATCCATATTGTTCAATATGGTTTTTATCCCTTGGTTTTCTTTCATTGTC
GTGCCCTTATTGTTTATCGCGGTAATTGGCTCTTGCTTTACTGTTCCGCATATCAACTATTTGTGGCGAATTGCTGACAG
TGTATTTGAGCCTGTCGCTTGGGCTCTTCAATATTCAAGCTCAGGTTGGTTTGCAATAAACGAAACTAATCTCTATGTAA
TAGTTACATTAGCATTAATCTGGCTGTCAAGATTCGTCATTAGTAGCTTTAGTCAAGTATTGATAAGTTCGATTCTGATA
GGGGTGTTTATCTTTAGAGAGCCCTCTTATGACTGGCGAATGGATGTGCTTGATGTCGGGCATGGATTAGCCGTGTTAAT
TGAAAAGGGTGGGAAAACCTTACTCTACGATACGGGAAGCAGTTGGGAAAATGGTAGTTATGTTCGTTCAGTGATAGCTC
CATTACTTACAAAAAGAGGAACAGAAACGATTGATACGGTGATTTACAGCCATCTTGATGATGATCACGCTGGTGGTAGG
TTCGATGTTAATCAATGGTTTCTACCTAAACACGTCTATTCTAGTCAGACTATAGAAAACTCATCAGCCTGTATCCGAGG
AGAGGCTTGGGAGTGGCAAGGTTTGTCCTTTACTGTGTTGTGGCCGCCACAAAGAGTCGTTCGAGCTTACAATCAGCATT
CATGTGTTATTCGTATTGTAGATACTCAGTTTGGCCATAGTGTTTTGCTTTCTGGTGATGTCACTGCTGTTGGTGAGTGG
TTACTTACAAGAGACAAGGCTGTCGTGCAAAGTGATGTGATGCTAGTTCCTCACCATGGCAGTCAAACTTCTTCTACAGA
GGATTTTATTGAAAGAGTTTCCCCCGAGATTGCGATTGCCTCACTAGATAAAGGTAACCGCTGGAAACTTCCCCATCCGA
AGGTCATTAATAGGTATTCGAGTTTAGGTGTTGTTTGGTATGACACAGGCGATGCAGGACAGATTACTCTGAGCTATAGA
GCAGAAAGTCGTCATTTATCTACTTTGCGACAAGAAGGGTACATTCCTTGGTATAGGCAGATGCTGCGTAAAGGGGTAGA
ATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A6G7CGV0

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Vibrio cholerae strain A1552

51.802

100

0.519

  comEC Vibrio parahaemolyticus RIMD 2210633

44.152

100

0.45

  comEC Vibrio campbellii strain DS40M4

43.311

100

0.438