Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   CCZ37_RS04770 Genome accession   NZ_CP022741
Coordinates   1016501..1018744 (+) Length   747 a.a.
NCBI ID   WP_094499886.1    Uniprot ID   A0A223MWR5
Organism   Vibrio qinghaiensis strain Q67     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1011501..1023744
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CCZ37_RS04745 (CCZ37_04735) - 1011948..1012517 (-) 570 WP_094499883.1 PilZ domain-containing protein -
  CCZ37_RS04750 (CCZ37_04740) lolC 1012683..1013891 (+) 1209 WP_094499884.1 lipoprotein-releasing ABC transporter permease subunit LolC -
  CCZ37_RS04755 (CCZ37_04745) lolD 1013884..1014570 (+) 687 WP_010319233.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  CCZ37_RS04760 (CCZ37_04750) lolE 1014571..1015815 (+) 1245 WP_094499885.1 lipoprotein-releasing ABC transporter permease subunit LolE -
  CCZ37_RS04765 (CCZ37_04755) - 1015985..1016494 (-) 510 WP_010319231.1 DUF2062 domain-containing protein -
  CCZ37_RS04770 (CCZ37_04760) comEC 1016501..1018744 (+) 2244 WP_094499886.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  CCZ37_RS04775 (CCZ37_04765) msbA 1018777..1020525 (+) 1749 WP_010319229.1 lipid A ABC transporter ATP-binding protein/permease MsbA -
  CCZ37_RS04780 (CCZ37_04770) lpxK 1020529..1021536 (+) 1008 WP_094499887.1 tetraacyldisaccharide 4'-kinase -
  CCZ37_RS04785 (CCZ37_04775) - 1021517..1021696 (+) 180 WP_010319227.1 Trm112 family protein -
  CCZ37_RS04790 (CCZ37_04780) kdsB 1021696..1022448 (+) 753 WP_010319226.1 3-deoxy-manno-octulosonate cytidylyltransferase -

Sequence


Protein


Download         Length: 747 a.a.        Molecular weight: 83540.58 Da        Isoelectric Point: 9.5673

>NTDB_id=242061 CCZ37_RS04770 WP_094499886.1 1016501..1018744(+) (comEC) [Vibrio qinghaiensis strain Q67]
MTLLSNYWTLASFSLTAISASYWPWMPDWKWSMPLFAILILSVGYRKLRFLSGVTMAVVVIVISGNLLREQSNTLFQAGP
DITINGQVNSFFRQISHGYEGTISIRSINGQTLNIFLRPKVRLVAPLLLKKGDVVEFTITVKPIFGRLNETGFDVESYSL
SQGIVARATVSNGQPYRIEPLPERRTQWYQKIKTWLADDPNLGILMALTFGERSDISAAQWQALRDSGLIHLVAISGLHI
GIAFGFGYSLGLVLMRLHHRMGWSPFVVGSLCALGYAWLAGFTLPTQRALIMCLLNVAMIILNVRINTLQRLLMTLSAVL
LIDPFAALSSSFWMSFIAVSVVFYQLSLPQTQRFFLWRLLTMQIGLVLCMLPVTAYFFGGVSTSAALYNLVFIPWFSFVV
VPALFVALGCTVLTFDGAPMVWHWVAKLLIPVDWSIHYSSLSWFPISGAVLVLLSSALLLFVFSPLISRFALAVCGLIMM
ASWVTIPIKTGWRIDVLDVGHGLAVLIEKEGRYVLYDTGASWQGGDHIQTTVAPVLTKRGAKQLDGLILSHLDNDHAGGR
SEVERVWQPKWKRASQTIAGYQPCIAGEQWQWQQLFFDVIWPPKTVARAYNPHSCVIRIFDPNTGFSVLLPGDVDAMSEW
LLARTEQSLQSHILLVPHHGSRTSSTVALINKVNPEVAIASLAKGGRWQLPSAQVVQRYLQHGTNWLDTGEDGQISVIFG
MQNYQVNSLRTSRSQPWYRQMLRNEVE

Nucleotide


Download         Length: 2244 bp        

>NTDB_id=242061 CCZ37_RS04770 WP_094499886.1 1016501..1018744(+) (comEC) [Vibrio qinghaiensis strain Q67]
ATGACTCTCTTATCTAATTACTGGACTCTAGCTTCGTTTTCGCTAACCGCCATTTCTGCTTCTTATTGGCCTTGGATGCC
AGATTGGAAATGGAGTATGCCTCTATTCGCCATTCTTATACTCTCTGTCGGTTATAGAAAACTGCGCTTTCTCTCAGGAG
TAACAATGGCTGTAGTCGTCATTGTGATCAGCGGGAACCTATTGCGTGAGCAGTCCAACACTCTGTTCCAGGCAGGTCCG
GATATTACCATAAATGGCCAGGTTAACAGCTTTTTTAGACAAATTAGTCATGGTTACGAAGGAACAATTTCGATTCGATC
AATCAATGGTCAAACCTTAAACATTTTTTTGCGACCCAAAGTACGATTAGTTGCGCCTTTGCTGTTGAAAAAGGGCGATG
TGGTTGAATTTACCATTACTGTAAAACCTATCTTTGGCCGGTTAAATGAGACGGGATTTGATGTAGAGTCGTACTCTTTG
AGCCAAGGTATTGTGGCTCGAGCAACCGTGAGTAATGGGCAGCCTTATCGTATTGAACCCTTGCCAGAGCGGCGAACACA
ATGGTATCAGAAAATCAAAACTTGGCTCGCGGATGATCCTAATCTTGGTATTTTAATGGCGCTCACATTTGGAGAACGCA
GTGATATTTCAGCAGCACAATGGCAAGCATTACGAGACAGTGGCCTGATTCATTTGGTTGCCATTTCTGGTCTGCACATT
GGCATCGCATTTGGTTTTGGTTATAGCCTTGGGCTCGTATTGATGCGTTTGCATCACCGTATGGGGTGGTCGCCATTCGT
CGTAGGAAGCTTGTGCGCACTAGGCTATGCTTGGTTGGCTGGCTTTACCTTGCCCACTCAACGCGCGCTCATCATGTGTT
TGCTTAATGTGGCGATGATTATCCTCAATGTCCGTATCAATACGTTGCAGCGACTACTTATGACGTTATCGGCGGTGCTG
CTCATCGATCCTTTTGCTGCGCTATCCAGTAGCTTTTGGATGTCCTTTATTGCTGTCTCAGTGGTGTTTTACCAACTTTC
GCTACCTCAAACTCAGCGATTTTTTTTATGGCGTTTATTGACGATGCAGATTGGGTTAGTGCTGTGTATGCTTCCGGTAA
CCGCTTATTTTTTCGGTGGGGTGAGTACCAGTGCGGCGCTCTATAACTTAGTGTTTATTCCTTGGTTCAGTTTTGTGGTG
GTTCCGGCGCTCTTTGTTGCTTTAGGGTGTACGGTTTTGACTTTTGATGGCGCACCTATGGTGTGGCATTGGGTGGCTAA
ATTACTCATTCCCGTTGATTGGTCAATACATTATTCAAGTCTCAGTTGGTTTCCTATTTCTGGTGCTGTTTTGGTTCTGT
TGAGCAGTGCTCTGTTGTTGTTTGTGTTTTCACCGTTGATCTCGCGTTTTGCGCTTGCTGTCTGTGGCTTAATTATGATG
GCTTCCTGGGTCACCATTCCAATCAAAACAGGTTGGCGGATTGATGTGTTAGACGTTGGGCATGGGTTGGCGGTGTTGAT
CGAAAAAGAAGGGCGCTACGTGCTTTATGATACTGGTGCAAGTTGGCAAGGTGGCGATCATATCCAAACTACGGTTGCCC
CTGTGTTAACCAAGCGAGGAGCGAAACAATTAGACGGGTTGATTTTAAGTCACTTGGATAACGATCATGCAGGCGGAAGA
TCGGAAGTTGAGCGTGTTTGGCAGCCCAAATGGAAACGTGCTAGTCAAACGATAGCGGGTTATCAACCTTGTATTGCTGG
CGAACAATGGCAGTGGCAACAGCTCTTTTTTGACGTTATCTGGCCACCTAAAACTGTAGCGCGGGCTTACAATCCTCACT
CTTGCGTGATCCGGATCTTTGACCCAAATACCGGTTTTTCAGTGCTTCTGCCCGGGGATGTGGATGCCATGAGTGAGTGG
TTACTTGCGCGAACAGAGCAATCCTTACAAAGTCATATCCTGCTTGTACCACACCACGGTAGCAGAACATCTTCTACTGT
CGCTTTGATTAATAAGGTCAATCCTGAGGTCGCTATTGCCTCGCTTGCCAAAGGTGGACGCTGGCAATTACCTTCCGCAC
AGGTTGTTCAACGCTACCTGCAACATGGAACAAACTGGTTGGATACGGGTGAAGACGGGCAAATTAGCGTTATTTTTGGC
ATGCAGAACTATCAAGTCAACAGCCTGCGTACCTCTCGCTCTCAGCCTTGGTATAGGCAGATGCTCCGTAACGAGGTAGA
ATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Vibrio cholerae strain A1552

53.324

100

0.537

  comEC Vibrio parahaemolyticus RIMD 2210633

43.857

100

0.444

  comEC Vibrio campbellii strain DS40M4

43.461

100

0.44


Multiple sequence alignment