Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC/celB   Type   Machinery gene
Locus tag   A4H00_RS00555 Genome accession   NZ_CP015196
Coordinates   114448..116685 (-) Length   745 a.a.
NCBI ID   WP_067086023.1    Uniprot ID   -
Organism   Streptococcus marmotae strain HTS5     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 89982..126919 114448..116685 within 0


Gene organization within MGE regions


Location: 89982..126919
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  A4H00_RS00395 - 89982..90431 (-) 450 WP_067085947.1 type II toxin-antitoxin system HicB family antitoxin -
  A4H00_RS00400 - 90469..90648 (-) 180 WP_067085951.1 type II toxin-antitoxin system HicA family toxin -
  A4H00_RS00405 - 90834..91247 (-) 414 WP_067085954.1 hypothetical protein -
  A4H00_RS00410 - 91228..91650 (-) 423 WP_067085956.1 hypothetical protein -
  A4H00_RS00415 - 91696..91974 (-) 279 WP_067085959.1 hypothetical protein -
  A4H00_RS00420 - 91964..92302 (-) 339 WP_067085961.1 replication protein -
  A4H00_RS00425 - 92335..92913 (-) 579 WP_157770960.1 hypothetical protein -
  A4H00_RS00430 - 93163..94419 (-) 1257 WP_067085967.1 ISL3 family transposase -
  A4H00_RS00435 - 94602..94853 (-) 252 WP_067085970.1 hypothetical protein -
  A4H00_RS00440 - 94850..95101 (-) 252 WP_067085973.1 hypothetical protein -
  A4H00_RS00445 - 95409..96833 (-) 1425 WP_237334203.1 virulence-associated E family protein -
  A4H00_RS00450 - 96826..97341 (-) 516 WP_067085974.1 hypothetical protein -
  A4H00_RS00455 - 97334..97615 (-) 282 WP_067085977.1 hypothetical protein -
  A4H00_RS00460 - 97625..97843 (-) 219 WP_067085980.1 hypothetical protein -
  A4H00_RS00470 - 98018..98257 (-) 240 WP_067085985.1 hypothetical protein -
  A4H00_RS11740 - 98513..98677 (-) 165 WP_157770961.1 hypothetical protein -
  A4H00_RS00475 - 98670..99134 (-) 465 WP_067085987.1 hypothetical protein -
  A4H00_RS00480 - 99194..99394 (-) 201 WP_067085989.1 helix-turn-helix transcriptional regulator -
  A4H00_RS00485 - 99551..100315 (+) 765 WP_067085991.1 XRE family transcriptional regulator -
  A4H00_RS00490 - 100358..101179 (+) 822 WP_067085994.1 EbhA -
  A4H00_RS00495 - 101222..101779 (+) 558 WP_067085997.1 GTP pyrophosphokinase -
  A4H00_RS00500 - 101941..102609 (+) 669 WP_067086000.1 Fic family protein -
  A4H00_RS00505 - 102782..103927 (+) 1146 WP_067091309.1 tyrosine-type recombinase/integrase -
  A4H00_RS00510 queA 104025..105053 (+) 1029 WP_067086002.1 tRNA preQ1(34) S-adenosylmethionine ribosyltransferase-isomerase QueA -
  A4H00_RS00515 - 105121..106239 (-) 1119 WP_067086005.1 aminotransferase -
  A4H00_RS00520 sodA 106343..106948 (-) 606 WP_067086007.1 superoxide dismutase SodA -
  A4H00_RS00525 holA 107024..108055 (-) 1032 WP_067086010.1 DNA polymerase III subunit delta -
  A4H00_RS00535 - 109084..110565 (-) 1482 WP_067086015.1 hypothetical protein -
  A4H00_RS00540 - 110549..111442 (-) 894 WP_067086017.1 ABC transporter ATP-binding protein -
  A4H00_RS12055 - 111809..112015 (+) 207 WP_206281834.1 hypothetical protein -
  A4H00_RS00550 - 112287..113810 (-) 1524 WP_067086021.1 class I adenylate-forming enzyme family protein -
  A4H00_RS00555 comEC/celB 114448..116685 (-) 2238 WP_067086023.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  A4H00_RS00560 comEA 116669..117307 (-) 639 WP_067086025.1 helix-hairpin-helix domain-containing protein Machinery gene
  A4H00_RS00565 - 117377..118123 (-) 747 WP_067086028.1 lysophospholipid acyltransferase family protein -
  A4H00_RS00570 - 118219..118968 (+) 750 WP_067091314.1 tRNA1(Val) (adenine(37)-N6)-methyltransferase -
  A4H00_RS00575 - 118910..119269 (+) 360 WP_067086031.1 GIY-YIG nuclease family protein -
  A4H00_RS00580 - 119436..119846 (-) 411 WP_067086033.1 GNAT family N-acetyltransferase -
  A4H00_RS00585 - 119939..120604 (-) 666 WP_067086036.1 ArsR/SmtB family transcription factor -
  A4H00_RS00590 - 120981..124115 (-) 3135 WP_067086039.1 GAG-binding domain-containing protein -
  A4H00_RS00600 - 124517..124834 (-) 318 WP_067086044.1 hypothetical protein -
  A4H00_RS00605 - 124976..126223 (-) 1248 WP_067086046.1 ABC transporter permease -
  A4H00_RS00610 - 126224..126919 (-) 696 WP_067086048.1 ABC transporter ATP-binding protein -

Sequence


Protein


Download         Length: 745 a.a.        Molecular weight: 85251.99 Da        Isoelectric Point: 8.9034

>NTDB_id=177408 A4H00_RS00555 WP_067086023.1 114448..116685(-) (comEC/celB) [Streptococcus marmotae strain HTS5]
MSQLIKRLPIAPIHLAVLLLALYFVMYRLSFLSVLLFLGLLVLLWFRQGRKVWYQVLPILGCFVLLFGLQRVKMMMDEGT
APTEVSQLLVKPDTIQVNGDSLTFRARSSGRLYTVFYQLKSEKEQAYFKHLSSLVDLEVEATVSEPEAQRNFNGFDYRDY
LRTQGIYRTVKISHITKIQKRFSWNPLDWLSVLRRKALVYIKEQFPNPMRHYMTGLLFGELDKEFDQMSDLYSSLGIIHL
FALSGMQVGFFVDKFRYLFLRLGLRKEIVDWLQVPFSFVYAGLTGFSVSVNRSLLQKMLANLGITKLDNMACTLVLSFLL
MPYFLLTAGGILSFAYAFLLTVFDFEELPHYKQVLVESLAISVGILPLLLYYFYSFQPLSILLTFLFSFVFDVVFLPGLS
IVFLLAPLIQLTQVNLFFVWMEACIQWVADLGFKPLIFGKPIATLLVVLLAVLLLTYDVYQNRKWRFVLISLAALLFFVV
KHPLENEITIVDIGQGDSIFLRDIRGRTVLIDVGGKVSFTAKESWQEKSAQANAERTLIPYLYSRGVSRIDTLVLTHTDT
DHVGDLVEVAQAVDIGCIYVSEGSLTVPDFIETLRGLRVPVHVVRVGGRIPIFDRFLEVLYPPTVGDGGNNDSVILYGNV
LQTRFLFTGDLEDGELDLVKTYPQLPVDVLKAGHHGSKGSSYPEFLDHISPKIALISAGKNNRYKHPHQETIDRFEERKV
QLFRTDEQGAIRFRGWKKWRIETVR

Nucleotide


Download         Length: 2238 bp        

>NTDB_id=177408 A4H00_RS00555 WP_067086023.1 114448..116685(-) (comEC/celB) [Streptococcus marmotae strain HTS5]
ATGTCACAGTTGATTAAGCGCCTACCCATTGCTCCCATTCATTTAGCTGTTTTACTGTTGGCTCTTTATTTTGTGATGTA
CCGCCTATCCTTTTTATCTGTGCTCCTGTTTTTAGGCTTGTTAGTTTTACTCTGGTTTCGACAAGGGCGAAAAGTCTGGT
ATCAGGTGCTTCCGATTTTGGGCTGTTTTGTTCTCTTGTTCGGTCTACAAAGAGTGAAAATGATGATGGATGAAGGAACA
GCTCCGACAGAAGTTAGTCAACTATTGGTCAAACCAGACACCATTCAAGTCAACGGCGATAGCCTGACCTTTCGAGCAAG
GTCATCAGGGCGCTTGTATACTGTTTTTTATCAATTAAAAAGTGAGAAGGAGCAGGCTTATTTTAAGCATTTGTCAAGCT
TGGTGGACCTAGAAGTGGAGGCAACTGTATCTGAACCAGAAGCACAGCGAAATTTTAATGGATTTGATTATCGAGATTAT
TTAAGGACACAAGGGATTTATCGGACGGTCAAGATTAGCCACATCACAAAAATACAAAAGCGTTTTTCTTGGAATCCCTT
GGATTGGTTATCGGTACTTCGGCGCAAAGCATTGGTGTATATCAAGGAGCAGTTCCCAAATCCGATGCGACATTACATGA
CTGGGCTGTTGTTTGGAGAGTTGGACAAGGAGTTTGACCAAATGAGTGACCTGTATTCTAGTCTAGGCATTATTCATTTA
TTTGCTTTATCAGGGATGCAAGTGGGATTCTTTGTCGATAAATTTCGCTATCTTTTCTTGCGATTGGGTCTGCGAAAGGA
GATAGTCGATTGGCTGCAAGTCCCCTTTTCTTTTGTGTATGCTGGCTTGACAGGCTTTTCGGTCTCAGTCAATCGCTCCC
TTTTGCAGAAAATGTTAGCCAATCTAGGCATTACGAAATTGGACAATATGGCTTGTACGCTTGTCCTATCTTTCCTGCTT
ATGCCGTATTTTTTATTAACAGCTGGAGGTATCCTGAGTTTTGCTTATGCCTTTCTGCTGACAGTATTTGATTTTGAGGA
GTTGCCGCATTATAAGCAGGTACTAGTGGAGAGTTTAGCGATTTCAGTTGGGATTTTACCCTTGTTGCTCTATTATTTTT
ACAGCTTTCAACCCTTGTCTATCCTTCTAACCTTTCTCTTTTCCTTTGTTTTTGATGTTGTTTTCTTACCAGGACTCAGC
ATTGTGTTTTTGCTAGCACCCTTGATACAGCTTACGCAGGTTAATCTTTTCTTTGTTTGGATGGAGGCCTGCATTCAGTG
GGTAGCAGATCTGGGGTTCAAACCATTGATTTTCGGGAAACCGATAGCTACGCTATTAGTCGTCTTGCTGGCCGTTTTGC
TTCTTACGTATGATGTCTATCAGAACCGTAAATGGCGTTTCGTTCTCATCAGCTTAGCAGCTCTGCTATTTTTTGTCGTC
AAACATCCTTTGGAAAATGAGATCACAATAGTGGATATTGGACAAGGCGATAGCATCTTTTTGCGGGATATTCGTGGTCG
AACAGTTTTGATTGATGTTGGAGGAAAGGTGAGTTTTACTGCTAAAGAAAGTTGGCAGGAGAAGAGTGCACAAGCAAATG
CAGAGCGCACCTTGATTCCCTACCTTTATAGCCGTGGCGTGAGTCGGATTGACACACTTGTGTTAACGCATACTGATACA
GACCACGTGGGAGATCTGGTAGAAGTTGCACAAGCTGTAGACATTGGGTGCATTTACGTGTCTGAAGGAAGCTTGACGGT
TCCTGATTTTATAGAGACCCTGAGAGGGCTACGGGTTCCTGTTCATGTTGTACGAGTAGGAGGTAGGATTCCGATTTTTG
ACCGTTTTTTAGAAGTGCTTTATCCCCCTACAGTAGGAGATGGGGGAAATAATGATTCAGTGATTTTGTATGGAAATGTA
TTGCAGACGCGTTTTCTTTTTACAGGGGATTTAGAAGATGGGGAGTTGGACCTGGTTAAAACCTATCCGCAGTTACCAGT
TGATGTATTGAAGGCAGGACACCATGGCTCCAAAGGCTCATCCTATCCCGAATTTCTGGATCACATTTCACCAAAGATAG
CTTTGATTTCAGCTGGTAAAAATAATCGTTACAAACATCCTCATCAGGAGACAATAGATCGATTTGAGGAAAGGAAGGTA
CAGCTATTTCGGACCGATGAGCAAGGCGCTATTCGTTTTAGGGGCTGGAAAAAATGGCGGATTGAAACGGTGAGGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC/celB Streptococcus mitis SK321

53.949

100

0.541

  comEC/celB Streptococcus mitis NCTC 12261

52.949

100

0.53

  comEC/celB Streptococcus pneumoniae TIGR4

52.61

100

0.528

  comEC/celB Streptococcus pneumoniae Rx1

51.807

100

0.519

  comEC/celB Streptococcus pneumoniae D39

51.807

100

0.519

  comEC/celB Streptococcus pneumoniae R6

51.807

100

0.519

  comEC Lactococcus lactis subsp. cremoris KW2

46.267

100

0.466


Multiple sequence alignment