Detailed information    

In silico: bioinformatically predicted

Overview


Name: comFA/cflA
Type: Machinery gene
Locus tag: NQZ98_RS02880
Genome accession: NZ_CP102135
Coordinates: 538897..540189 (+)
Length: 430 a.a.
NCBI ID: WP_171997439.1
UniProt ID: A0A9Q5BU37
Organism: Streptococcus suis strain M106471_S40
Function: ssDNA transport into the cell (predicted from homology); DNA binding and uptake
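
The coordinates and the protein length in this overview can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GenBank convention) and that the gene ends in a single stop codon; the helper name `span_length` is hypothetical and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


# Coordinates of NQZ98_RS02880 as listed in the overview.
gene_bp = span_length(538897, 540189)

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_bp, 3)

# Assumption: exactly one terminal stop codon that is not translated.
protein_aa = codons - 1

print(gene_bp, remainder, protein_aa)
```

Under these assumptions the span is 1293 bp, which is 431 codons, giving the 430 a.a. protein reported for this entry.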

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates   Gene coordinates   Relative position   Distance (bp)
Prophage   500330..538840    538897..540189     flank               57
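
The page does not define how the distance is measured. The reported 57 bp matches the difference between the gene start and the prophage end, so the sketch below assumes that reading; the function name `coordinate_gap` is hypothetical.

```python
from typing import Tuple

Span = Tuple[int, int]  # (start, end), 1-based inclusive


def coordinate_gap(mge: Span, gene: Span) -> int:
    """Coordinate difference between a gene and an MGE.

    Returns 0 when the two ranges overlap. This is an assumed
    definition, chosen because it reproduces the listed value.
    """
    if gene[0] > mge[1]:   # gene lies downstream of the MGE
        return gene[0] - mge[1]
    if gene[1] < mge[0]:   # gene lies upstream of the MGE
        return mge[0] - gene[1]
    return 0               # overlapping ranges


prophage = (500330, 538840)
comfa = (538897, 540189)
print(coordinate_gap(prophage, comfa))
```

With this definition, 56 bases (538841..538896) lie strictly between the two features, which is consistent with the "flank" label rather than an overlap.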


Gene organization within MGE regions


Location: 500330..540189
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NQZ98_RS02585 (NQZ98_02585) - 500330..501487 (-) 1158 WP_171997401.1 site-specific integrase -
  NQZ98_RS02590 (NQZ98_02590) - 501670..503037 (-) 1368 WP_257116660.1 DUF4041 domain-containing protein -
  NQZ98_RS02595 (NQZ98_02595) - 503049..503438 (-) 390 WP_171997402.1 ImmA/IrrE family metallo-endopeptidase -
  NQZ98_RS02600 (NQZ98_02600) - 503445..503798 (-) 354 WP_171997403.1 helix-turn-helix domain-containing protein -
  NQZ98_RS02605 (NQZ98_02605) - 504086..504301 (+) 216 WP_171997404.1 helix-turn-helix transcriptional regulator -
  NQZ98_RS02610 (NQZ98_02610) - 504298..505017 (+) 720 WP_171997405.1 phage antirepressor KilAC domain-containing protein -
  NQZ98_RS02615 (NQZ98_02615) - 505200..505409 (-) 210 WP_074403834.1 hypothetical protein -
  NQZ98_RS02620 (NQZ98_02620) - 505649..505903 (+) 255 WP_002936992.1 hypothetical protein -
  NQZ98_RS02625 (NQZ98_02625) - 505944..506192 (+) 249 WP_171997406.1 hypothetical protein -
  NQZ98_RS02630 (NQZ98_02630) - 506173..506376 (+) 204 WP_171997407.1 hypothetical protein -
  NQZ98_RS02635 (NQZ98_02635) - 506547..506912 (+) 366 WP_171997408.1 transcriptional regulator -
  NQZ98_RS02640 (NQZ98_02640) - 507223..507552 (+) 330 WP_171997409.1 hypothetical protein -
  NQZ98_RS02645 (NQZ98_02645) - 507559..507699 (+) 141 WP_171997410.1 hypothetical protein -
  NQZ98_RS02650 (NQZ98_02650) - 507742..508191 (+) 450 WP_105143501.1 hypothetical protein -
  NQZ98_RS02655 (NQZ98_02655) - 508163..508321 (+) 159 WP_153309136.1 hypothetical protein -
  NQZ98_RS02660 (NQZ98_02660) - 508321..508500 (+) 180 WP_024394211.1 hypothetical protein -
  NQZ98_RS02665 (NQZ98_02665) - 508497..509369 (+) 873 WP_257116661.1 recombinase RecT -
  NQZ98_RS02670 (NQZ98_02670) - 509369..510583 (+) 1215 WP_257116662.1 DUF1351 domain-containing protein -
  NQZ98_RS02675 (NQZ98_02675) - 510576..511412 (+) 837 WP_257116663.1 PD-(D/E)XK nuclease-like domain-containing protein -
  NQZ98_RS02680 (NQZ98_02680) - 511561..512019 (+) 459 WP_257116664.1 PcfK-like family protein -
  NQZ98_RS02685 (NQZ98_02685) - 512016..513398 (+) 1383 WP_257116665.1 PcfJ domain-containing protein -
  NQZ98_RS02690 (NQZ98_02690) - 513408..513740 (+) 333 WP_024415138.1 hypothetical protein -
  NQZ98_RS02695 (NQZ98_02695) - 513737..514255 (+) 519 WP_171997415.1 hypothetical protein -
  NQZ98_RS02700 (NQZ98_02700) - 514265..514579 (+) 315 WP_171997416.1 DUF1372 family protein -
  NQZ98_RS02705 (NQZ98_02705) - 514581..514901 (+) 321 WP_171997417.1 hypothetical protein -
  NQZ98_RS02710 (NQZ98_02710) - 514894..515442 (+) 549 WP_171997418.1 DUF1642 domain-containing protein -
  NQZ98_RS02715 (NQZ98_02715) - 515442..515624 (+) 183 WP_171997419.1 hypothetical protein -
  NQZ98_RS02720 (NQZ98_02720) - 515739..516020 (+) 282 WP_257116666.1 hypothetical protein -
  NQZ98_RS11325 - 515948..516208 (+) 261 WP_371318353.1 YopX family protein -
  NQZ98_RS02725 (NQZ98_02725) - 516208..516384 (+) 177 WP_171997420.1 hypothetical protein -
  NQZ98_RS02730 (NQZ98_02730) - 516385..516654 (+) 270 WP_171997421.1 hypothetical protein -
  NQZ98_RS02735 (NQZ98_02735) - 516651..516917 (+) 267 WP_248026188.1 hypothetical protein -
  NQZ98_RS02740 (NQZ98_02740) - 516991..517410 (+) 420 WP_171997422.1 DUF1492 domain-containing protein -
  NQZ98_RS02745 (NQZ98_02745) - 517727..518170 (+) 444 WP_371318357.1 terminase small subunit -
  NQZ98_RS02750 (NQZ98_02750) - 518173..519462 (+) 1290 WP_257116667.1 PBSX family phage terminase large subunit -
  NQZ98_RS02755 (NQZ98_02755) - 519472..520971 (+) 1500 WP_257116668.1 phage portal protein -
  NQZ98_RS02760 (NQZ98_02760) - 520971..522581 (+) 1611 WP_171997423.1 phage minor capsid protein -
  NQZ98_RS02765 (NQZ98_02765) - 522547..522768 (+) 222 WP_161949469.1 CPCC family cysteine-rich protein -
  NQZ98_RS02770 (NQZ98_02770) - 522818..522952 (+) 135 WP_257116669.1 hypothetical protein -
  NQZ98_RS02775 (NQZ98_02775) - 522945..523220 (+) 276 WP_257116670.1 hypothetical protein -
  NQZ98_RS02780 (NQZ98_02780) - 523232..523522 (+) 291 WP_002936883.1 hypothetical protein -
  NQZ98_RS02785 (NQZ98_02785) - 523662..523898 (+) 237 WP_171997424.1 hypothetical protein -
  NQZ98_RS02790 (NQZ98_02790) - 523883..524473 (+) 591 WP_171997425.1 hypothetical protein -
  NQZ98_RS02795 (NQZ98_02795) - 524488..525360 (+) 873 WP_171997426.1 hypothetical protein -
  NQZ98_RS02800 (NQZ98_02800) - 525374..525625 (+) 252 WP_105182381.1 hypothetical protein -
  NQZ98_RS02805 (NQZ98_02805) - 525628..526014 (+) 387 WP_171997427.1 hypothetical protein -
  NQZ98_RS02810 (NQZ98_02810) - 526008..526352 (+) 345 WP_171997428.1 putative minor capsid protein -
  NQZ98_RS02815 (NQZ98_02815) - 526349..526702 (+) 354 WP_171997429.1 minor capsid protein -
  NQZ98_RS02820 (NQZ98_02820) - 526705..527073 (+) 369 WP_171997430.1 minor capsid protein -
  NQZ98_RS02825 (NQZ98_02825) - 527077..527577 (+) 501 WP_171997431.1 phage tail tube protein -
  NQZ98_RS02830 (NQZ98_02830) - 527593..528003 (+) 411 WP_171997432.1 hypothetical protein -
  NQZ98_RS02835 (NQZ98_02835) - 528011..528622 (+) 612 WP_171997433.1 Gp15 family bacteriophage protein -
  NQZ98_RS02840 (NQZ98_02840) - 528623..531193 (+) 2571 WP_171997434.1 phage tail tape measure protein -
  NQZ98_RS02845 (NQZ98_02845) - 531196..531957 (+) 762 WP_171997435.1 hypothetical protein -
  NQZ98_RS02850 (NQZ98_02850) - 531979..535872 (+) 3894 WP_171997436.1 phage tail spike protein -
  NQZ98_RS02855 (NQZ98_02855) - 535881..536093 (+) 213 WP_024415112.1 hypothetical protein -
  NQZ98_RS02860 (NQZ98_02860) - 536096..536440 (+) 345 WP_171997437.1 hypothetical protein -
  NQZ98_RS02865 (NQZ98_02865) - 536486..536977 (+) 492 WP_202859617.1 phage holin family protein -
  NQZ98_RS02870 (NQZ98_02870) - 537086..537922 (+) 837 WP_171997438.1 N-acetylmuramoyl-L-alanine amidase -
  NQZ98_RS02875 (NQZ98_02875) - 538208..538840 (-) 633 WP_024389592.1 YigZ family protein -
  NQZ98_RS02880 (NQZ98_02880) comFA/cflA 538897..540189 (+) 1293 WP_171997439.1 DEAD/DEAH box helicase Machinery gene

Sequence


Protein


Length: 430 a.a.        Molecular weight: 48942.59 Da        Isoelectric point: 9.3820

>NTDB_id=714570 NQZ98_RS02880 WP_171997439.1 538897..540189(+) (comFA/cflA) [Streptococcus suis strain M106471_S40]
MKELENYYGRLFTKYQLTAKEREIAEKVPSITKKNNCFRCGTTFKEENKLPNDAYYCRACLLLGRVRSDEKLYHFPQKDF
PITKCLKWKGQLTDWQQRISDGLVANVENNRATLVHAVTGAGKTEMIYHTLASVIDKGGAVCLASPRIDVCIELYKRLQN
DFSVPISLLHGESEPYFRTPLVVATTHQLLKFYQAFDLVLIDEVDAFPYADNPMLYRAADNAVKEAGVQIFLTATSTDEL
DKKVRTGKLSRLSLPRRFHGNPLVVPQKVWFSKFDDTLKKNRLVPKLKKAIEEQRKSGFPLLIFVPEISKGQEFTKIMKK
TFPEETIGFVSSQTENRLEIVEGFRKREITVLISTTILERGVTFPCVDVFVVQANHYLYTASSLVQIAGRVGRSMERPTG
LLQFYHEGSTGAIEKAIAEIKQMNKEAGYV
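
The FASTA header of this record packs several fields into one line. A minimal parsing sketch follows; the field layout is inferred from this single header, so the pattern and the group names (`ntdb_id`, `locus_tag`, and so on) are assumptions rather than a documented format.

```python
import re

# Pattern inferred from the header shown in this record (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) "
    r"(?P<locus_tag>\S+) "
    r"(?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) "
    r"\[(?P<organism>[^\]]+)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; raise if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the assumed layout")
    fields = match.groupdict()
    # Coordinates are more useful as integers.
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=714570 NQZ98_RS02880 WP_171997439.1 538897..540189(+) "
    "(comFA/cflA) [Streptococcus suis strain M106471_S40]"
)
record = parse_header(header)
print(record["locus_tag"], record["start"], record["end"], record["strand"])
```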

Nucleotide


Length: 1293 bp

>NTDB_id=714570 NQZ98_RS02880 WP_171997439.1 538897..540189(+) (comFA/cflA) [Streptococcus suis strain M106471_S40]
ATGAAAGAATTAGAAAATTATTATGGAAGATTATTTACCAAATACCAATTGACAGCAAAAGAAAGAGAAATAGCAGAAAA
AGTGCCAAGTATTACAAAAAAGAATAACTGCTTTCGCTGTGGAACAACCTTTAAAGAAGAAAACAAATTGCCAAACGATG
CTTATTACTGTCGAGCTTGCTTGCTTCTAGGCAGAGTACGGTCAGACGAAAAACTCTATCATTTTCCTCAAAAAGATTTT
CCAATCACTAAGTGTTTAAAGTGGAAAGGTCAATTAACTGATTGGCAACAAAGAATTTCAGATGGACTAGTTGCAAACGT
GGAAAATAATCGTGCGACATTGGTTCATGCAGTAACAGGAGCAGGTAAGACAGAAATGATCTACCACACCCTTGCCTCAG
TGATTGATAAAGGCGGAGCGGTTTGCCTAGCCAGTCCTCGAATTGATGTTTGTATCGAACTCTATAAACGTCTGCAAAAT
GACTTTTCAGTTCCAATTAGTTTACTACATGGAGAGTCTGAACCCTATTTCCGAACCCCATTAGTTGTAGCAACCACACA
TCAGTTATTAAAATTTTATCAGGCCTTTGATTTGGTTTTGATTGATGAAGTAGACGCCTTTCCCTATGCAGATAATCCCA
TGCTCTATCGAGCAGCAGACAATGCGGTCAAGGAAGCCGGTGTTCAAATTTTTCTGACGGCGACTTCAACAGATGAATTG
GATAAAAAAGTCAGAACGGGCAAATTAAGCCGTCTTAGTTTGCCAAGGCGCTTTCATGGCAACCCACTTGTTGTCCCGCA
AAAAGTCTGGTTTAGTAAATTCGATGATACCCTAAAGAAAAATAGACTAGTTCCAAAGTTGAAAAAAGCGATTGAAGAAC
AGAGAAAGTCGGGCTTCCCCTTACTCATTTTTGTCCCAGAAATCTCCAAAGGTCAAGAATTTACCAAGATAATGAAAAAA
ACATTCCCAGAAGAAACAATTGGCTTTGTATCCAGTCAAACAGAAAATCGCCTTGAAATAGTTGAAGGGTTTCGCAAGAG
AGAAATCACAGTCTTAATCTCGACGACTATTCTTGAACGTGGGGTGACCTTCCCATGTGTAGACGTCTTTGTTGTTCAAG
CCAATCATTACCTCTACACAGCGTCAAGTCTTGTTCAGATTGCAGGCCGGGTCGGAAGGAGTATGGAACGTCCGACTGGT
TTACTCCAGTTTTATCATGAGGGAAGTACAGGCGCCATTGAAAAGGCAATCGCTGAAATTAAGCAGATGAACAAGGAGGC
TGGTTATGTCTAA
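
The nucleotide and protein records can be checked against each other by translating codons. The sketch below translates only the first 30 and last 12 nucleotides of the sequence above. It uses the standard codon-to-amino-acid assignments, which the bacterial table (NCBI translation table 11) shares; the `translate` helper is hypothetical and does no handling of ambiguous bases.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 nt and last 12 nt of the nucleotide record above.
head = translate("ATGAAAGAATTAGAAAATTATTATGGAAGA")
tail = translate("GGTTATGTCTAA")
print(head, tail)
```

The two fragments translate to `MKELENYYGR` and `GYV*`, matching the start and end of the protein sequence listed for this entry.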


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein     Organism                                    Identities (%)  Coverage (%)  Ha-value
comFA/cflA  Streptococcus mitis NCTC 12261              67.285          100           0.674
comFA/cflA  Streptococcus pneumoniae Rx1                66.589          100           0.667
comFA/cflA  Streptococcus pneumoniae D39                66.589          100           0.667
comFA/cflA  Streptococcus pneumoniae R6                 66.589          100           0.667
comFA/cflA  Streptococcus pneumoniae TIGR4              66.589          100           0.667
comFA/cflA  Streptococcus mitis SK321                   66.125          100           0.663
comFA       Lactococcus lactis subsp. cremoris KW2      54.156          92.326        0.5
comFA       Latilactobacillus sakei subsp. sakei 23K    38.051          100           0.381
comFA       Bacillus subtilis subsp. subtilis str. 168  37.349          96.512        0.36