Detailed information    

insolico Bioinformatically predicted

Overview


Name   comFA/cflA   Type   Machinery gene
Locus tag   SEQ_RS08610 Genome accession   NC_012471
Coordinates   1772844..1774022 (-) Length   392 a.a.
NCBI ID   WP_042357492.1    Uniprot ID   -
Organism   Streptococcus equi subsp. equi 4047     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1742042..1772823 1772844..1774022 flank 21


Gene organization within MGE regions


Location: 1742042..1774022
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SEQ_RS08425 (SEQ_1726) prx 1742042..1742227 (-) 186 WP_012679954.1 hypothetical protein Regulator
  SEQ_RS08430 (SEQ_1727) spel 1742349..1743137 (-) 789 WP_012679955.1 streptococcal pyrogenic exotoxin SpeL -
  SEQ_RS08435 (SEQ_1728) spek 1743404..1744183 (-) 780 WP_012679956.1 streptococcal pyrogenic exotoxin SpeK -
  SEQ_RS08440 (SEQ_1729) - 1744308..1745537 (-) 1230 WP_012679957.1 glucosaminidase domain-containing protein -
  SEQ_RS08445 (SEQ_1730) - 1745649..1745828 (-) 180 WP_012679339.1 holin -
  SEQ_RS08450 (SEQ_1731) - 1745831..1746121 (-) 291 WP_012679338.1 hypothetical protein -
  SEQ_RS08455 (SEQ_1732) - 1746134..1746748 (-) 615 WP_012679958.1 DUF1366 domain-containing protein -
  SEQ_RS08460 (SEQ_1733) - 1746751..1747182 (-) 432 WP_012679959.1 DUF1617 family protein -
  SEQ_RS08465 (SEQ_1734) - 1747191..1749095 (-) 1905 WP_012679960.1 gp58-like family protein -
  SEQ_RS08470 (SEQ_1735) - 1749106..1749720 (-) 615 WP_012679961.1 hypothetical protein -
  SEQ_RS08475 (SEQ_1736) - 1749722..1750429 (-) 708 WP_012679962.1 collagen-like protein -
  SEQ_RS08480 (SEQ_1737) - 1750429..1752486 (-) 2058 WP_012679963.1 phage tail spike protein -
  SEQ_RS08485 (SEQ_1738) - 1752483..1753262 (-) 780 WP_012679964.1 distal tail protein Dit -
  SEQ_RS08490 (SEQ_1739) - 1753294..1756548 (-) 3255 WP_012679965.1 tape measure protein -
  SEQ_RS08495 (SEQ_1740) - 1756565..1756861 (-) 297 WP_230197423.1 hypothetical protein -
  SEQ_RS08500 (SEQ_1741) - 1756936..1757295 (-) 360 WP_012679967.1 tail assembly chaperone -
  SEQ_RS08505 (SEQ_1743) - 1757357..1757882 (-) 526 Protein_1634 phage major tail protein, TP901-1 family -
  SEQ_RS08510 (SEQ_1744) - 1757958..1758347 (-) 390 WP_012679970.1 hypothetical protein -
  SEQ_RS08515 (SEQ_1745) - 1758344..1758709 (-) 366 WP_012679971.1 HK97-gp10 family putative phage morphogenesis protein -
  SEQ_RS08520 (SEQ_1746) - 1758690..1758998 (-) 309 WP_012679972.1 hypothetical protein -
  SEQ_RS08525 (SEQ_1747) - 1758995..1759348 (-) 354 WP_012679973.1 phage head-tail connector protein -
  SEQ_RS08530 (SEQ_1748) - 1759358..1759624 (-) 267 WP_012679974.1 HeH/LEM domain-containing protein -
  SEQ_RS08535 (SEQ_1749) - 1759635..1760684 (-) 1050 WP_012679975.1 major capsid protein -
  SEQ_RS08540 (SEQ_1750) - 1760687..1761067 (-) 381 WP_012679976.1 head decoration protein -
  SEQ_RS08545 (SEQ_1751) - 1761078..1761701 (-) 624 WP_042357153.1 DUF4355 domain-containing protein -
  SEQ_RS11620 (SEQ_1752) - 1761883..1762050 (-) 168 WP_012679978.1 hypothetical protein -
  SEQ_RS08550 (SEQ_1753) - 1762086..1762355 (-) 270 WP_012679979.1 hypothetical protein -
  SEQ_RS08555 (SEQ_1755) - 1762552..1762971 (-) 420 WP_012679981.1 HD domain-containing protein -
  SEQ_RS08560 (SEQ_1756) - 1762968..1763174 (-) 207 WP_003052398.1 hypothetical protein -
  SEQ_RS08565 (SEQ_1757) - 1763176..1764753 (-) 1578 WP_042357165.1 phage head morphogenesis protein -
  SEQ_RS08570 (SEQ_1758) - 1764734..1766236 (-) 1503 WP_012679983.1 phage portal protein -
  SEQ_RS08575 (SEQ_1759) - 1766248..1767495 (-) 1248 WP_012679984.1 PBSX family phage terminase large subunit -
  SEQ_RS08590 (SEQ_1762) - 1769086..1769874 (+) 789 WP_012679985.1 helix-turn-helix transcriptional regulator -
  SEQ_RS08595 (SEQ_1763) - 1769883..1771052 (+) 1170 WP_012679986.1 DUF4041 domain-containing protein -
  SEQ_RS08600 (SEQ_1764) - 1771055..1771258 (+) 204 WP_012679987.1 hypothetical protein -
  SEQ_RS08605 (SEQ_1765) - 1771390..1772823 (+) 1434 WP_012679988.1 recombinase family protein -
  SEQ_RS08610 (SEQ_1725) comFA/cflA 1772844..1774022 (-) 1179 WP_042357492.1 DEAD/DEAH box helicase Machinery gene

Sequence


Protein


Download         Length: 392 a.a.        Molecular weight: 44559.94 Da        Isoelectric Point: 9.7339

>NTDB_id=33536 SEQ_RS08610 WP_042357492.1 1772844..1774022(-) (comFA/cflA) [Streptococcus equi subsp. equi 4047]
MENIENYYGRLLPERQCPKAVSAWACSLQSMITKKGTLYCQRCSSLIEKAHQLPSGAYYCRACLVFGRNQSDRPLLYFPP
ASFPKGHYLRWQGQLTTYQAAISHQLTNHVKLKQDTLVHAVTGAGKTEMMYEAIAAVVNKGGWVCIASPRVDVCIELEKR
LSRDFSCQVCLMHAESEVYHRSPIIVATTHQLMTFYHAFDLLIIDEVDAFPFVNNRQLNHAAHQAAKADAVTVYLTATST
RDLERKVKQKELVKLTLARRFHGKPLVVPKYQRLFSLLEAINRGKLPRRFITLVKKQRATGYPLLIFFPIIELAEQCCQL
LHKYFPKETIAHASSQSSNRMAIIEQFRQGQITILISTTILERGVTFPTVDVFVLLANHRLYTSSSLIQIKG

Nucleotide


Download         Length: 1179 bp        

>NTDB_id=33536 SEQ_RS08610 WP_042357492.1 1772844..1774022(-) (comFA/cflA) [Streptococcus equi subsp. equi 4047]
ATGGAGAATATCGAAAATTACTATGGACGTCTTTTACCCGAAAGGCAATGCCCAAAGGCTGTTTCTGCCTGGGCTTGCAG
CTTACAAAGCATGATCACTAAAAAGGGAACGTTATACTGCCAACGCTGTAGCAGTTTAATTGAGAAGGCTCATCAGCTGC
CTAGCGGTGCTTACTACTGTAGAGCCTGTCTTGTTTTTGGTCGAAACCAATCTGATCGCCCCTTGCTCTATTTTCCACCG
GCTTCTTTTCCAAAGGGACATTATCTGAGGTGGCAAGGACAGCTCACGACATATCAGGCAGCTATCTCTCATCAGCTTAC
TAACCATGTTAAGCTCAAGCAAGACACCTTAGTTCATGCGGTTACTGGTGCTGGCAAAACAGAGATGATGTATGAAGCTA
TTGCAGCAGTTGTTAATAAGGGTGGCTGGGTCTGCATTGCTAGTCCACGAGTTGATGTTTGCATAGAGCTTGAAAAACGA
CTATCGCGGGACTTTTCCTGTCAAGTCTGCCTTATGCATGCTGAGTCAGAGGTTTATCATAGAAGCCCCATTATCGTTGC
CACAACACATCAATTGATGACCTTTTACCATGCTTTTGATCTGCTCATTATTGACGAAGTAGATGCCTTCCCCTTTGTCA
ATAATCGTCAATTAAACCATGCTGCTCATCAGGCTGCAAAAGCAGATGCAGTGACAGTATACCTAACAGCAACCTCTACA
AGAGATTTAGAGCGCAAGGTCAAGCAAAAAGAGCTTGTCAAATTGACGTTGGCAAGAAGATTCCATGGCAAGCCCTTAGT
TGTTCCAAAGTATCAAAGATTATTCTCCCTTTTAGAGGCTATCAATCGTGGGAAATTGCCTAGAAGGTTCATCACCCTAG
TCAAAAAACAAAGGGCAACCGGCTATCCTCTTTTAATCTTTTTTCCGATTATTGAGCTGGCTGAGCAATGCTGTCAGCTG
CTCCACAAGTATTTTCCTAAGGAAACTATTGCTCATGCTTCCAGTCAGTCATCAAATCGAATGGCTATCATTGAGCAATT
CAGACAAGGACAAATCACTATACTTATATCAACAACCATTTTGGAAAGAGGTGTGACCTTTCCAACCGTAGATGTCTTTG
TCTTATTAGCCAATCATCGCCTTTACACAAGCAGCAGTCTTATTCAAATCAAGGGTTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA/cflA Streptococcus mitis SK321

52.455

98.724

0.518

  comFA/cflA Streptococcus pneumoniae D39

52.455

98.724

0.518

  comFA/cflA Streptococcus pneumoniae R6

52.455

98.724

0.518

  comFA/cflA Streptococcus pneumoniae TIGR4

52.455

98.724

0.518

  comFA/cflA Streptococcus pneumoniae Rx1

52.455

98.724

0.518

  comFA/cflA Streptococcus mitis NCTC 12261

52.196

98.724

0.515

  comFA Lactococcus lactis subsp. cremoris KW2

46.893

90.306

0.423

  comFA Latilactobacillus sakei subsp. sakei 23K

38.482

94.133

0.362


Multiple sequence alignment