Detailed information    

insolico Bioinformatically predicted

Overview


Name   comFA/cflA   Type   Machinery gene
Locus tag   RLO20_RS08200 Genome accession   NZ_CP134538
Coordinates   1719206..1720384 (-) Length   392 a.a.
NCBI ID   WP_042357492.1    Uniprot ID   -
Organism   Streptococcus equi subsp. equi strain XJ5012     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1688403..1719185 1719206..1720384 flank 21


Gene organization within MGE regions


Location: 1688403..1720384
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  RLO20_RS08015 (RLO20_08015) prx 1688403..1688588 (-) 186 WP_012679954.1 hypothetical protein Regulator
  RLO20_RS08020 (RLO20_08020) spel 1688710..1689498 (-) 789 WP_012679955.1 streptococcal pyrogenic exotoxin SpeL -
  RLO20_RS08025 (RLO20_08025) spek 1689765..1690544 (-) 780 WP_012679956.1 streptococcal pyrogenic exotoxin SpeK -
  RLO20_RS08030 (RLO20_08030) - 1690669..1691898 (-) 1230 WP_012679957.1 glucosaminidase domain-containing protein -
  RLO20_RS08035 (RLO20_08035) - 1692010..1692189 (-) 180 WP_012679339.1 holin -
  RLO20_RS08040 (RLO20_08040) - 1692192..1692482 (-) 291 WP_317608656.1 hypothetical protein -
  RLO20_RS08045 (RLO20_08045) - 1692495..1693109 (-) 615 WP_012679958.1 DUF1366 domain-containing protein -
  RLO20_RS08050 (RLO20_08050) - 1693112..1693543 (-) 432 WP_050316147.1 DUF1617 family protein -
  RLO20_RS08055 (RLO20_08055) - 1693552..1695456 (-) 1905 WP_012679960.1 gp58-like family protein -
  RLO20_RS08060 (RLO20_08060) - 1695467..1696081 (-) 615 WP_012679961.1 hypothetical protein -
  RLO20_RS08065 (RLO20_08065) - 1696083..1696790 (-) 708 WP_012679962.1 collagen-like protein -
  RLO20_RS08070 (RLO20_08070) - 1696790..1698847 (-) 2058 WP_317608657.1 phage tail spike protein -
  RLO20_RS08075 (RLO20_08075) - 1698844..1699623 (-) 780 WP_012679964.1 distal tail protein Dit -
  RLO20_RS08080 (RLO20_08080) - 1699655..1702909 (-) 3255 WP_317608658.1 tape measure protein -
  RLO20_RS08085 (RLO20_08085) - 1702926..1703255 (-) 330 WP_012679966.1 hypothetical protein -
  RLO20_RS08090 (RLO20_08090) - 1703297..1703656 (-) 360 WP_012679967.1 tail assembly chaperone -
  RLO20_RS08095 (RLO20_08095) - 1703718..1704243 (-) 526 Protein_1567 phage major tail protein, TP901-1 family -
  RLO20_RS08100 (RLO20_08100) - 1704319..1704708 (-) 390 WP_012679970.1 hypothetical protein -
  RLO20_RS08105 (RLO20_08105) - 1704705..1705070 (-) 366 WP_012679971.1 HK97-gp10 family putative phage morphogenesis protein -
  RLO20_RS08110 (RLO20_08110) - 1705051..1705359 (-) 309 WP_012679972.1 hypothetical protein -
  RLO20_RS08115 (RLO20_08115) - 1705356..1705709 (-) 354 WP_012679973.1 phage head-tail connector protein -
  RLO20_RS08120 (RLO20_08120) - 1705719..1705985 (-) 267 WP_012679974.1 HeH/LEM domain-containing protein -
  RLO20_RS08125 (RLO20_08125) - 1705996..1707045 (-) 1050 WP_012679975.1 major capsid protein -
  RLO20_RS08130 (RLO20_08130) - 1707048..1707428 (-) 381 WP_012679976.1 head decoration protein -
  RLO20_RS08135 (RLO20_08135) - 1707439..1708062 (-) 624 WP_042357153.1 DUF4355 domain-containing protein -
  RLO20_RS08140 (RLO20_08140) - 1708244..1708411 (-) 168 WP_012679978.1 hypothetical protein -
  RLO20_RS08145 (RLO20_08145) - 1708447..1708716 (-) 270 WP_012679979.1 hypothetical protein -
  RLO20_RS08150 (RLO20_08150) - 1708913..1709332 (-) 420 WP_012679981.1 HD domain-containing protein -
  RLO20_RS08155 (RLO20_08155) - 1709329..1709535 (-) 207 WP_003052398.1 hypothetical protein -
  RLO20_RS08160 (RLO20_08160) - 1709537..1711102 (-) 1566 WP_012679982.1 minor capsid protein -
  RLO20_RS08165 (RLO20_08165) - 1711095..1712597 (-) 1503 WP_012679983.1 phage portal protein -
  RLO20_RS08170 (RLO20_08170) - 1712609..1713856 (-) 1248 WP_012679984.1 PBSX family phage terminase large subunit -
  RLO20_RS08180 (RLO20_08180) - 1715448..1716236 (+) 789 WP_012679985.1 helix-turn-helix transcriptional regulator -
  RLO20_RS08185 (RLO20_08185) - 1716245..1717414 (+) 1170 WP_012679986.1 DUF4041 domain-containing protein -
  RLO20_RS08190 (RLO20_08190) - 1717417..1717620 (+) 204 WP_012679987.1 hypothetical protein -
  RLO20_RS08195 (RLO20_08195) - 1717752..1719185 (+) 1434 WP_012679988.1 recombinase family protein -
  RLO20_RS08200 (RLO20_08200) comFA/cflA 1719206..1720384 (-) 1179 WP_042357492.1 DEAD/DEAH box helicase Machinery gene

Sequence


Protein


Download         Length: 392 a.a.        Molecular weight: 44559.94 Da        Isoelectric Point: 9.7339

>NTDB_id=880867 RLO20_RS08200 WP_042357492.1 1719206..1720384(-) (comFA/cflA) [Streptococcus equi subsp. equi strain XJ5012]
MENIENYYGRLLPERQCPKAVSAWACSLQSMITKKGTLYCQRCSSLIEKAHQLPSGAYYCRACLVFGRNQSDRPLLYFPP
ASFPKGHYLRWQGQLTTYQAAISHQLTNHVKLKQDTLVHAVTGAGKTEMMYEAIAAVVNKGGWVCIASPRVDVCIELEKR
LSRDFSCQVCLMHAESEVYHRSPIIVATTHQLMTFYHAFDLLIIDEVDAFPFVNNRQLNHAAHQAAKADAVTVYLTATST
RDLERKVKQKELVKLTLARRFHGKPLVVPKYQRLFSLLEAINRGKLPRRFITLVKKQRATGYPLLIFFPIIELAEQCCQL
LHKYFPKETIAHASSQSSNRMAIIEQFRQGQITILISTTILERGVTFPTVDVFVLLANHRLYTSSSLIQIKG

Nucleotide


Download         Length: 1179 bp        

>NTDB_id=880867 RLO20_RS08200 WP_042357492.1 1719206..1720384(-) (comFA/cflA) [Streptococcus equi subsp. equi strain XJ5012]
ATGGAGAATATCGAAAATTACTATGGACGTCTTTTACCCGAAAGGCAATGCCCAAAGGCTGTTTCTGCCTGGGCTTGCAG
CTTACAAAGCATGATCACTAAAAAGGGAACGTTATACTGCCAACGCTGTAGCAGTTTAATTGAGAAGGCTCATCAGCTGC
CTAGCGGTGCTTACTACTGTAGAGCCTGTCTTGTTTTTGGTCGAAACCAATCTGATCGCCCCTTGCTCTATTTTCCACCG
GCTTCTTTTCCAAAGGGACATTATCTGAGGTGGCAAGGACAGCTCACGACATATCAGGCAGCTATCTCTCATCAGCTTAC
TAACCATGTTAAGCTCAAGCAAGACACCTTAGTTCATGCGGTTACTGGTGCTGGCAAAACAGAGATGATGTATGAAGCTA
TTGCAGCAGTTGTTAATAAGGGTGGCTGGGTCTGCATTGCTAGTCCACGAGTTGATGTTTGCATAGAGCTTGAAAAACGA
CTATCGCGGGACTTTTCCTGTCAAGTCTGCCTTATGCATGCTGAGTCAGAGGTTTATCATAGAAGCCCCATTATCGTTGC
CACAACACATCAATTGATGACCTTTTACCATGCTTTTGATCTGCTCATTATTGACGAAGTAGATGCCTTCCCCTTTGTCA
ATAATCGTCAATTAAACCATGCTGCTCATCAGGCTGCAAAAGCAGATGCAGTGACAGTATACCTAACAGCAACCTCTACA
AGAGATTTAGAGCGCAAGGTCAAGCAAAAAGAGCTTGTCAAATTGACGTTGGCAAGAAGATTCCATGGCAAGCCCTTAGT
TGTTCCAAAGTATCAAAGATTATTCTCCCTTTTAGAGGCTATCAATCGTGGGAAATTGCCTAGAAGGTTCATCACCCTAG
TCAAAAAACAAAGGGCAACCGGCTATCCTCTTTTAATCTTTTTTCCGATTATTGAGCTGGCTGAGCAATGCTGTCAGCTG
CTCCACAAGTATTTTCCTAAGGAAACTATTGCTCATGCTTCCAGTCAGTCATCAAATCGAATGGCTATCATTGAGCAATT
CAGACAAGGACAAATCACTATACTTATATCAACAACCATTTTGGAAAGAGGTGTGACCTTTCCAACCGTAGATGTCTTTG
TCTTATTAGCCAATCATCGCCTTTACACAAGCAGCAGTCTTATTCAAATCAAGGGTTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA/cflA Streptococcus mitis SK321

52.455

98.724

0.518

  comFA/cflA Streptococcus pneumoniae D39

52.455

98.724

0.518

  comFA/cflA Streptococcus pneumoniae R6

52.455

98.724

0.518

  comFA/cflA Streptococcus pneumoniae TIGR4

52.455

98.724

0.518

  comFA/cflA Streptococcus pneumoniae Rx1

52.455

98.724

0.518

  comFA/cflA Streptococcus mitis NCTC 12261

52.196

98.724

0.515

  comFA Lactococcus lactis subsp. cremoris KW2

46.893

90.306

0.423

  comFA Latilactobacillus sakei subsp. sakei 23K

38.482

94.133

0.362