Detailed information    

insolico Bioinformatically predicted

Overview


Name   comFA/cflA   Type   Machinery gene
Locus tag   NBW44_RS07760 Genome accession   NZ_OW724079
Coordinates   1528224..1529522 (+) Length   432 a.a.
NCBI ID   WP_250307862.1    Uniprot ID   -
Organism   Streptococcus sp. Marseille-Q3533     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1489387..1529522 1528224..1529522 within 0


Gene organization within MGE regions


Location: 1489387..1529522
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NBW44_RS07545 - 1489387..1489581 (-) 195 WP_006150549.1 DUF951 family protein -
  NBW44_RS07550 dnaN 1489648..1490784 (-) 1137 WP_250307828.1 DNA polymerase III subunit beta -
  NBW44_RS07555 dnaA 1490943..1492301 (-) 1359 WP_250307993.1 chromosomal replication initiator protein DnaA -
  NBW44_RS07560 spo0J 1492514..1493272 (-) 759 WP_250307829.1 ParB/RepB/Spo0J family partition protein Regulator
  NBW44_RS07565 htrA 1493329..1494513 (-) 1185 WP_250307830.1 trypsin-like peptidase domain-containing protein Regulator
  NBW44_RS07570 rlmH 1494698..1495177 (+) 480 WP_250307831.1 23S rRNA (pseudouridine(1915)-N(3))-methyltransferase RlmH -
  NBW44_RS07580 comD 1495461..1496561 (+) 1101 WP_284346641.1 histidine kinase Regulator
  NBW44_RS07585 - 1496601..1497764 (-) 1164 WP_250307832.1 tyrosine-type recombinase/integrase -
  NBW44_RS07590 - 1497820..1498359 (-) 540 WP_250307833.1 helix-turn-helix transcriptional regulator -
  NBW44_RS07595 - 1498522..1498812 (+) 291 WP_250307834.1 hypothetical protein -
  NBW44_RS07600 - 1498963..1500330 (+) 1368 WP_250307835.1 virulence-associated E family protein -
  NBW44_RS07605 - 1501214..1501630 (-) 417 WP_250307836.1 hypothetical protein -
  NBW44_RS07610 - 1501639..1502394 (-) 756 WP_250307837.1 hypothetical protein -
  NBW44_RS07615 - 1502406..1502678 (-) 273 WP_250307838.1 hypothetical protein -
  NBW44_RS07620 - 1502696..1503157 (-) 462 WP_250307839.1 hypothetical protein -
  NBW44_RS07625 - 1503331..1503657 (+) 327 WP_250307840.1 helix-turn-helix transcriptional regulator -
  NBW44_RS07630 - 1503825..1504130 (+) 306 Protein_1469 GHKL domain-containing protein -
  NBW44_RS07635 comE 1504127..1504879 (+) 753 WP_250307841.1 competence system response regulator transcription factor ComE Regulator
  NBW44_RS07650 - 1505121..1505663 (-) 543 WP_247922302.1 TetR/AcrR family transcriptional regulator -
  NBW44_RS07655 - 1505792..1508446 (+) 2655 WP_250307842.1 YhgE/Pip domain-containing protein -
  NBW44_RS07660 - 1508640..1510262 (-) 1623 WP_250307843.1 ATP-binding cassette domain-containing protein -
  NBW44_RS07665 trpS 1510458..1511483 (+) 1026 WP_250307844.1 tryptophan--tRNA ligase -
  NBW44_RS07670 guaB 1511646..1513124 (+) 1479 WP_250307845.1 IMP dehydrogenase -
  NBW44_RS07675 recF 1513189..1514280 (-) 1092 WP_250307846.1 DNA replication/repair protein RecF -
  NBW44_RS07680 yaaA 1514283..1514642 (-) 360 WP_250307847.1 S4 domain-containing protein YaaA -
  NBW44_RS07685 - 1514805..1516055 (+) 1251 WP_250307848.1 pitrilysin family protein -
  NBW44_RS07690 - 1516052..1517335 (+) 1284 WP_250307849.1 pitrilysin family protein -
  NBW44_RS07695 rodZ 1517375..1518202 (+) 828 WP_250307850.1 cytoskeleton protein RodZ -
  NBW44_RS07700 pgsA 1518214..1518753 (+) 540 WP_075565198.1 CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase -
  NBW44_RS07705 - 1518756..1519583 (+) 828 WP_250307851.1 energy-coupling factor ABC transporter ATP-binding protein -
  NBW44_RS07710 - 1519568..1520407 (+) 840 WP_250307852.1 energy-coupling factor transporter ATPase -
  NBW44_RS07715 - 1520400..1521194 (+) 795 WP_250307853.1 energy-coupling factor transporter transmembrane component T -
  NBW44_RS07720 mreC 1521223..1522044 (+) 822 WP_250307854.1 rod shape-determining protein MreC -
  NBW44_RS07725 mreD 1522041..1522535 (+) 495 WP_250307855.1 rod shape-determining protein MreD -
  NBW44_RS07730 - 1522627..1523832 (+) 1206 WP_250307856.1 peptidoglycan hydrolase PcsB -
  NBW44_RS07735 rpsB 1524063..1524845 (+) 783 WP_250307857.1 30S ribosomal protein S2 -
  NBW44_RS07740 tsf 1524924..1525964 (+) 1041 WP_250307858.1 translation elongation factor Ts -
  NBW44_RS07745 cysK 1526053..1526979 (-) 927 WP_250307859.1 cysteine synthase A -
  NBW44_RS07750 - 1527076..1527519 (-) 444 WP_250307860.1 PH domain-containing protein -
  NBW44_RS07755 - 1527533..1528168 (-) 636 WP_250307861.1 YigZ family protein -
  NBW44_RS07760 comFA/cflA 1528224..1529522 (+) 1299 WP_250307862.1 DEAD/DEAH box helicase Machinery gene

Sequence


Protein


Download         Length: 432 a.a.        Molecular weight: 49743.72 Da        Isoelectric Point: 8.8281

>NTDB_id=1152621 NBW44_RS07760 WP_250307862.1 1528224..1529522(+) (comFA/cflA) [Streptococcus sp. Marseille-Q3533]
MKVNPNYLGRLFTEQELSLEERQLAKQLLSMRKEKEQLVCQRCGSRVEEEWHLPVGGFYCRECLIMKRVRNDEPLYYFPQ
EDFPRQDVLKWSGQLTPFQEQVSMGLLQAVDKKEPSLVHAVTGAGKTEMIYQVVAKVINRGGAVCLASPRIDVCLELYKR
LQSDFTCKISLLHGESEPYFRTPLVVATTHQLLKFYQAFDLLIVDEVDAFPYVDNPILYHAVNNCVKESGLKIFLTATST
DELDKKVKQGELKRLSLPRRFHGNPLVIPKPVWLSDFDKHLQKNRLVPKLRRYIEKQRETAYPLLIFASEIEKGERLKEI
LQRAFPNEKIGFVSSATEDRLQQVQAFRDRELTILISTTILERGVTFPCVDVFVVEANHKLFTKSSLIQIGGRVGRSMDR
PTGQLIFFHDGINRSIKKAIKEVKEMNQEAGL

Nucleotide


Download         Length: 1299 bp        

>NTDB_id=1152621 NBW44_RS07760 WP_250307862.1 1528224..1529522(+) (comFA/cflA) [Streptococcus sp. Marseille-Q3533]
ATGAAAGTTAATCCAAATTATCTTGGTCGTCTGTTTACCGAACAAGAATTGAGTCTCGAAGAGCGACAACTGGCTAAACA
ACTTCTTTCTATGAGAAAAGAGAAAGAACAACTAGTCTGTCAGCGTTGTGGTAGCAGGGTTGAGGAAGAATGGCATCTGC
CTGTTGGAGGATTTTATTGCCGAGAGTGTTTGATTATGAAGAGAGTTAGAAATGATGAGCCTCTTTATTATTTCCCTCAA
GAAGATTTTCCGAGACAAGATGTTTTAAAATGGTCAGGTCAATTAACGCCTTTTCAAGAACAGGTGTCAATGGGTTTGCT
TCAAGCAGTAGATAAGAAGGAACCTAGCTTAGTACATGCAGTTACGGGTGCGGGAAAGACAGAGATGATCTACCAAGTTG
TTGCTAAAGTAATTAATAGGGGTGGAGCAGTCTGTTTGGCAAGTCCAAGAATAGATGTCTGCTTGGAGTTATATAAGCGG
CTACAGAGTGATTTTACCTGTAAAATTTCTCTACTACATGGTGAATCAGAGCCCTACTTTCGGACTCCCTTAGTGGTTGC
TACAACACATCAGTTACTAAAATTCTATCAAGCTTTTGATTTATTAATAGTTGATGAAGTGGACGCTTTTCCTTATGTAG
ACAATCCTATACTGTATCATGCTGTCAATAACTGTGTAAAAGAGAGTGGTTTGAAAATTTTTTTGACGGCTACTTCTACA
GATGAGTTGGATAAGAAGGTCAAGCAAGGGGAATTAAAGCGTCTTAGTCTACCACGCCGTTTTCATGGCAATCCTTTAGT
TATTCCAAAGCCAGTTTGGTTATCTGACTTCGATAAACATTTACAAAAAAATCGTTTAGTACCCAAATTAAGGCGTTATA
TCGAGAAACAAAGAGAGACGGCTTATCCTTTACTGATTTTTGCATCAGAGATTGAAAAAGGAGAAAGGCTAAAGGAAATC
TTACAGCGTGCCTTTCCCAATGAGAAAATTGGTTTTGTTTCTTCTGCTACAGAGGATCGTTTGCAACAAGTACAAGCATT
TCGAGATAGAGAGCTGACTATTTTGATAAGTACAACCATCCTAGAACGCGGGGTTACTTTCCCGTGTGTGGATGTTTTTG
TAGTAGAAGCTAATCACAAACTCTTTACCAAGTCTAGTTTAATCCAAATTGGAGGACGAGTGGGGAGAAGTATGGATCGT
CCTACAGGTCAACTTATCTTCTTTCATGATGGGATAAATAGATCTATCAAGAAAGCTATAAAGGAAGTCAAAGAGATGAA
TCAGGAGGCAGGTCTATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA/cflA Streptococcus mitis NCTC 12261

84.028

100

0.84

  comFA/cflA Streptococcus pneumoniae Rx1

83.333

100

0.833

  comFA/cflA Streptococcus pneumoniae D39

83.333

100

0.833

  comFA/cflA Streptococcus pneumoniae R6

83.333

100

0.833

  comFA/cflA Streptococcus pneumoniae TIGR4

83.102

100

0.831

  comFA/cflA Streptococcus mitis SK321

82.639

100

0.826

  comFA Lactococcus lactis subsp. cremoris KW2

52.538

91.204

0.479

  comFA Latilactobacillus sakei subsp. sakei 23K

38.337

100

0.384


Multiple sequence alignment