Detailed information    

In silico: bioinformatically predicted

Overview


Name: comFA/cflA
Type: Machinery gene
Locus tag: ACLV7H_RS09780
Genome accession: NZ_OZ217340
Coordinates: 1919126..1920424 (-)
Length: 432 a.a.
NCBI ID: WP_045592629.1
UniProt ID: A0A0F2CVH5
Organism: Streptococcus oralis isolate S. oralis A22
Function: ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
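
The coordinates and the protein length reported above are consistent with each other, which can be checked with simple arithmetic. A minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon (the function names are illustrative and not part of the database):

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(1919126, 1920424)
print(cds_bp, protein_length(cds_bp))  # 1299 432
```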

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1918467..1942665 1919126..1920424 within 0
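
The "Relative position" and "Distance (bp)" columns can be reproduced by comparing the two coordinate ranges. The sketch below is an assumption about how such a summary might be derived, not VRprofile2's documented method; only the label "within" appears in this entry, and the other labels and the distance definition are hypothetical.

```python
def relate(gene, mge):
    """Return (relative position, distance in bp) of a gene versus an MGE.

    Both arguments are (start, end) tuples with 1-based inclusive coordinates.
    Labels other than "within" are hypothetical placeholders.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "upstream", m_start - g_end
    if g_start > m_end:
        return "downstream", g_start - m_end
    return "overlapping", 0  # gene straddles an MGE boundary


print(relate((1919126, 1920424), (1918467, 1942665)))  # ('within', 0)
```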


Gene organization within MGE regions


Location: 1918467..1942665
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACLV7H_RS09775 (SORA22_19700) comFC/cflB 1918467..1919129 (-) 663 WP_045592630.1 ComF family protein Machinery gene
  ACLV7H_RS09780 (SORA22_19710) comFA/cflA 1919126..1920424 (-) 1299 WP_045592629.1 DEAD/DEAH box helicase Machinery gene
  ACLV7H_RS09785 (SORA22_19720) - 1920481..1921116 (+) 636 WP_045592628.1 YigZ family protein -
  ACLV7H_RS09790 (SORA22_19730) - 1921132..1921575 (+) 444 WP_045592627.1 PH domain-containing protein -
  ACLV7H_RS09795 (SORA22_19740) cysK 1921672..1922598 (+) 927 WP_411867045.1 cysteine synthase A -
  ACLV7H_RS09800 (SORA22_19750) tsf 1922703..1923743 (-) 1041 WP_000808056.1 translation elongation factor Ts -
  ACLV7H_RS09805 (SORA22_19760) rpsB 1923823..1924602 (-) 780 WP_411867046.1 30S ribosomal protein S2 -
  ACLV7H_RS09810 (SORA22_19770) pcsB 1924825..1926042 (-) 1218 WP_045592624.1 peptidoglycan hydrolase PcsB -
  ACLV7H_RS09815 (SORA22_19780) mreD 1926136..1926630 (-) 495 WP_045592623.1 rod shape-determining protein MreD -
  ACLV7H_RS09820 (SORA22_19790) mreC 1926633..1927448 (-) 816 WP_045592622.1 rod shape-determining protein MreC -
  ACLV7H_RS09825 (SORA22_19800) - 1927508..1928302 (-) 795 WP_038804767.1 energy-coupling factor transporter transmembrane component T family protein -
  ACLV7H_RS09830 (SORA22_19810) - 1928295..1929134 (-) 840 WP_045592621.1 energy-coupling factor transporter ATPase -
  ACLV7H_RS09835 (SORA22_19820) - 1929119..1929946 (-) 828 WP_045592620.1 energy-coupling factor ABC transporter ATP-binding protein -
  ACLV7H_RS09840 (SORA22_19830) pgsA 1929943..1930488 (-) 546 WP_045592619.1 CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase -
  ACLV7H_RS09845 (SORA22_19840) rodZ 1930499..1931320 (-) 822 WP_411867047.1 cytoskeleton protein RodZ -
  ACLV7H_RS09850 (SORA22_19850) yfmH 1931357..1932640 (-) 1284 WP_411867048.1 EF-P 5-aminopentanol modification-associated protein YfmH -
  ACLV7H_RS09855 (SORA22_19860) yfmF 1932637..1933887 (-) 1251 WP_411867049.1 EF-P 5-aminopentanol modification-associated protein YfmF -
  ACLV7H_RS09860 (SORA22_19870) yaaA 1934046..1934414 (+) 369 WP_411867050.1 S4 domain-containing protein YaaA -
  ACLV7H_RS09865 (SORA22_19880) recF 1934417..1935514 (+) 1098 WP_411867051.1 DNA replication/repair protein RecF -
  ACLV7H_RS09870 (SORA22_19890) guaB 1935567..1937045 (-) 1479 WP_411867052.1 IMP dehydrogenase -
  ACLV7H_RS09875 (SORA22_19900) trpS 1937208..1938233 (-) 1026 WP_411867053.1 tryptophan--tRNA ligase -
  ACLV7H_RS09880 (SORA22_19910) - 1938429..1940051 (+) 1623 WP_411867054.1 ATP-binding cassette domain-containing protein -
  ACLV7H_RS09885 (SORA22_19920) - 1940113..1942665 (+) 2553 WP_411867055.1 YfhO family protein -
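
The coordinates in the table also show how neighbouring genes are spaced. As a small illustration, the following Python sketch computes the gap between consecutive genes for the first three rows; a negative value means the two open reading frames overlap. The tuple layout and the function name are illustrative assumptions.

```python
# (label, start, end) copied from the first three table rows, 1-based inclusive.
genes = [
    ("comFC/cflB", 1918467, 1919129),
    ("comFA/cflA", 1919126, 1920424),
    ("ACLV7H_RS09785", 1920481, 1921116),
]


def gap_bp(left, right):
    """Bases between the end of `left` and the start of `right`.

    Negative values give the size of the overlap in bp.
    """
    return right[1] - left[2] - 1


gaps = [(a[0], b[0], gap_bp(a, b)) for a, b in zip(genes, genes[1:])]
for left, right, gap in gaps:
    print(f"{left} -> {right}: {gap} bp")
# comFC/cflB -> comFA/cflA: -4 bp
# comFA/cflA -> ACLV7H_RS09785: 56 bp
```

Under these coordinates, comFC/cflB and comFA/cflA share 4 bp (1919126..1919129).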

Sequence


Protein


Length: 432 a.a.        Molecular weight: 49730.58 Da        Isoelectric Point: 8.5187

>NTDB_id=1169992 ACLV7H_RS09780 WP_045592629.1 1919126..1920424(-) (comFA/cflA) [Streptococcus oralis isolate S. oralis A22]
MKVNPNYLGRLFTEKELTEEERQMAEKLPTMRKEKGKLFCQRCNSSILEEWHLPIGAYYCRECLLMKRVRSDQALYYFPQ
EDFPKQDVLKWRGQLTSFQEKVSEGLLQAVDRQEPTLVHAVTGAGKTEMIYQVVAKVINDGGAVCLASPRIDVCLELYKR
LQNDFSCEIALLHGESESYFRTPLVVATTHQLLKFYHAFDLLIVDEVDAFPYVDNSVLYYAVNQCVKEEGLRIFLTATST
DELDKKVRTGELKRLSLPRRFHGNPLIIPKPVWLSDFNRYIEKSQLSPKLKSYIKKQRRTDYPLLIFASEIKKGEKLKKL
LQEQFPNENIGFVSSVTENRLEQVQAFRDGELTILISTTILERGVTFPCVDVFVVEANHRLFTKSSLIQIGGRVGRSMDR
PTGELLFFHDGLNVSIKKAIKEIKQMNKEAGL
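
The entry does not say how the molecular weight was calculated. A common approach sums the average residue masses and adds one water molecule for the free termini; the sketch below follows that approach. The mass table holds standard average residue masses, and whether the result matches the reported 49730.58 Da exactly depends on the mass table the database used, so no expected value is given for the full sequence.

```python
# Average residue masses in Da (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the N- and C-terminal groups


def molecular_weight(protein: str) -> float:
    """Average molecular weight in Da of an unmodified protein sequence."""
    return sum(RESIDUE_MASS[aa] for aa in protein.upper()) + WATER


# Example on the first residues of the sequence above.
print(round(molecular_weight("MKVNPNYLGRL"), 2))
```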

Nucleotide


Length: 1299 bp

>NTDB_id=1169992 ACLV7H_RS09780 WP_045592629.1 1919126..1920424(-) (comFA/cflA) [Streptococcus oralis isolate S. oralis A22]
ATGAAAGTAAATCCAAATTATCTCGGTCGCTTGTTTACTGAGAAAGAATTAACTGAAGAAGAACGTCAGATGGCTGAGAA
ACTTCCAACTATGAGAAAAGAGAAGGGGAAACTGTTTTGTCAACGTTGTAATAGTAGTATTCTAGAAGAATGGCATTTAC
CTATAGGCGCTTACTATTGTAGGGAGTGTTTATTGATGAAGAGAGTCAGGAGTGATCAAGCTTTATACTATTTTCCGCAG
GAGGATTTTCCTAAGCAAGACGTCCTCAAATGGCGTGGTCAGTTAACATCTTTTCAAGAAAAAGTGTCAGAGGGACTTCT
TCAAGCGGTAGACAGGCAAGAGCCAACCTTGGTTCACGCTGTAACAGGAGCTGGAAAGACAGAGATGATTTACCAGGTTG
TGGCTAAGGTAATCAATGACGGTGGTGCAGTTTGTTTGGCCAGTCCTCGAATAGATGTTTGCTTGGAGTTATATAAGCGA
CTGCAGAATGACTTTTCTTGTGAGATAGCACTACTTCATGGTGAGTCAGAATCCTATTTTCGAACACCACTAGTTGTTGC
AACGACTCATCAGCTGTTAAAATTTTATCATGCTTTTGACTTGCTTATAGTGGATGAAGTAGATGCCTTTCCTTATGTTG
ACAACTCTGTTCTTTACTATGCTGTAAACCAATGTGTAAAGGAGGAGGGGCTAAGGATATTTCTTACAGCGACTTCTACA
GATGAGTTAGATAAGAAGGTTCGCACAGGAGAATTAAAAAGGTTAAGCTTGCCGAGACGATTTCATGGAAATCCATTGAT
TATTCCAAAGCCAGTTTGGTTATCAGACTTTAATCGCTATATAGAAAAGAGTCAGTTGTCTCCAAAGTTAAAGTCCTACA
TTAAGAAGCAGAGAAGAACAGATTATCCTTTGCTAATCTTTGCATCAGAGATTAAGAAAGGCGAGAAACTAAAAAAACTC
TTGCAGGAACAGTTTCCAAATGAAAACATCGGCTTTGTGTCCTCTGTCACAGAAAATCGATTAGAGCAGGTACAAGCTTT
TCGAGATGGAGAGTTGACAATCCTTATTAGTACGACAATTTTGGAGCGTGGAGTCACCTTCCCTTGTGTGGATGTTTTCG
TTGTAGAAGCTAATCATCGTCTCTTTACCAAGTCTAGCTTGATTCAGATTGGAGGGCGAGTTGGGCGTAGTATGGATAGA
CCGACTGGTGAATTGCTCTTCTTTCATGATGGATTAAATGTTTCTATTAAAAAAGCAATCAAGGAAATTAAGCAGATGAA
CAAGGAGGCAGGCTTATGA
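
The nucleotide record is shown as the coding strand (it starts with ATG and ends with the stop codon TGA), so it can be translated directly and compared with the protein record. A minimal Python sketch using the standard codon assignments; the names are illustrative and the function stops at the first stop codon:

```python
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Codons enumerated in TCAG order, paired with the matching amino acid.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding-strand DNA sequence up to the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nt and last 30 nt of the record above.
print(translate("ATGAAAGTAAATCCAAATTATCTCGGTCGCTTG"))  # MKVNPNYLGRL
print(translate("AAGCAGATGAACAAGGAGGCAGGCTTATGA"))     # KQMNKEAGL
```

Applied to the full 1299 bp sequence, the same function should return 432 residues if the two records agree.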


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0F2CVH5

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA/cflA Streptococcus mitis NCTC 12261 90.046 100 0.9
  comFA/cflA Streptococcus pneumoniae Rx1 89.815 100 0.898
  comFA/cflA Streptococcus pneumoniae D39 89.815 100 0.898
  comFA/cflA Streptococcus pneumoniae R6 89.815 100 0.898
  comFA/cflA Streptococcus pneumoniae TIGR4 89.583 100 0.896
  comFA/cflA Streptococcus mitis SK321 88.657 100 0.887
  comFA Lactococcus lactis subsp. cremoris KW2 52.13 92.361 0.481
  comFA Latilactobacillus sakei subsp. sakei 23K 37.9 100 0.384


Multiple sequence alignment