Detailed information    

insolico Bioinformatically predicted

Overview


Name   comFA/cflA   Type   Machinery gene
Locus tag   GU334_RS01900 Genome accession   NZ_CP047628
Coordinates   390884..392158 (-) Length   424 a.a.
NCBI ID   WP_138492267.1    Uniprot ID   A0A5R9CEM0
Organism   Lactococcus raffinolactis strain Lr_19_14     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 385884..397158
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GU334_RS01865 (GU334_01865) obgE 386287..387606 (+) 1320 WP_096039988.1 GTPase ObgE -
  GU334_RS01870 - 387693..387869 (+) 177 WP_003137484.1 hypothetical protein -
  GU334_RS01875 (GU334_01870) - 387866..388852 (+) 987 WP_167841048.1 PhoH family protein -
  GU334_RS01880 (GU334_01875) - 388845..389303 (+) 459 WP_138492264.1 hypothetical protein -
  GU334_RS01885 (GU334_01880) ybeY 389361..389852 (+) 492 WP_061773786.1 rRNA maturation RNase YbeY -
  GU334_RS01890 (GU334_01885) - 389836..390237 (+) 402 WP_061773787.1 diacylglycerol kinase family protein -
  GU334_RS01895 (GU334_01890) - 390240..390887 (-) 648 WP_167841049.1 ComF family protein -
  GU334_RS01900 (GU334_01895) comFA/cflA 390884..392158 (-) 1275 WP_138492267.1 DEAD/DEAH box helicase Machinery gene
  GU334_RS01905 (GU334_01900) - 392202..392825 (+) 624 WP_096039992.1 YigZ family protein -
  GU334_RS01915 (GU334_01910) - 393650..394513 (+) 864 WP_138492268.1 S1 RNA-binding domain-containing protein -
  GU334_RS01920 (GU334_01915) msrA 394510..395031 (+) 522 WP_167841050.1 peptide-methionine (S)-S-oxide reductase MsrA -
  GU334_RS01925 (GU334_01920) - 395028..395237 (+) 210 WP_096039994.1 YozE family protein -
  GU334_RS01930 (GU334_01925) rplK 395479..395904 (+) 426 WP_003139488.1 50S ribosomal protein L11 -
  GU334_RS01935 (GU334_01930) rplA 396077..396763 (+) 687 WP_061773794.1 50S ribosomal protein L1 -

Sequence


Protein


Download         Length: 424 a.a.        Molecular weight: 47611.65 Da        Isoelectric Point: 9.0014

>NTDB_id=414681 GU334_RS01900 WP_138492267.1 390884..392158(-) (comFA/cflA) [Lactococcus raffinolactis strain Lr_19_14]
MDNLYGRLLTRQELGDDFENLPKDTETFPGMSITPKSVTCHRCGTTSSLQAVKLEIPAYFCPECLHLGRVRSDELLYHLP
QQDFPPQDSLLWSGQLTPYQAEISKELMAAVDQKTQILVHAVTGAGKTEMIYAAVARVISSGGAVGIATPRTDVARELHA
RLSRDFSIPISLLHAESEPYFPTPLVISTTHQLLRFRHAFDLLIIDEVDAFPFADNDALYFAAEQSRKPTATLIYLTATS
TDKLDKLVKTGQLKQVSLSRRFHGNPLVVPKVIFSGSEKRIYRHIKKQRETGFPLLIFAPVIAFGQSFTERLRRLFPDEK
IGFVASTSEERAEEIDRFRQGQLTILVSTTILERGVTFPKVDVFVVNSHHRLFTKSSLIQIAGRAGRSPDRSTGLVYFFH
TGLTRDMTRAISEIRQMNRLGGFT

Nucleotide


Download         Length: 1275 bp        

>NTDB_id=414681 GU334_RS01900 WP_138492267.1 390884..392158(-) (comFA/cflA) [Lactococcus raffinolactis strain Lr_19_14]
ATGGATAACTTATACGGTCGCCTATTAACGCGACAAGAACTGGGGGATGACTTCGAAAATCTCCCAAAAGACACCGAGAC
CTTCCCTGGGATGTCAATCACACCAAAATCGGTGACTTGTCATCGATGCGGGACAACATCTTCTCTTCAAGCAGTTAAGT
TAGAGATTCCTGCTTACTTTTGTCCAGAGTGCCTTCACCTCGGACGGGTTCGGTCTGATGAATTACTCTACCACTTACCT
CAGCAAGATTTCCCACCTCAAGATTCCCTCCTTTGGAGCGGCCAACTCACGCCCTATCAAGCTGAAATTTCAAAGGAACT
GATGGCTGCAGTTGATCAGAAAACTCAGATTTTAGTCCATGCTGTTACAGGGGCTGGTAAAACTGAGATGATTTATGCTG
CCGTGGCTAGAGTCATCTCAAGTGGTGGGGCCGTTGGGATTGCGACGCCCAGGACCGACGTTGCCCGCGAACTACATGCC
AGATTATCTCGCGACTTTTCTATCCCAATCTCTCTCCTACACGCTGAATCAGAGCCTTACTTTCCGACACCACTCGTCAT
TTCAACCACCCACCAACTACTGCGCTTTCGCCATGCCTTTGATCTATTGATTATCGACGAAGTGGACGCCTTTCCCTTCG
CTGATAATGATGCCCTTTACTTTGCGGCTGAGCAGTCCCGAAAACCTACCGCAACGTTGATTTATCTAACCGCGACATCT
ACTGACAAACTCGATAAGCTGGTCAAAACTGGCCAGTTAAAACAGGTATCTTTGTCCCGTCGCTTTCATGGCAACCCACT
CGTTGTCCCAAAAGTCATCTTCAGTGGCAGCGAAAAACGCATCTACCGTCACATCAAAAAACAGCGGGAGACAGGTTTTC
CACTCCTCATTTTCGCGCCCGTCATTGCCTTTGGTCAATCGTTCACAGAGCGCTTGCGACGTCTGTTTCCTGACGAAAAA
ATTGGCTTTGTAGCCTCTACTAGTGAGGAGCGTGCTGAGGAGATTGACCGTTTTCGACAAGGGCAACTCACGATTTTAGT
TTCGACGACCATTTTAGAGCGAGGGGTGACTTTTCCTAAGGTTGATGTTTTTGTGGTCAATAGTCACCACCGTCTCTTTA
CCAAATCAAGTTTGATTCAGATTGCGGGAAGGGCTGGCCGAAGCCCTGATCGGTCGACTGGTCTGGTATACTTTTTTCAT
ACTGGCCTAACCAGAGACATGACCCGAGCTATTTCAGAAATTCGCCAGATGAATCGACTGGGAGGTTTTACATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A5R9CEM0

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA/cflA Streptococcus mitis SK321

55.684

100

0.566

  comFA/cflA Streptococcus pneumoniae Rx1

55.607

100

0.561

  comFA/cflA Streptococcus pneumoniae D39

55.607

100

0.561

  comFA/cflA Streptococcus pneumoniae R6

55.607

100

0.561

  comFA/cflA Streptococcus pneumoniae TIGR4

55.374

100

0.559

  comFA/cflA Streptococcus mitis NCTC 12261

54.988

100

0.559

  comFA Lactococcus lactis subsp. cremoris KW2

56.888

92.453

0.526

  comFA Latilactobacillus sakei subsp. sakei 23K

36.552

100

0.375

  comFA Bacillus subtilis subsp. subtilis str. 168

37.019

98.113

0.363


Multiple sequence alignment