Detailed information    

insolico Bioinformatically predicted

Overview


Name   comFA/cflA   Type   Machinery gene
Locus tag   DK43_RS07885 Genome accession   NZ_CP007573
Coordinates   1638802..1640103 (-) Length   433 a.a.
NCBI ID   WP_003028383.1    Uniprot ID   -
Organism   Streptococcus anginosus strain SA1     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1633802..1645103
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DK43_RS07855 (DK43_08045) - 1634552..1635115 (-) 564 WP_003043206.1 YutD family protein -
  DK43_RS07860 (DK43_08050) sepM 1635255..1636310 (-) 1056 WP_003028378.1 SepM family pheromone-processing serine protease Regulator
  DK43_RS07865 (DK43_08055) coaD 1636291..1636788 (-) 498 WP_003028379.1 pantetheine-phosphate adenylyltransferase -
  DK43_RS07870 (DK43_08060) rsmD 1636778..1637326 (-) 549 WP_176601969.1 16S rRNA (guanine(966)-N(2))-methyltransferase RsmD -
  DK43_RS07875 (DK43_08065) raiA 1637517..1638059 (-) 543 WP_003028381.1 ribosome-associated translation inhibitor RaiA -
  DK43_RS07880 (DK43_08070) - 1638140..1638805 (-) 666 WP_003028382.1 ComF family protein -
  DK43_RS07885 (DK43_08075) comFA/cflA 1638802..1640103 (-) 1302 WP_003028383.1 DEAD/DEAH box helicase Machinery gene
  DK43_RS07890 (DK43_08080) - 1640160..1640795 (+) 636 WP_003028384.1 YigZ family protein -
  DK43_RS07895 (DK43_08085) cysK 1640895..1641824 (+) 930 WP_003028385.1 cysteine synthase A -
  DK43_RS07900 (DK43_08090) - 1642230..1643510 (-) 1281 WP_003028386.1 hypothetical protein -
  DK43_RS07905 (DK43_08095) - 1643588..1644148 (-) 561 WP_022525299.1 hypothetical protein -
  DK43_RS07910 (DK43_08100) - 1644224..1644613 (-) 390 WP_003028388.1 hypothetical protein -

Sequence


Protein


Download         Length: 433 a.a.        Molecular weight: 48876.73 Da        Isoelectric Point: 8.3812

>NTDB_id=121225 DK43_RS07885 WP_003028383.1 1638802..1640103(-) (comFA/cflA) [Streptococcus anginosus strain SA1]
MTELQDCLGRIFTKSQLSPELQLQAQSLAGMVEEKGKLSCNRCGQAIDKEKQQLPIGAYYCRSCLILGRVRSDEELYYFP
QEEFPKANVLKWQGKLTEFQAKVSQGLVEAVTKRKDSLVHAVTGAGKTEMIYQVVAQVINQGGAVCLASPRIDVCLELYR
RLKVDFTCDISLLHGESEAYSRSPLVIATTHQLLKFYQAFDLLIVDEVDAFPYVDNPMLYHAVHQAVKVEGTKIFLTATS
TDELDKKVAKGELTRLSLPRRFHGNPLIVPQKIWLADFQKYLGQKKLVPKLRQFIQKQRKTSFPLLIFASEIKKGQELAE
ILQSTFPNEKVGFVASTTENRLEIVEKFRQKEITILVTTTILERGVTFPCVDVFVVEANHRLFSRSALVQIAGRVGRSMD
RPTGELIFFHDGATMAIEKAIKEIQEMNQEAGL

Nucleotide


Download         Length: 1302 bp        

>NTDB_id=121225 DK43_RS07885 WP_003028383.1 1638802..1640103(-) (comFA/cflA) [Streptococcus anginosus strain SA1]
ATGACGGAATTACAAGATTGCTTAGGTCGTATTTTTACAAAAAGTCAATTGTCACCAGAATTGCAATTGCAAGCACAAAG
CTTAGCTGGAATGGTAGAAGAGAAAGGAAAGTTAAGCTGTAATCGCTGTGGACAAGCCATTGACAAAGAAAAACAGCAAC
TACCAATTGGTGCTTATTATTGTAGGTCTTGCTTGATCTTAGGAAGGGTTAGGAGTGATGAAGAACTATACTATTTTCCA
CAGGAAGAATTTCCCAAAGCGAATGTTTTGAAATGGCAAGGAAAGTTGACAGAATTTCAAGCTAAGGTTTCTCAAGGACT
TGTAGAGGCGGTTACCAAACGCAAAGATAGCTTGGTTCACGCAGTCACAGGAGCCGGAAAGACGGAAATGATCTATCAGG
TGGTGGCACAAGTCATCAATCAAGGTGGAGCCGTCTGCTTGGCTAGCCCCAGAATTGATGTCTGCTTAGAACTTTATCGC
AGACTGAAAGTAGATTTTACTTGCGATATTTCACTCCTGCACGGTGAATCAGAAGCATATTCCCGCAGTCCTCTCGTGAT
TGCTACAACGCATCAGCTTCTCAAATTTTATCAAGCCTTTGATCTTCTTATCGTTGATGAAGTAGATGCCTTTCCTTACG
TGGACAATCCGATGCTTTATCATGCGGTTCATCAGGCAGTTAAAGTAGAGGGAACGAAGATTTTCTTAACAGCAACTTCC
ACAGATGAGTTGGATAAAAAAGTGGCTAAAGGAGAATTAACTCGTTTGAGTCTACCTAGACGTTTTCATGGGAATCCTTT
GATTGTTCCTCAAAAAATTTGGTTGGCAGATTTTCAAAAATATCTTGGTCAAAAGAAGCTAGTTCCTAAGTTAAGACAGT
TTATTCAAAAGCAGAGAAAAACCAGCTTTCCACTTCTTATTTTTGCTTCTGAGATTAAAAAGGGGCAGGAATTAGCAGAG
ATTCTTCAAAGCACTTTTCCTAATGAAAAAGTTGGCTTTGTAGCCTCGACGACTGAAAATCGACTGGAAATTGTAGAGAA
ATTTCGTCAAAAAGAAATCACGATTTTGGTAACAACGACAATTCTAGAACGTGGTGTAACTTTTCCTTGTGTAGATGTTT
TTGTGGTGGAGGCTAACCACCGTTTGTTTAGTCGCAGCGCTCTGGTACAAATCGCTGGACGCGTCGGTCGCAGTATGGAT
CGACCGACAGGCGAGTTAATCTTTTTTCATGACGGTGCGACTATGGCGATAGAAAAAGCGATTAAAGAAATTCAGGAGAT
GAATCAGGAGGCCGGTTTATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA/cflA Streptococcus pneumoniae Rx1

72.43

98.845

0.716

  comFA/cflA Streptococcus pneumoniae D39

72.43

98.845

0.716

  comFA/cflA Streptococcus pneumoniae R6

72.43

98.845

0.716

  comFA/cflA Streptococcus pneumoniae TIGR4

72.196

98.845

0.714

  comFA/cflA Streptococcus mitis NCTC 12261

72.066

98.383

0.709

  comFA/cflA Streptococcus mitis SK321

71.596

98.383

0.704

  comFA Lactococcus lactis subsp. cremoris KW2

55.779

91.917

0.513

  comFA Latilactobacillus sakei subsp. sakei 23K

40.647

100

0.406

  comFA Bacillus subtilis subsp. subtilis str. 168

39.409

93.764

0.37


Multiple sequence alignment