Detailed information    

In silico: bioinformatically predicted

Overview


Name   comFA/cflA
Type   Machinery gene
Locus tag   GJS33_RS07790
Genome accession   NZ_CP046041
Coordinates   1659524..1660846 (-)
Length   440 a.a.
NCBI ID   WP_111692701.1
UniProt ID   -
Organism   Streptococcus equi subsp. zooepidemicus strain AZ-45470
Function   ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
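The coordinates and the protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and are not part of any database tool.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends with exactly one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


# Coordinates of GJS33_RS07790 as listed in the overview
cds_bp = span_length(1659524, 1660846)
print(cds_bp, protein_length(cds_bp))  # 1323 440
```

The result agrees with the 1323 bp and 440 a.a. lengths given in the Sequence section.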

Genomic Context


Location: 1654524..1665846
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GJS33_RS07775 (GJS33_07775) - 1656766..1658162 (+) 1397 Protein_1515 ISNCY-like element ISSeq2 family transposase -
  GJS33_RS07780 (GJS33_07780) hpf 1658260..1658808 (-) 549 WP_014623350.1 ribosome hibernation-promoting factor, HPF/YfiA family -
  GJS33_RS07785 (GJS33_07785) - 1658884..1659465 (-) 582 WP_126437237.1 ComF family protein -
  GJS33_RS07790 (GJS33_07790) comFA/cflA 1659524..1660846 (-) 1323 WP_111692701.1 DEAD/DEAH box helicase Machinery gene
  GJS33_RS07795 (GJS33_07795) - 1660902..1661543 (+) 642 WP_043031300.1 YigZ family protein -
  GJS33_RS07800 (GJS33_07800) cysK 1661664..1662590 (+) 927 WP_043041008.1 cysteine synthase A -
  GJS33_RS07805 (GJS33_07805) - 1662977..1663339 (-) 363 WP_111692702.1 S1 RNA-binding domain-containing protein -
  GJS33_RS07810 (GJS33_07810) - 1663339..1664739 (-) 1401 WP_111692703.1 bifunctional Cof-type HAD-IIB family hydrolase/peptidylprolyl isomerase -
  GJS33_RS07815 (GJS33_07815) - 1664772..1665413 (-) 642 WP_041785672.1 response regulator transcription factor -

Sequence


Protein


Length: 440 a.a.        Molecular weight: 50014.12 Da        Isoelectric point: 9.5419

>NTDB_id=400537 GJS33_RS07790 WP_111692701.1 1659524..1660846(-) (comFA/cflA) [Streptococcus equi subsp. zooepidemicus strain AZ-45470]
MENIENYYGRLLPERQCPKAVSVWACSLQSMIAKKGTLYCQRCSSLIEKAHQLPSGAYYCRACLVFGRNQSDRPLLYFPP
APFPKAHYLRWQGQLTTYQGAVSHQLTNHVKLKQDTLVHAVTGAGKTEMMYEAIAAVVDKGGWVCIASPRVDVCIELEKR
LSRDFSCQVCLMHAESEVYHRSPIIVATTHQLMTFYHAFDLLIIDEVDAFPFVNNRQLNHAAHQAAKADAVTVYLTATST
RDLERKVKQKELVKLTLARRFHGKPLVVPKYQRLFSLLEAINRGKLPRRFITLVKKQRATGYPLLIFFPIIELAEQCCQL
LQKYFPKETIAHASSQSSNRMTIIEQFRQGQITILISTTILERGVTFPTVDVFVLLANHHLYTSSSLIQISGRVGRSLER
PEGQVLFFHDGISQAMLKAVREIKDMNQKGYQNELSTLPQ

Nucleotide


Length: 1323 bp

>NTDB_id=400537 GJS33_RS07790 WP_111692701.1 1659524..1660846(-) (comFA/cflA) [Streptococcus equi subsp. zooepidemicus strain AZ-45470]
ATGGAGAATATCGAAAATTACTATGGACGTCTTTTACCCGAAAGGCAATGCCCAAAGGCTGTTTCTGTCTGGGCTTGCAG
CTTACAAAGCATGATCGCTAAAAAAGGAACATTATACTGCCAACGCTGTAGCAGCTTAATTGAGAAAGCTCATCAGCTAC
CTAGCGGTGCTTACTATTGTAGAGCCTGTCTCGTTTTTGGTCGAAACCAATCTGATCGGCCCTTGCTCTATTTTCCACCG
GCTCCTTTTCCAAAGGCACATTATCTGAGGTGGCAAGGACAGCTCACAACATATCAGGGAGCTGTCTCTCATCAGCTTAC
TAACCATGTTAAGCTCAAGCAAGACACACTGGTTCATGCCGTTACCGGCGCTGGTAAAACAGAGATGATGTATGAAGCCA
TTGCAGCAGTTGTTGATAAGGGTGGCTGGGTCTGCATTGCTAGTCCACGAGTTGATGTCTGCATAGAGCTTGAAAAACGA
CTATCGCGGGACTTTTCCTGTCAAGTCTGCCTTATGCATGCTGAGTCAGAGGTTTATCATAGAAGTCCCATTATCGTTGC
CACAACACATCAATTGATGACCTTTTACCATGCTTTTGATCTGCTCATTATTGACGAAGTAGATGCCTTCCCCTTTGTCA
ATAATCGTCAATTAAACCATGCTGCTCATCAGGCTGCAAAAGCAGATGCAGTGACAGTATACCTAACAGCAACCTCTACA
AGAGATTTAGAGCGCAAGGTCAAGCAAAAAGAGCTTGTCAAATTGACGTTGGCAAGAAGATTCCATGGCAAGCCCTTAGT
TGTTCCAAAGTATCAAAGATTATTCTCCCTTTTAGAGGCTATCAATCGCGGGAAATTGCCTAGAAGGTTCATCACCCTAG
TCAAAAAACAAAGGGCAACCGGCTATCCTCTTTTAATCTTTTTTCCGATTATTGAGCTGGCTGAGCAATGCTGTCAGCTG
TTGCAAAAGTATTTTCCTAAGGAAACTATTGCTCATGCTTCCAGTCAGTCATCAAATCGAATGACTATCATTGAGCAATT
CAGACAAGGACAAATCACTATACTTATATCAACAACCATTTTGGAAAGAGGTGTGACCTTTCCAACCGTAGATGTCTTTG
TCTTATTAGCCAATCATCACCTTTACACAAGCAGCAGTCTTATTCAAATTTCAGGTCGCGTTGGGCGCTCTTTGGAAAGA
CCTGAAGGACAGGTGCTATTTTTTCATGATGGTATTAGCCAGGCTATGCTTAAGGCAGTGCGGGAGATAAAGGACATGAA
TCAAAAAGGGTATCAAAATGAACTGTCTACTCTGCCACAATAA
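Both FASTA headers in this record follow the same field order: NTDB identifier, locus tag, protein accession, coordinates with strand, gene name in parentheses, and organism in square brackets. A minimal parsing sketch is shown here; the regular expression is inferred from this single record and is an assumption, not a documented format specification.

```python
import re

# Header copied from this record; the layout is assumed to hold for similar entries.
HEADER = (
    ">NTDB_id=400537 GJS33_RS07790 WP_111692701.1 "
    "1659524..1660846(-) (comFA/cflA) "
    "[Streptococcus equi subsp. zooepidemicus strain AZ-45470]"
)

HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>.*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header of the assumed layout into named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the assumed layout")
    record = match.groupdict()
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record


record = parse_header(HEADER)
print(record["locus_tag"], record["gene"], record["strand"])
```

Keeping the strand as a separate field matters here because the gene lies on the minus strand, so the nucleotide sequence shown is the reverse complement of the genome accession over those coordinates.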


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA/cflA Streptococcus pneumoniae R6 52.471 96.591 0.507
  comFA/cflA Streptococcus pneumoniae Rx1 52.471 96.591 0.507
  comFA/cflA Streptococcus pneumoniae D39 52.471 96.591 0.507
  comFA/cflA Streptococcus pneumoniae TIGR4 52.471 96.591 0.507
  comFA/cflA Streptococcus mitis SK321 52.471 96.591 0.507
  comFA/cflA Streptococcus mitis NCTC 12261 52.235 96.591 0.505
  comFA Lactococcus lactis subsp. cremoris KW2 48.214 89.091 0.43
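The Ha-values listed above are numerically consistent with the product of identity and coverage, each taken as a fraction. This relationship is inferred from the numbers in this record and is an assumption, not a definition stated on the page. A short check under that assumption:

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed relationship: Ha = (identity / 100) * (coverage / 100)."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# (organism, identity %, coverage %, listed Ha-value), one row per distinct value set
rows = [
    ("Streptococcus pneumoniae R6", 52.471, 96.591, 0.507),
    ("Streptococcus mitis NCTC 12261", 52.235, 96.591, 0.505),
    ("Lactococcus lactis subsp. cremoris KW2", 48.214, 89.091, 0.43),
]

for organism, identity, coverage, listed in rows:
    computed = ha_value(identity, coverage)
    # Listed values are rounded to three decimals, so allow a small tolerance.
    agrees = abs(computed - listed) < 0.001
    print(f"{organism}: computed {computed:.3f}, listed {listed}, agrees: {agrees}")
```

Under this reading, the Lactococcus lactis hit scores lower than the streptococcal hits because both its identity and its coverage are lower.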


Multiple sequence alignment