Detailed information    

insolico Bioinformatically predicted

Overview


Name   comFA/cflA   Type   Machinery gene
Locus tag   ABZ559_RS02875 Genome accession   NZ_CP160400
Coordinates   568800..570101 (-) Length   433 a.a.
NCBI ID   WP_367007613.1    Uniprot ID   -
Organism   Streptococcus sp. ZY19097     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 563800..575101
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ABZ559_RS02850 (ABZ559_02850) gap 564128..565141 (+) 1014 WP_018376716.1 type I glyceraldehyde-3-phosphate dehydrogenase -
  ABZ559_RS02855 (ABZ559_02855) - 565267..566463 (+) 1197 WP_367007180.1 phosphoglycerate kinase -
  ABZ559_RS02860 (ABZ559_02860) - 566682..567233 (+) 552 WP_367007181.1 2'-5' RNA ligase family protein -
  ABZ559_RS02865 (ABZ559_02865) raiA 567517..568065 (-) 549 WP_273414199.1 ribosome-associated translation inhibitor RaiA -
  ABZ559_RS02870 (ABZ559_02870) - 568141..568803 (-) 663 WP_277291370.1 ComF family protein -
  ABZ559_RS02875 (ABZ559_02875) comFA/cflA 568800..570101 (-) 1302 WP_367007613.1 DEAD/DEAH box helicase Machinery gene
  ABZ559_RS02880 (ABZ559_02880) - 570158..570784 (+) 627 WP_367007182.1 YigZ family protein -
  ABZ559_RS02885 (ABZ559_02885) cysK 570883..571812 (+) 930 WP_018376842.1 cysteine synthase A -
  ABZ559_RS02890 (ABZ559_02890) - 572144..572473 (+) 330 WP_367007183.1 IS630 transposase-related protein -
  ABZ559_RS02895 (ABZ559_02895) - 572538..572993 (+) 456 WP_367007614.1 transposase -

Sequence


Protein


Download         Length: 433 a.a.        Molecular weight: 49325.30 Da        Isoelectric Point: 9.0943

>NTDB_id=1020920 ABZ559_RS02875 WP_367007613.1 568800..570101(-) (comFA/cflA) [Streptococcus sp. ZY19097]
MSELDNYYGRLFIEEQVPEKFRTHANYFPAMTETLGEYYCNRCGSNISKEAILQTGQYYCRECLIFGRNTSRSQLYHFPQ
KVFPPTNSLIWKGELTPYQQEVSDGLLSGLSLKENLLVHAVTGAGKTEMIYQVTAKVIDDGGCVCLASPRIDVCIELYKR
LTHDFSCPITLMHGGSDPYQRAPLIIATTHQLLKFRHAFDLLIVDEVDAFPFVDNNTLYYAVENCVKEDGVKIFLTATST
DELDKKVKRGELKKLHLARRFHANPLVIPKMVWLGRVPKEMKKGKLPVKLVNLIKKQRLTNFPLLLFFPDIEQGKIFTKI
LHSYFPEEKIDFVSSQTTNRLQIVDAFRNNQLTILVSTTILERGVTFPCVDVFVVSANHRLYSKSALIQISGRVGRAKER
PTGELLFLHDGKTKAMARAIREIKEMNIKGGFS

Nucleotide


Download         Length: 1302 bp        

>NTDB_id=1020920 ABZ559_RS02875 WP_367007613.1 568800..570101(-) (comFA/cflA) [Streptococcus sp. ZY19097]
ATGAGTGAATTAGACAATTATTATGGCCGCTTATTTATTGAGGAGCAGGTTCCAGAGAAATTTAGAACACATGCAAATTA
TTTTCCAGCAATGACGGAAACTCTAGGAGAATATTACTGTAACCGTTGCGGAAGTAATATTTCCAAGGAAGCAATCCTAC
AAACTGGTCAATACTACTGTAGGGAATGTTTGATTTTTGGTAGAAATACTAGTCGATCTCAACTTTATCATTTTCCACAA
AAAGTTTTTCCTCCTACCAATAGTCTTATATGGAAAGGGGAATTAACTCCCTATCAACAAGAGGTCTCTGATGGGTTGCT
TTCAGGGCTTTCCCTTAAGGAAAACCTCCTAGTTCATGCTGTAACAGGTGCAGGAAAAACTGAGATGATTTATCAAGTAA
CAGCCAAAGTTATTGATGATGGTGGCTGTGTTTGTCTCGCAAGTCCCAGAATTGACGTCTGCATAGAACTTTATAAGCGT
TTGACACATGACTTTTCCTGTCCTATCACCCTAATGCACGGCGGATCAGACCCTTATCAGAGAGCACCTCTGATTATCGC
AACAACACATCAACTCCTCAAGTTTCGGCACGCATTTGACCTACTAATTGTTGATGAAGTCGATGCGTTTCCCTTTGTGG
ATAACAATACGCTCTATTACGCCGTCGAAAACTGTGTGAAAGAAGATGGAGTAAAGATTTTTCTGACAGCCACTTCAACA
GATGAACTTGATAAGAAAGTCAAGAGGGGTGAATTGAAAAAATTACATCTTGCACGCCGTTTTCATGCAAATCCCTTGGT
AATTCCGAAAATGGTCTGGCTAGGAAGGGTTCCAAAAGAGATGAAAAAAGGAAAACTTCCTGTAAAATTAGTCAACTTAA
TAAAAAAACAAAGGCTGACAAATTTTCCTCTTTTACTCTTTTTTCCCGATATTGAACAAGGGAAAATTTTTACCAAAATT
CTGCACAGCTATTTTCCAGAAGAAAAGATAGATTTTGTATCAAGTCAAACTACTAATCGATTACAGATTGTAGATGCATT
TCGCAATAATCAGCTTACTATCCTTGTGTCAACAACAATTTTAGAGCGAGGAGTCACTTTTCCATGTGTTGACGTCTTTG
TCGTATCAGCAAATCATCGACTTTACAGCAAGAGTGCTCTTATACAAATTTCTGGTCGAGTCGGAAGAGCAAAAGAGCGA
CCAACAGGAGAACTGCTTTTTCTTCATGATGGCAAAACTAAGGCGATGGCGAGAGCCATCCGAGAAATAAAAGAAATGAA
CATCAAAGGAGGATTTTCGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA/cflA Streptococcus mitis NCTC 12261

61.124

98.614

0.603

  comFA/cflA Streptococcus pneumoniae Rx1

60.656

98.614

0.598

  comFA/cflA Streptococcus pneumoniae D39

60.656

98.614

0.598

  comFA/cflA Streptococcus pneumoniae R6

60.656

98.614

0.598

  comFA/cflA Streptococcus pneumoniae TIGR4

60.656

98.614

0.598

  comFA/cflA Streptococcus mitis SK321

59.953

98.614

0.591

  comFA Lactococcus lactis subsp. cremoris KW2

51.269

90.993

0.467


Multiple sequence alignment