Detailed information    

insolico Bioinformatically predicted

Overview


Name   comFA   Type   Machinery gene
Locus tag   CG474_RS20305 Genome accession   NZ_CP024120
Coordinates   3847625..3848974 (-) Length   449 a.a.
NCBI ID   WP_048723495.1    Uniprot ID   -
Organism   Bacillus cytotoxicus strain CH_1     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3842625..3853974
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CG474_RS20285 (CG474_020190) secA 3842835..3845345 (-) 2511 WP_048723492.1 preprotein translocase subunit SecA -
  CG474_RS20290 (CG474_020195) hpf 3845708..3846247 (-) 540 WP_012096178.1 ribosome hibernation-promoting factor, HPF/YfiA family -
  CG474_RS20295 (CG474_020205) - 3846601..3846798 (-) 198 WP_012096179.1 cold shock domain-containing protein -
  CG474_RS20300 (CG474_020210) - 3846921..3847625 (-) 705 WP_048723493.1 ComF family protein -
  CG474_RS20305 (CG474_020215) comFA 3847625..3848974 (-) 1350 WP_048723495.1 DEAD/DEAH box helicase Machinery gene
  CG474_RS20310 (CG474_020220) - 3849100..3850329 (-) 1230 WP_048723497.1 C40 family peptidase -
  CG474_RS20315 (CG474_020225) - 3850600..3850914 (-) 315 WP_012096183.1 winged helix-turn-helix transcriptional regulator -
  CG474_RS20320 (CG474_020230) - 3851199..3852041 (-) 843 WP_094398369.1 DegV family protein -
  CG474_RS20325 (CG474_020235) - 3852432..3853070 (+) 639 WP_012096185.1 YigZ family protein -

Sequence


Protein


Download         Length: 449 a.a.        Molecular weight: 51530.44 Da        Isoelectric Point: 9.7376

>NTDB_id=252272 CG474_RS20305 WP_048723495.1 3847625..3848974(-) (comFA) [Bacillus cytotoxicus strain CH_1]
MIQKGRQMLLHELSWSKEQLHDFISRGEIVLEKGIVRKNSNYICQRCGNQEKRLFASFLCKRCNQVCAYCRKCIMMGRVS
ECTMLVRSVCEGKQECFPHALQWKGVLSPGQNRAAKGVVEAIKQKESFFIWAVCGAGKTEMLFSGIDEALKNGKRVCIAT
PRKDVVLELAPRLQEVFPDIPFATLYGGSKEYEKEAMLIVTTTHQLLRYYRAFDVMIIDEIDAFPYVMDHMLHEAVRKAA
KEDASYIYLTATPEEKWKQSVQNGKQKGVIISGRYHRHPLPVPKLKWCGNWRKLLRRKQIPQVLLDWLLIYLQKHYPIFL
FVPHICYVEEVTAVLQRLHHKITGVHAEDPMRKEKVEAFRKGDIPLLITTTILERGVTVANLQVAVLGAEEDIFSESALV
QIAGRVGRSSHNPKGEVIYFHYGKTKAMTAAKKHIQFMNKRAKREGLID

Nucleotide


Download         Length: 1350 bp        

>NTDB_id=252272 CG474_RS20305 WP_048723495.1 3847625..3848974(-) (comFA) [Bacillus cytotoxicus strain CH_1]
ATGATACAAAAAGGAAGACAAATGTTATTGCATGAACTTTCATGGTCAAAAGAACAGTTGCACGATTTCATAAGTAGAGG
GGAGATTGTATTAGAAAAAGGGATTGTTCGAAAAAATTCCAATTATATATGCCAACGTTGCGGGAATCAAGAGAAACGTC
TATTTGCATCATTTTTATGCAAAAGGTGTAATCAAGTATGTGCGTATTGCCGGAAGTGTATTATGATGGGACGAGTAAGC
GAATGTACGATGCTTGTTCGAAGTGTTTGCGAAGGGAAACAAGAATGTTTTCCTCATGCCTTGCAGTGGAAAGGAGTATT
GTCACCGGGGCAAAATAGAGCTGCAAAGGGGGTAGTAGAAGCGATTAAACAGAAAGAGTCCTTTTTTATTTGGGCGGTAT
GTGGTGCTGGAAAAACGGAGATGTTGTTTTCTGGCATAGATGAAGCGCTTAAAAACGGTAAGCGAGTTTGTATAGCGACG
CCTCGAAAAGATGTTGTATTAGAATTAGCTCCACGACTACAAGAAGTGTTTCCAGATATACCATTTGCGACATTATACGG
AGGCAGTAAAGAATATGAAAAAGAAGCGATGCTTATCGTAACTACAACTCATCAGTTGCTTCGCTATTATAGGGCGTTTG
ATGTCATGATTATAGATGAAATTGATGCTTTTCCGTATGTGATGGATCATATGTTACATGAGGCCGTAAGAAAAGCGGCG
AAGGAGGATGCTTCCTATATTTATTTAACAGCAACTCCAGAAGAGAAATGGAAACAGAGTGTGCAAAATGGTAAACAAAA
AGGTGTTATTATATCAGGACGTTATCATCGCCATCCATTACCGGTTCCTAAATTAAAATGGTGTGGAAATTGGAGAAAAC
TACTTCGACGTAAACAAATTCCTCAAGTTCTATTGGACTGGCTTCTTATATATCTACAAAAACATTATCCTATCTTTTTA
TTTGTTCCTCATATTTGTTATGTAGAGGAAGTAACGGCTGTGTTACAAAGATTACATCATAAAATTACTGGTGTTCATGC
AGAAGATCCAATGAGAAAAGAGAAAGTAGAGGCATTTCGTAAAGGGGATATACCATTACTCATAACTACAACTATATTGG
AGAGAGGGGTAACGGTCGCGAATTTACAAGTAGCTGTGCTAGGAGCAGAAGAGGATATTTTTTCTGAAAGTGCACTGGTT
CAGATAGCGGGCCGCGTTGGTAGAAGTTCTCACAATCCAAAGGGGGAAGTCATTTATTTTCATTATGGAAAAACGAAAGC
AATGACTGCTGCGAAAAAGCATATTCAATTTATGAATAAAAGGGCTAAACGAGAAGGGTTGATAGATTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA Bacillus subtilis subsp. subtilis str. 168

50.113

98.218

0.492

  comFA Latilactobacillus sakei subsp. sakei 23K

38.626

93.987

0.363


Multiple sequence alignment