Detailed information    

insolico Bioinformatically predicted

Overview


Name   comFC   Type   Machinery gene
Locus tag   clem_RS10595 Genome accession   NZ_CP016397
Coordinates   2415499..2416194 (+) Length   231 a.a.
NCBI ID   WP_157698233.1    Uniprot ID   A0A222P4D4
Organism   Legionella clemsonensis strain CDC-D5610     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2410499..2421194
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  clem_RS10575 (clem_10745) uvrC 2410845..2412689 (-) 1845 WP_094091527.1 excinuclease ABC subunit UvrC -
  clem_RS10580 (clem_10750) letA 2412775..2413434 (-) 660 WP_094091528.1 two-component system response regulator LetA Regulator
  clem_RS10585 (clem_10755) phhA 2413653..2414471 (-) 819 WP_094091529.1 phenylalanine 4-monooxygenase -
  clem_RS10590 (clem_10760) bioC 2414597..2415463 (-) 867 WP_094091530.1 malonyl-ACP O-methyltransferase BioC -
  clem_RS10595 (clem_10765) comFC 2415499..2416194 (+) 696 WP_157698233.1 ComF family protein Machinery gene
  clem_RS10600 (clem_10770) - 2416191..2416907 (-) 717 WP_094091531.1 hypothetical protein -
  clem_RS10605 (clem_10780) - 2417027..2418093 (-) 1067 Protein_2118 M50 family metallopeptidase -
  clem_RS10610 (clem_10785) - 2418098..2418385 (-) 288 WP_094091532.1 hypothetical protein -
  clem_RS10615 (clem_10790) hemA 2418589..2419851 (+) 1263 WP_094091533.1 glutamyl-tRNA reductase -
  clem_RS10620 (clem_10795) prfA 2419848..2420942 (+) 1095 WP_094091534.1 peptide chain release factor 1 -

Sequence


Protein


Download         Length: 231 a.a.        Molecular weight: 26718.48 Da        Isoelectric Point: 9.6469

>NTDB_id=187977 clem_RS10595 WP_157698233.1 2415499..2416194(+) (comFC) [Legionella clemsonensis strain CDC-D5610]
MRYKKRSITHLLRLPSACIICGKYHQDIDAVCPDCLNLLQPLGPACQYCALPLSDDGFLVCGRCSKQKPAFDKTWVLYRF
EEPLRTLLHEYKYNGALYLRRLLVKLMMDALPQEELTTQCLIPVPLHYKKLRERGFNQAAEFSKMLSNRLKIPYELTLTQ
KVLHTPAQVSLNGRKRRHNLQHAFRIKKQAYQHITLIDDLLTTGSTVNELAKLFKQQGVTRVDVWCCARAC

Nucleotide


Download         Length: 696 bp        

>NTDB_id=187977 clem_RS10595 WP_157698233.1 2415499..2416194(+) (comFC) [Legionella clemsonensis strain CDC-D5610]
GTGCGCTACAAAAAAAGAAGTATAACACACTTATTACGCCTGCCATCTGCTTGTATCATTTGTGGTAAGTACCATCAGGA
CATTGATGCTGTTTGCCCAGACTGTCTAAATCTCTTACAACCCCTGGGGCCTGCTTGCCAATATTGTGCTCTTCCCTTAT
CCGATGATGGTTTTCTGGTTTGTGGTCGCTGCTCAAAACAAAAACCCGCTTTTGATAAAACGTGGGTACTGTATCGCTTT
GAGGAACCTTTACGTACTTTGCTGCATGAATACAAATACAACGGAGCACTTTACTTACGCAGACTATTGGTGAAATTAAT
GATGGATGCCTTACCGCAGGAAGAACTCACTACGCAATGTTTAATTCCTGTTCCCCTGCATTACAAAAAATTACGTGAAA
GAGGGTTTAATCAAGCTGCTGAGTTTTCCAAGATGCTGTCGAACCGTTTAAAAATACCTTATGAACTTACTCTTACCCAA
AAGGTGTTACATACTCCGGCACAAGTTAGCTTAAACGGCAGAAAACGCCGCCATAATCTGCAACATGCCTTTCGGATAAA
AAAGCAGGCTTATCAACACATTACTTTGATTGATGATTTATTAACTACAGGCAGCACTGTCAATGAACTGGCAAAACTTT
TTAAGCAACAGGGCGTAACCCGCGTTGATGTCTGGTGTTGTGCAAGAGCTTGTTAA

Domains


Predicted by InterproScan.

(14-64)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A222P4D4

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFC Legionella pneumophila str. Paris

51.304

99.567

0.511

  comFC Legionella pneumophila strain ERS1305867

52.604

83.117

0.437


Multiple sequence alignment