Detailed information    

insolico Bioinformatically predicted

Overview


Name   cclA/cilC   Type   Machinery gene
Locus tag   EL140_RS07355 Genome accession   NZ_LR134336
Coordinates   1475595..1476257 (+) Length   220 a.a.
NCBI ID   WP_000566511.1    Uniprot ID   -
Organism   Streptococcus oralis ATCC 35037 strain NCTC 11427     
Function   processing and translocation of ComGC; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 1461993..1477679 1475595..1476257 within 0


Gene organization within MGE regions


Location: 1461993..1477679
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EL140_RS07260 (NCTC11427_01433) - 1461993..1463084 (-) 1092 WP_001122753.1 AAA family ATPase -
  EL140_RS07265 (NCTC11427_01434) - 1463066..1464550 (-) 1485 WP_000664191.1 DUF262 domain-containing protein -
  EL140_RS07270 (NCTC11427_01435) - 1464585..1465055 (-) 471 WP_000257122.1 DUF3013 family protein -
  EL140_RS07275 (NCTC11427_01436) - 1465171..1466442 (+) 1272 WP_001113188.1 replication-associated recombination protein A -
  EL140_RS07285 (NCTC11427_01439) - 1467134..1467649 (+) 516 WP_000672739.1 DUF308 domain-containing protein -
  EL140_RS07295 (NCTC11427_01440) - 1468343..1469488 (+) 1146 WP_000209254.1 low temperature requirement protein A -
  EL140_RS07305 (NCTC11427_01442) - 1469945..1471426 (-) 1482 WP_001205297.1 M protein trans-acting positive regulator PRD domain-containing protein -
  EL140_RS07310 (NCTC11427_01443) - 1471723..1471959 (+) 237 WP_000964740.1 GlsB/YeaQ/YmgE family stress response membrane protein -
  EL140_RS07315 (NCTC11427_01444) amaP 1472059..1472622 (+) 564 WP_000045616.1 alkaline shock response membrane anchor protein AmaP -
  EL140_RS07320 (NCTC11427_01445) - 1472633..1472803 (+) 171 WP_000454669.1 DUF2273 domain-containing protein -
  EL140_RS07325 (NCTC11427_01446) - 1472815..1473399 (+) 585 WP_000072930.1 Asp23/Gls24 family envelope stress response protein -
  EL140_RS07330 (NCTC11427_01447) - 1473441..1473644 (+) 204 WP_000102421.1 CsbD family protein -
  EL140_RS07335 (NCTC11427_01448) - 1473862..1474383 (-) 522 WP_002874801.1 hypothetical protein -
  EL140_RS09795 - 1474453..1474966 (-) 514 Protein_1383 thiol-disulfide isomerase -
  EL140_RS07350 (NCTC11427_01451) - 1475070..1475525 (-) 456 WP_000155446.1 GNAT family N-acetyltransferase -
  EL140_RS07355 (NCTC11427_01452) cclA/cilC 1475595..1476257 (+) 663 WP_000566511.1 prepilin peptidase Machinery gene
  EL140_RS07360 (NCTC11427_01453) - 1476246..1476701 (-) 456 WP_000682683.1 effector binding domain-containing protein -
  EL140_RS07365 (NCTC11427_01454) - 1476780..1477679 (+) 900 WP_000687279.1 helix-turn-helix transcriptional regulator -

Sequence


Protein


Download         Length: 220 a.a.        Molecular weight: 25410.76 Da        Isoelectric Point: 8.3737

>NTDB_id=1121858 EL140_RS07355 WP_000566511.1 1475595..1476257(+) (cclA/cilC) [Streptococcus oralis ATCC 35037 strain NCTC 11427]
MIDLYFFLVGSILASFLGLIIDRFPEQSIIRPASYCNSCQTRLRPLDLIPILSQVFNRFRCRYCKIRYPFWYALFELGLG
ILFLAWSWGWISLGQVILITAGLTLGIYDFRHQEYPLLVWMTFHLILMAFCSWNLVMVFFLILGIMAHFIDIRMGAGDFL
FLASCALIFSATELLILIQFASAAGILAFLLQKKKERLPFVPFLLLAATIIIFYQFLLVC

Nucleotide


Download         Length: 663 bp        

>NTDB_id=1121858 EL140_RS07355 WP_000566511.1 1475595..1476257(+) (cclA/cilC) [Streptococcus oralis ATCC 35037 strain NCTC 11427]
ATGATTGATCTTTATTTTTTTCTTGTCGGGAGCATTCTCGCTTCTTTCTTGGGTTTGATCATTGACCGTTTCCCTGAGCA
ATCCATTATTCGACCGGCTAGCTACTGCAATTCCTGTCAGACTCGCTTGCGACCCTTAGATTTGATTCCTATCCTTTCGC
AGGTCTTCAATCGCTTTCGCTGTCGCTACTGCAAGATTCGATATCCATTCTGGTATGCCCTCTTTGAACTCGGCTTAGGA
ATCCTCTTTCTAGCTTGGTCTTGGGGCTGGATTTCCTTGGGTCAAGTCATCCTCATCACTGCTGGTTTGACCTTGGGCAT
CTACGACTTTCGCCATCAGGAATATCCCTTACTAGTCTGGATGACTTTCCACCTAATCCTCATGGCTTTCTGTAGTTGGA
ATCTGGTTATGGTCTTCTTCCTTATACTTGGAATCATGGCCCATTTTATCGATATTCGCATGGGCGCAGGGGATTTTCTC
TTTCTAGCTTCTTGTGCTCTCATCTTTAGCGCGACAGAATTACTCATCTTGATTCAGTTCGCTTCTGCGGCAGGCATTCT
GGCCTTTCTCCTGCAAAAGAAAAAGGAAAGACTTCCTTTCGTGCCTTTCCTCTTACTTGCTGCGACTATAATAATTTTCT
ATCAGTTCTTATTAGTTTGTTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  cclA/cilC Streptococcus mitis SK321

87.215

99.545

0.868

  cclA/cilC Streptococcus pneumoniae Rx1

86.301

99.545

0.859

  cclA/cilC Streptococcus pneumoniae D39

86.301

99.545

0.859

  cclA/cilC Streptococcus pneumoniae R6

86.301

99.545

0.859

  cclA/cilC Streptococcus mitis NCTC 12261

85.845

99.545

0.855

  cclA/cilC Streptococcus pneumoniae TIGR4

85.388

99.545

0.85


Multiple sequence alignment