Detailed information    

In silico (bioinformatically predicted)

Overview


Name   cclA/cilC
Type   Machinery gene
Locus tag   R4702_RS07925
Genome accession   NZ_CP137106
Coordinates   1527906..1528565 (-)
Length   219 a.a.
NCBI ID   WP_000565017.1
UniProt ID   A0A0E8GKN8
Organism   Streptococcus pneumoniae strain 16P4028
Function   processing and translocation of ComGC; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
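
The coordinates and the protein length listed above agree with each other, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function name `feature_lengths` is illustrative and not part of the database.

```python
def feature_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for a coding feature.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    # Subtract one codon for the stop, which is not part of the protein.
    aa_len = nt_len // 3 - 1
    return nt_len, aa_len


nt_len, aa_len = feature_lengths(1527906, 1528565)
print(nt_len, "bp,", aa_len, "a.a.")
```

Under these assumptions the interval spans 660 bp and encodes 219 residues, matching the lengths reported on this page.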

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) predicted in the genome by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates   Gene coordinates   Relative position   Distance (bp)
Genomic island   1509799..1544697   1527906..1528565   within   0
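
The "Relative position" and "Distance (bp)" columns follow from comparing the two intervals. The sketch below is one plausible way to derive them, assuming 1-based inclusive coordinates. Only the label "within" appears on this page; the other labels, the distance convention for non-contained genes, and the function name `relate_gene_to_mge` are hypothetical.

```python
def relate_gene_to_mge(gene: tuple[int, int], mge: tuple[int, int]) -> tuple[str, int]:
    """Classify a gene interval relative to an MGE interval.

    Returns (relative position, distance in bp). Labels other than
    "within" are illustrative assumptions, not database vocabulary.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "upstream", m_start - g_end
    if g_start > m_end:
        return "downstream", g_start - m_end
    return "overlapping", 0


position, distance = relate_gene_to_mge((1527906, 1528565), (1509799, 1544697))
print(position, distance)
```

For the gene and genomic island listed here, the gene lies entirely inside the island, so the sketch reports "within" with a distance of 0.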


Gene organization within MGE regions


Location: 1509799..1544697
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
R4702_RS07830   -   1509817..1510200 (+)   384   WP_001813172.1   hypothetical protein   -
R4702_RS07835   -   1510258..1510899 (+)   642   WP_317809624.1   phosphate uptake regulator PhoU   -
R4702_RS07840   galT   1510912..1512387 (+)   1476   WP_000848802.1   UDP-glucose--hexose-1-phosphate uridylyltransferase   -
R4702_RS07845   galE   1512400..1513380 (+)   981   WP_317809626.1   UDP-glucose 4-epimerase GalE   -
R4702_RS07850   -   1513590..1514249 (+)   660   WP_000793807.1   DUF1868 domain-containing protein   -
R4702_RS07855   -   1514274..1515341 (+)   1068   WP_000738372.1   extracellular solute-binding protein   -
R4702_RS07860   -   1515444..1516454 (+)   1011   WP_000589142.1   ABC transporter ATP-binding protein   -
R4702_RS07865   -   1516466..1518157 (+)   1692   WP_001216185.1   iron ABC transporter permease   -
R4702_RS07870   -   1518160..1518870 (+)   711   WP_000127552.1   MgtC/SapB family protein   -
R4702_RS07875   -   1518966..1519334 (+)   369   WP_219590585.1   hypothetical protein   -
R4702_RS07880   -   1519442..1520443 (+)   1002   WP_000789949.1   LacI family DNA-binding transcriptional regulator   -
R4702_RS07885   -   1521051..1521302 (+)   252   WP_317809627.1   hypothetical protein   -
R4702_RS07890   trpE   1521675..1523036 (+)   1362   WP_219590583.1   anthranilate synthase component I   -
R4702_RS07895   -   1523033..1523599 (+)   567   WP_000601930.1   aminodeoxychorismate/anthranilate synthase component II   -
R4702_RS07900   trpD   1523610..1524614 (+)   1005   WP_000658684.1   anthranilate phosphoribosyltransferase   -
R4702_RS07905   trpC   1524611..1525378 (+)   768   WP_000076567.1   indole-3-glycerol phosphate synthase TrpC   -
R4702_RS07910   -   1525365..1525964 (+)   600   WP_000169915.1   phosphoribosylanthranilate isomerase   -
R4702_RS07915   trpB   1525942..1527165 (+)   1224   WP_000331287.1   tryptophan synthase subunit beta   -
R4702_RS07920   trpA   1527158..1527934 (+)   777   WP_044788824.1   tryptophan synthase subunit alpha   -
R4702_RS07925   cclA/cilC   1527906..1528565 (-)   660   WP_000565017.1   A24 family peptidase   Machinery gene
R4702_RS07930   -   1528635..1529081 (+)   447   WP_000155429.1   GNAT family N-acetyltransferase   -
R4702_RS07935   -   1529309..1530487 (-)   1179   WP_000175886.1   ROK family transcriptional regulator   -
R4702_RS07940   -   1530767..1532029 (+)   1263   WP_000413336.1   PTS sugar transporter subunit IIC   -
R4702_RS07945   -   1532032..1532928 (+)   897   WP_016398462.1   hypothetical protein   -
R4702_RS07950   -   1532943..1533260 (+)   318   WP_001151646.1   PTS sugar transporter subunit IIB   -
R4702_RS07955   -   1533277..1533594 (+)   318   WP_001005881.1   PTS lactose/cellobiose transporter subunit IIA   -
R4702_RS07960   -   1533604..1534959 (+)   1356   WP_001094994.1   PTS sugar transporter subunit IIC   -
R4702_RS07965   -   1535283..1536758 (+)   1476   WP_279513521.1   arylsulfatase   -
R4702_RS07970   -   1536755..1537603 (+)   849   WP_279513522.1   formylglycine-generating enzyme family protein   -
R4702_RS07975   -   1537618..1538388 (+)   771   WP_000229339.1   sulfite exporter TauE/SafE family protein   -
R4702_RS07980   -   1538666..1538911 (-)   246   WP_001810496.1   IS30 family transposase   -
R4702_RS07985   -   1539190..1539429 (+)   240   WP_001005784.1   transposase   -
R4702_RS07990   -   1539539..1539742 (-)   204   WP_000109957.1   CsbD family protein   -
R4702_RS07995   -   1539773..1540381 (-)   609   WP_000064115.1   Asp23/Gls24 family envelope stress response protein   -
R4702_RS08000   -   1540420..1540590 (-)   171   WP_000455066.1   DUF2273 domain-containing protein   -
R4702_RS08005   amaP   1540602..1541168 (-)   567   WP_000030213.1   alkaline shock response membrane anchor protein AmaP   -
R4702_RS08010   -   1541252..1541416 (-)   165   WP_001809002.1   GlsB/YeaQ/YmgE family stress response membrane protein   -
R4702_RS08015   mgaSpn   1541859..1543340 (+)   1482   WP_001205276.1   virulence factor transcriptional regulator MgaSpn   -
R4702_RS08020   -   1543702..1544697 (+)   996   WP_000230210.1   LacI family DNA-binding transcriptional regulator   -

Sequence


Protein


Length: 219 a.a.   Molecular weight: 24879.99 Da   Isoelectric point: 7.6418

>NTDB_id=895588 R4702_RS07925 WP_000565017.1 1527906..1528565(-) (cclA/cilC) [Streptococcus pneumoniae strain 16P4028]
MIDFYFFLVGSILASFLGLVIDRFPEQSIISSASHCDSCQTRLRPLDLIPILSQVFNRFRCRYCKVRYPVWYALFELVLG
LLFLLYSWELLSLGQVVLITAGLTLGIYDFHHQEYPLLVWMTFHLILIASSGWNLVMVSFLALGILAHFIDIRMGAGDFL
FLASCALVFSVTELLILIQFASATGILAFILQKKKERLPFVPFLLLATCLIIFGKLLLV

Nucleotide


Length: 660 bp

>NTDB_id=895588 R4702_RS07925 WP_000565017.1 1527906..1528565(-) (cclA/cilC) [Streptococcus pneumoniae strain 16P4028]
ATGATTGATTTTTATTTTTTTCTCGTCGGGAGCATTCTCGCTTCCTTTCTTGGTTTGGTCATTGACCGTTTTCCAGAGCA
ATCCATTATCAGTTCAGCCAGTCACTGCGATTCCTGTCAGACTCGCTTGCGTCCCTTAGATTTGATTCCGATTCTCTCAC
AGGTCTTCAATCGCTTTCGCTGTCGCTACTGCAAAGTTCGCTATCCTGTCTGGTATGCCCTCTTTGAATTAGTCTTAGGA
CTCCTCTTTCTGCTTTACTCTTGGGAATTGCTTTCCTTGGGGCAAGTCGTCCTAATCACCGCTGGTTTGACCTTGGGTAT
CTACGACTTTCACCATCAGGAATATCCCTTACTGGTCTGGATGACTTTCCACCTAATCCTAATAGCTTCCTCTGGCTGGA
ATCTGGTCATGGTCTCCTTCCTTGCTCTTGGAATTTTGGCTCATTTTATCGATATCCGCATGGGCGCAGGAGATTTCCTC
TTTTTAGCTTCTTGTGCTCTCGTCTTTAGCGTAACGGAGTTACTGATCTTGATTCAGTTTGCTTCTGCGACGGGTATCCT
GGCCTTTATCCTGCAAAAGAAAAAGGAAAGACTTCCTTTCGTGCCTTTCCTCTTACTTGCTACTTGTTTGATTATTTTTG
GTAAGCTACTGCTTGTTTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source   ID
AlphaFold DB   A0A0E8GKN8

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
cclA/cilC   Streptococcus pneumoniae TIGR4   96.804   100   0.968
cclA/cilC   Streptococcus pneumoniae Rx1   95.89   100   0.959
cclA/cilC   Streptococcus pneumoniae D39   95.89   100   0.959
cclA/cilC   Streptococcus pneumoniae R6   95.89   100   0.959
cclA/cilC   Streptococcus mitis SK321   92.237   100   0.922
cclA/cilC   Streptococcus mitis NCTC 12261   85.845   100   0.858
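
This page does not define the Ha-value. The values listed for the similar proteins are, however, consistent with the product of identity and coverage expressed as fractions and rounded to three decimals. The sketch below checks that observation against the six rows; the formula is an assumption inferred from this table, and the function name `ha_value` is illustrative.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed formula: (identity / 100) * (coverage / 100), rounded to 3 decimals."""
    return round(identity_pct * coverage_pct / 10000, 3)


# (organism, identity %, coverage %, Ha-value as listed on this page)
rows = [
    ("Streptococcus pneumoniae TIGR4", 96.804, 100, 0.968),
    ("Streptococcus pneumoniae Rx1", 95.89, 100, 0.959),
    ("Streptococcus pneumoniae D39", 95.89, 100, 0.959),
    ("Streptococcus pneumoniae R6", 95.89, 100, 0.959),
    ("Streptococcus mitis SK321", 92.237, 100, 0.922),
    ("Streptococcus mitis NCTC 12261", 85.845, 100, 0.858),
]

for organism, identity, coverage, listed in rows:
    computed = ha_value(identity, coverage)
    status = "matches" if computed == listed else "differs"
    print(f"{organism}: computed {computed}, listed {listed} ({status})")
```

Because every row here has 100% coverage, the table cannot distinguish this formula from alternatives that treat coverage differently, so the check is only a consistency test.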