Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   GG844_RS15395 Genome accession   NZ_CP045719
Coordinates   2154609..2155835 (+) Length   408 a.a.
NCBI ID   WP_000648511.1    Uniprot ID   Q9X4G9
Organism   Vibrio cholerae O395 substr. TCP2     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2149609..2160835
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GG844_RS15370 (GG844_15370) fldB 2149700..2150218 (+) 519 WP_000690112.1 flavodoxin FldB -
  GG844_RS15375 (GG844_15375) ampD 2150441..2150986 (-) 546 WP_000567318.1 1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD -
  GG844_RS15380 (GG844_15380) nadC 2151282..2152172 (+) 891 WP_000665274.1 carboxylating nicotinate-nucleotide diphosphorylase -
  GG844_RS15385 (GG844_15385) pilA 2152417..2152878 (+) 462 WP_000649326.1 pilin Machinery gene
  GG844_RS15390 (GG844_15390) pilB 2152878..2154566 (+) 1689 WP_000957200.1 type IV-A pilus assembly ATPase PilB Machinery gene
  GG844_RS15395 (GG844_15395) pilC 2154609..2155835 (+) 1227 WP_000648511.1 type II secretion system F family protein Machinery gene
  GG844_RS15400 (GG844_15400) pilD 2155893..2156768 (+) 876 WP_000418747.1 prepilin peptidase Machinery gene
  GG844_RS15405 (GG844_15405) coaE 2156765..2157373 (+) 609 WP_000011557.1 dephospho-CoA kinase -
  GG844_RS15410 (GG844_15410) zapD 2157405..2158145 (+) 741 WP_000207198.1 cell division protein ZapD -
  GG844_RS15415 (GG844_15415) yacG 2158287..2158484 (+) 198 WP_000162865.1 DNA gyrase inhibitor YacG -
  GG844_RS15420 (GG844_15420) parC 2158533..2160818 (-) 2286 WP_000102510.1 DNA topoisomerase IV subunit A -

Sequence


Protein


Download         Length: 408 a.a.        Molecular weight: 45134.43 Da        Isoelectric Point: 10.3424

>NTDB_id=396316 GG844_RS15395 WP_000648511.1 2154609..2155835(+) (pilC) [Vibrio cholerae O395 substr. TCP2]
MKATQTLPLKNYRWKGINSNGKKVSGQMLAISEIEVRDKLKDQHIQIKKLKKGSVSLLARLTHRVKSKDITILTRQLATM
LTTGVPIVQALKLVGDNHRKAEMKSILAQITKSVEAGTPLSKAMRTASAHFDTLYVDLVETGEMSGNLPEVFERLATYRE
KSEQLRAKVIKALIYPSMVVLVALGVSYLMLTMVIPEFESMFKGFGAELPWFTQQVLKLSHWVQAYSLWAFIAIAAAIFG
LKALRKNSFQIRLKTSRLGLKFPIIGNVLAKASIAKFSRTLATSFAAGIPILASLKTTAKTSGNVHFETAINEVYRDTAA
GMPMYIAMRNTEAFPEMVLQMVMIGEESGQLDDMLNKVATIYEFEVDNTVDNLGKILEPLIIVFLGTVVGGLVVAMYLPI
FNLMSVLG

Nucleotide


Download         Length: 1227 bp        

>NTDB_id=396316 GG844_RS15395 WP_000648511.1 2154609..2155835(+) (pilC) [Vibrio cholerae O395 substr. TCP2]
ATGAAAGCGACCCAAACCTTACCTCTGAAAAATTATCGCTGGAAAGGCATCAACAGCAACGGCAAAAAAGTTTCCGGCCA
GATGCTCGCCATCTCCGAAATCGAGGTGCGCGATAAGCTCAAAGATCAGCATATTCAGATCAAAAAACTCAAAAAAGGCA
GTGTATCTCTTTTGGCACGCCTAACCCATCGCGTGAAAAGTAAAGATATTACGATTTTGACTCGGCAGTTGGCGACCATG
CTCACCACGGGCGTACCCATTGTGCAAGCCCTCAAGTTGGTGGGCGATAATCACCGTAAAGCTGAGATGAAATCGATTCT
GGCGCAAATCACCAAAAGCGTGGAAGCGGGCACGCCACTTTCCAAGGCGATGCGCACCGCCAGCGCCCATTTTGATACCT
TGTATGTCGATTTAGTGGAAACCGGAGAGATGTCCGGTAACTTACCTGAGGTGTTTGAGCGTTTGGCCACCTACCGCGAG
AAAAGCGAGCAACTACGCGCCAAGGTGATTAAAGCGCTCATCTACCCCAGCATGGTTGTGTTGGTCGCGCTCGGGGTATC
TTACTTAATGCTCACCATGGTCATCCCAGAGTTTGAAAGCATGTTTAAAGGCTTTGGTGCTGAACTGCCTTGGTTTACGC
AGCAAGTGCTGAAACTCTCACACTGGGTGCAGGCTTACAGTTTATGGGCATTTATCGCCATCGCAGCAGCCATTTTTGGC
TTGAAAGCGCTGCGTAAAAACTCTTTCCAGATCCGTTTAAAAACCAGCCGCTTAGGGCTGAAATTTCCGATTATTGGTAA
TGTGCTCGCTAAGGCTTCCATCGCCAAATTCAGCCGTACCCTCGCCACCAGCTTTGCCGCGGGGATCCCAATTCTCGCCA
GTTTAAAAACCACGGCCAAAACCTCCGGCAATGTGCACTTTGAAACCGCGATTAATGAGGTGTACCGCGATACCGCTGCG
GGTATGCCGATGTACATTGCTATGCGCAATACCGAAGCTTTTCCCGAAATGGTGCTGCAAATGGTGATGATCGGTGAAGA
GTCTGGGCAATTAGACGACATGCTCAACAAGGTCGCGACCATCTATGAATTTGAAGTCGATAACACGGTCGATAACTTGG
GCAAGATTCTTGAACCACTGATCATCGTCTTTCTTGGGACGGTTGTGGGCGGCTTAGTGGTGGCGATGTACTTACCTATC
TTTAACTTAATGAGTGTATTGGGTTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB Q9X4G9

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Vibrio cholerae strain A1552

100

100

1

  pilC Vibrio campbellii strain DS40M4

73.75

98.039

0.723

  pilC Acinetobacter baumannii D1279779

42.647

100

0.426

  pilC Acinetobacter baylyi ADP1

42.015

99.755

0.419

  pilC Legionella pneumophila strain ERS1305867

42.211

97.549

0.412

  pilC Pseudomonas stutzeri DSM 10701

42.172

97.059

0.409

  pilG Neisseria gonorrhoeae MS11

41.294

98.529

0.407

  pilG Neisseria meningitidis 44/76-A

40.796

98.529

0.402


Multiple sequence alignment