Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   KAQ61_RS00735 Genome accession   NZ_CP072916
Coordinates   162977..164200 (-) Length   407 a.a.
NCBI ID   WP_045267485.1    Uniprot ID   A0A7X2I2Z9
Organism   Comamonas aquatica strain NEB418     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 157977..169200
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  KAQ61_RS00705 (KAQ61_00705) - 158892..159824 (+) 933 WP_043379904.1 ATP-binding protein -
  KAQ61_RS00710 (KAQ61_00710) - 159846..160307 (+) 462 WP_210930171.1 (deoxy)nucleoside triphosphate pyrophosphohydrolase -
  KAQ61_RS00715 (KAQ61_00715) - 160411..160620 (-) 210 WP_045267488.1 DNA gyrase inhibitor YacG -
  KAQ61_RS00720 (KAQ61_00720) zapD 160647..161402 (-) 756 WP_210930172.1 cell division protein ZapD -
  KAQ61_RS00725 (KAQ61_00725) coaE 161497..162105 (-) 609 WP_154163335.1 dephospho-CoA kinase -
  KAQ61_RS00730 (KAQ61_00730) pilD 162108..162977 (-) 870 WP_210930173.1 prepilin peptidase Machinery gene
  KAQ61_RS00735 (KAQ61_00735) pilC 162977..164200 (-) 1224 WP_045267485.1 type II secretion system F family protein Machinery gene
  KAQ61_RS00740 (KAQ61_00740) pilB 164252..165997 (-) 1746 WP_210930174.1 type IV-A pilus assembly ATPase PilB Machinery gene
  KAQ61_RS00745 (KAQ61_00745) - 166156..167085 (-) 930 WP_210931650.1 polyprenyl synthetase family protein -
  KAQ61_RS00750 (KAQ61_00750) rplU 167296..167607 (+) 312 WP_042417032.1 50S ribosomal protein L21 -
  KAQ61_RS00755 (KAQ61_00755) rpmA 167626..167886 (+) 261 WP_042416998.1 50S ribosomal protein L27 -
  KAQ61_RS00760 (KAQ61_00760) cgtA 168050..169123 (+) 1074 WP_042416995.1 Obg family GTPase CgtA -

Sequence


Protein


Download         Length: 407 a.a.        Molecular weight: 44589.73 Da        Isoelectric Point: 9.5471

>NTDB_id=556861 KAQ61_RS00735 WP_045267485.1 162977..164200(-) (pilC) [Comamonas aquatica strain NEB418]
MATAAPGRSNAKEFIYEWAGKDRQGKALHGEMRAVGEAQVQASLRRQGIFITKLKKRSMRAGKKIKPKDIALFTRQMATM
MKAGVPLLQSFDIVGRGNPNPSVTKLLNDIRQDVETGTSLSMAFRKHPMHFNALYCNLIEAGEAAGILDTLMDRLATYME
KTEAIKGKIKSALMYPIAVIVVAFVVVTVIMIFVIPAFKEVFTSFGADLPAPTLLVMAISEFFVAYWWLIFGVIGGGLYF
FMQAWRRSPKMQRVMDRLLLKLPIFGILIDKSCVARWTRTLSTMFAAGVPLVEALDSVGGASGNSVYREATDKIQQMVAT
GTNLTDAMTSADVFPSMVLQMCAIGEESGSIDHMLGKAADFYEQEVDEMVAGLSSLMEPIIIVILGGIIGGIVVSMYLPI
FKLGQVV

Nucleotide


Download         Length: 1224 bp        

>NTDB_id=556861 KAQ61_RS00735 WP_045267485.1 162977..164200(-) (pilC) [Comamonas aquatica strain NEB418]
ATGGCAACCGCAGCACCTGGCAGGAGCAACGCAAAGGAGTTCATTTACGAATGGGCCGGCAAAGACCGCCAGGGCAAGGC
ACTGCACGGAGAGATGCGGGCCGTTGGGGAAGCCCAGGTCCAGGCCTCCTTGCGCCGCCAGGGCATCTTCATCACCAAGC
TGAAGAAGCGCAGCATGCGCGCCGGCAAGAAGATCAAGCCCAAGGACATTGCGCTGTTCACCCGCCAGATGGCCACCATG
ATGAAGGCCGGGGTCCCGCTGCTGCAGTCGTTCGACATCGTCGGGCGTGGCAACCCCAATCCCAGCGTGACCAAGCTGCT
CAACGACATCCGCCAGGATGTGGAGACCGGCACCTCGCTGAGCATGGCATTCCGCAAGCACCCCATGCACTTCAACGCCT
TGTACTGCAACCTGATCGAGGCCGGGGAAGCTGCGGGCATCCTGGACACCTTGATGGACCGCCTGGCCACCTACATGGAA
AAGACCGAGGCCATCAAGGGCAAGATCAAGTCCGCGCTGATGTACCCCATCGCGGTGATCGTGGTGGCGTTCGTGGTGGT
GACGGTGATCATGATCTTTGTGATCCCGGCGTTCAAGGAAGTCTTCACCTCGTTCGGCGCCGACCTGCCGGCCCCGACGC
TACTGGTCATGGCCATCAGTGAGTTCTTTGTCGCGTATTGGTGGCTGATCTTTGGCGTGATCGGTGGCGGCCTGTATTTC
TTCATGCAGGCATGGCGGCGCAGTCCCAAGATGCAGCGCGTGATGGACCGTCTGCTGCTGAAGCTGCCCATTTTTGGCAT
CCTGATCGACAAATCCTGCGTCGCACGCTGGACGCGCACCCTCTCCACCATGTTCGCCGCCGGCGTGCCCCTGGTGGAAG
CGCTGGACTCCGTCGGCGGGGCCTCGGGCAACAGCGTCTACCGGGAAGCCACCGACAAGATCCAGCAGATGGTTGCCACC
GGCACCAATCTCACCGATGCGATGACGAGTGCCGATGTCTTCCCGTCGATGGTGCTGCAGATGTGCGCCATCGGCGAGGA
ATCGGGCTCCATTGATCACATGCTGGGCAAGGCGGCCGACTTCTACGAACAGGAAGTGGACGAGATGGTGGCAGGCCTTT
CCAGCCTGATGGAGCCCATCATCATCGTGATCCTGGGTGGCATCATCGGCGGGATTGTGGTGTCCATGTACCTGCCGATC
TTCAAGTTAGGACAAGTGGTCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A7X2I2Z9

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Pseudomonas stutzeri DSM 10701

54.887

98.034

0.538

  pilG Neisseria gonorrhoeae MS11

51.01

97.297

0.496

  pilG Neisseria meningitidis 44/76-A

50.758

97.297

0.494

  pilC Legionella pneumophila strain ERS1305867

50.505

97.297

0.491

  pilC Acinetobacter baylyi ADP1

49.246

97.789

0.482

  pilC Acinetobacter baumannii D1279779

48.622

98.034

0.477

  pilC Vibrio campbellii strain DS40M4

40.049

100

0.405

  pilC Thermus thermophilus HB27

39.089

100

0.4

  pilC Vibrio cholerae strain A1552

40.909

97.297

0.398