Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   E3U44_RS04815 Genome accession   NZ_CP038033
Coordinates   991706..992926 (-) Length   406 a.a.
NCBI ID   WP_134356919.1    Uniprot ID   A0A4P7BXQ8
Organism   Nitrosococcus wardiae strain D1FHS     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 986706..997926
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  E3U44_RS04780 (E3U44_04780) - 987629..987856 (+) 228 WP_134356912.1 zinc-finger domain-containing protein -
  E3U44_RS04785 (E3U44_04785) - 987946..988146 (-) 201 WP_134356913.1 DUF2905 domain-containing protein -
  E3U44_RS04790 (E3U44_04790) - 988188..989138 (+) 951 WP_134356914.1 Nudix family hydrolase -
  E3U44_RS04795 (E3U44_04795) - 989144..989332 (-) 189 WP_134356915.1 DNA gyrase inhibitor YacG -
  E3U44_RS04800 (E3U44_04800) zapD 989329..990096 (-) 768 WP_134356916.1 cell division protein ZapD -
  E3U44_RS04805 (E3U44_04805) coaE 990169..990768 (-) 600 WP_134356917.1 dephospho-CoA kinase -
  E3U44_RS04810 (E3U44_04810) pilD 990817..991686 (-) 870 WP_134356918.1 prepilin peptidase Machinery gene
  E3U44_RS04815 (E3U44_04815) pilC 991706..992926 (-) 1221 WP_134356919.1 type II secretion system F family protein Machinery gene
  E3U44_RS04820 (E3U44_04820) pilB 992931..994646 (-) 1716 WP_206054890.1 type IV-A pilus assembly ATPase PilB Machinery gene
  E3U44_RS04825 (E3U44_04825) lptM 994811..994996 (+) 186 WP_206054891.1 LPS translocon maturation chaperone LptM -
  E3U44_RS04830 (E3U44_04830) lysA 995000..996250 (+) 1251 WP_134356920.1 diaminopimelate decarboxylase -
  E3U44_RS04835 (E3U44_04835) - 996311..996982 (+) 672 WP_134356921.1 class I SAM-dependent methyltransferase -
  E3U44_RS04840 (E3U44_04840) dapF 997021..997851 (+) 831 WP_134356922.1 diaminopimelate epimerase -

Sequence


Protein


Download         Length: 406 a.a.        Molecular weight: 44296.27 Da        Isoelectric Point: 10.1722

>NTDB_id=351652 E3U44_RS04815 WP_134356919.1 991706..992926(-) (pilC) [Nitrosococcus wardiae strain D1FHS]
MAQAATKQQIFVWEGANRQGQRVKGEVNGKNTSMVKADLRRQGIVPLKVRKKPTPLFSRGKKKITPKEIAIFSRQLATMM
SAGVPLVQAFEIIGRGHENASMQALVLNIKGEVEGGGTLAEALKKHPRQFDDLFCNLVNAGEQSGTLETLLDKIATYKEK
TEAIKGKIKKALFYPTAVVVVAFIITAILLIFVIPQFQTLFQNFGADLPALTLLVLQLSALFQEWWWAIFGGMGVAIYSL
IEARRRSRKINHLFDKLLLKLPVIGDILNKATIARYARTLSTMFAAGVPLVEAMASVAGAAGNSVYAQAILRMRDEVSTG
TQLQAAMRSSQLFPNMVVQMVAIGEEAGSIDQMLAKVADFYEEEVDNAVDALSSLLEPLIMAILGVLVGGLVIAMYLPIF
KMGSVV

Nucleotide


Download         Length: 1221 bp        

>NTDB_id=351652 E3U44_RS04815 WP_134356919.1 991706..992926(-) (pilC) [Nitrosococcus wardiae strain D1FHS]
ATGGCGCAAGCAGCCACGAAGCAGCAGATTTTTGTCTGGGAAGGCGCCAACCGGCAGGGGCAGCGGGTTAAAGGCGAGGT
GAACGGCAAAAACACTAGCATGGTGAAAGCCGATCTACGGCGACAAGGCATTGTACCCTTAAAGGTCCGGAAAAAACCCA
CGCCTTTGTTTAGCCGAGGCAAGAAAAAAATCACCCCTAAGGAAATCGCTATTTTTAGCCGCCAACTGGCAACGATGATG
TCCGCGGGCGTACCATTGGTTCAGGCCTTTGAAATCATAGGTCGCGGCCATGAAAACGCCTCCATGCAAGCGCTGGTCCT
AAATATCAAGGGGGAAGTGGAGGGTGGCGGCACGCTGGCTGAAGCCCTCAAGAAACACCCAAGGCAGTTTGACGACCTTT
TTTGTAACCTGGTCAATGCCGGGGAGCAGTCCGGTACCCTTGAGACATTGCTGGATAAAATCGCCACTTACAAAGAAAAG
ACAGAGGCCATAAAGGGAAAGATAAAAAAAGCGCTATTCTATCCTACCGCCGTAGTGGTGGTGGCTTTTATCATTACCGC
TATTTTGCTTATCTTCGTTATCCCCCAATTTCAAACCTTGTTCCAGAATTTCGGCGCCGATCTCCCCGCCCTCACCTTAT
TAGTACTCCAGCTTTCCGCCCTCTTTCAAGAGTGGTGGTGGGCTATTTTTGGAGGCATGGGGGTCGCCATTTATAGCCTT
ATCGAAGCCAGGCGGCGCTCACGCAAGATCAACCATCTCTTCGATAAACTACTACTCAAGCTGCCGGTGATTGGCGATAT
CCTGAACAAGGCAACTATCGCCCGCTATGCCCGTACCCTCTCGACCATGTTTGCCGCCGGCGTTCCCTTGGTTGAAGCAA
TGGCCTCTGTGGCTGGCGCTGCTGGCAACTCCGTCTATGCTCAGGCCATCCTCCGCATGCGTGATGAGGTCTCCACCGGG
ACTCAGTTACAAGCGGCTATGCGCAGCAGCCAATTATTTCCTAATATGGTGGTGCAGATGGTAGCTATTGGTGAAGAAGC
AGGCTCAATCGATCAGATGCTAGCCAAAGTAGCCGATTTCTATGAGGAGGAAGTCGATAATGCCGTTGATGCCCTAAGTA
GCTTGCTAGAGCCTCTTATCATGGCGATACTAGGGGTTTTGGTGGGCGGCCTTGTCATCGCCATGTACTTACCCATCTTC
AAGATGGGCTCGGTCGTTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A4P7BXQ8

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Pseudomonas stutzeri DSM 10701

60.591

100

0.606

  pilC Legionella pneumophila strain ERS1305867

59.596

97.537

0.581

  pilC Acinetobacter baylyi ADP1

56.757

100

0.569

  pilC Acinetobacter baumannii D1279779

57.683

97.783

0.564

  pilG Neisseria meningitidis 44/76-A

47.395

99.261

0.47

  pilG Neisseria gonorrhoeae MS11

47.146

99.261

0.468

  pilC Vibrio cholerae strain A1552

41.919

97.537

0.409

  pilC Vibrio campbellii strain DS40M4

40

100

0.404

  pilC Thermus thermophilus HB27

35.8

100

0.369


Multiple sequence alignment