Detailed information    

in silico (bioinformatically predicted)

Overview


Name   pilC   Type   Machinery gene
Locus tag   NX720_RS18860 Genome accession   NZ_CP103300
Coordinates   4685428..4686645 (-) Length   405 a.a.
NCBI ID   WP_262596681.1    Uniprot ID   -
Organism   Endozoicomonas euniceicola strain EF212     
Function   assembly of type IV pilus (predicted from homology); DNA binding and uptake
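
The coordinates and length listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends with exactly one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(4685428, 4686645)
print(cds_bp, protein_length(cds_bp))  # 1218 405
```

Under these assumptions the 1218 bp span corresponds to 406 codons, one of which is the stop codon, which agrees with the stated length of 405 a.a.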

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 4668794..4685291 4685428..4686645 flank 137
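
The distance in the summary row can be reproduced from the two coordinate pairs. This is a minimal sketch, assuming the distance is the difference between the nearest boundaries of the gene and the MGE; that convention is inferred from the numbers shown here, and the helper name is hypothetical.

```python
def flank_distance(mge: tuple[int, int], gene: tuple[int, int]) -> int:
    """Distance between a gene and an MGE; 0 if the two ranges overlap.

    Assumes distance = difference between the nearest boundaries.
    """
    mge_start, mge_end = mge
    gene_start, gene_end = gene
    if gene_start > mge_end:      # gene lies downstream of the MGE
        return gene_start - mge_end
    if gene_end < mge_start:      # gene lies upstream of the MGE
        return mge_start - gene_end
    return 0                      # ranges overlap


print(flank_distance((4668794, 4685291), (4685428, 4686645)))  # 137
```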


Gene organization within MGE regions


Location: 4668794..4686645
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NX720_RS18795 (NX720_18795) - 4668794..4669432 (+) 639 WP_262596661.1 VWA domain-containing protein -
  NX720_RS18800 (NX720_18800) - 4669467..4670114 (+) 648 WP_262596662.1 VWA domain-containing protein -
  NX720_RS18805 (NX720_18805) - 4670130..4671161 (+) 1032 WP_262596663.1 TerY-C metal binding domain-containing protein -
  NX720_RS18810 (NX720_18810) - 4671161..4673077 (+) 1917 WP_262596665.1 PP2C family serine/threonine-protein phosphatase -
  NX720_RS18815 (NX720_18815) - 4673070..4674566 (+) 1497 WP_262596667.1 helix-hairpin-helix domain-containing protein -
  NX720_RS18820 (NX720_18820) - 4675150..4675998 (-) 849 WP_262596669.1 hypothetical protein -
  NX720_RS18825 (NX720_18825) tnpC 4676652..4678205 (-) 1554 WP_262596671.1 IS66 family transposase -
  NX720_RS18830 (NX720_18830) tnpB 4678215..4678583 (-) 369 WP_262596672.1 IS66 family insertion sequence element accessory protein TnpB -
  NX720_RS18835 (NX720_18835) tnpA 4678585..4678920 (-) 336 WP_262596673.1 IS66 family insertion sequence element accessory protein TnpA -
  NX720_RS18840 (NX720_18840) - 4678917..4680824 (-) 1908 WP_262596675.1 TnsD family transposase -
  NX720_RS18845 (NX720_18845) - 4680860..4682236 (-) 1377 WP_262596677.1 ATP-binding protein -
  NX720_RS18850 (NX720_18850) - 4682252..4684468 (-) 2217 WP_262596678.1 Mu transposase C-terminal domain-containing protein -
  NX720_RS18855 (NX720_18855) - 4684470..4685291 (-) 822 WP_262596680.1 TnsA endonuclease N-terminal domain-containing protein -
  NX720_RS18860 (NX720_18860) pilC 4685428..4686645 (-) 1218 WP_262596681.1 type II secretion system F family protein Machinery gene

Sequence


Protein


Length: 405 a.a.        Molecular weight: 43608.37 Da        Isoelectric Point: 9.7718

>NTDB_id=721823 NX720_RS18860 WP_262596681.1 4685428..4686645(-) (pilC) [Endozoicomonas euniceicola strain EF212]
MAKKSAKSSTFIWEGKDKSGRKTKGEIEGTSIALIKAELRKQGISATRVKKKGMSFGKKGGKITPLDIALFTRQLATMIK
AGVPLLNAFDITTDGIEKPAMKELLVKVKNEVAGGTTLAEALRAHPLYFDDLYCNLVSSGEQSGALETLLDRIATYKEKS
EALKAKIKKAMNYPVAVVCVAFIVTGILLVKVVPQFEEVFQGFGAELPAFTQMVIHISNFVQQWWLAAILGLAAFGFMIK
KLMLRSKAARDKKDRLVLKLPVIGPILEKSAVARFARTLSTTFAAGVPLVDALDSVSGAAGNVVFADATNQIKEDVSTGQ
QLQFAMKSSGIFPAMAIQMVSIGEESGSLDEMLDKVATFYENEVDNAVDGLTSLMEPLIMSVLGVLVGGLIVAMYLPIFQ
MGSVV

Nucleotide


Length: 1218 bp

>NTDB_id=721823 NX720_RS18860 WP_262596681.1 4685428..4686645(-) (pilC) [Endozoicomonas euniceicola strain EF212]
ATGGCCAAAAAATCAGCCAAATCCTCTACTTTTATCTGGGAAGGTAAAGACAAGAGTGGACGTAAAACCAAAGGGGAAAT
AGAAGGTACCAGCATTGCCTTGATCAAGGCCGAGCTGCGTAAGCAGGGCATTTCTGCCACCAGGGTAAAAAAGAAAGGTA
TGTCCTTTGGCAAAAAAGGGGGCAAAATTACCCCCCTTGATATCGCCCTGTTCACCCGGCAGCTTGCCACTATGATAAAA
GCGGGTGTACCACTGCTCAACGCCTTTGACATAACAACAGACGGTATCGAAAAACCCGCCATGAAAGAGCTGCTTGTCAA
GGTTAAAAACGAGGTGGCAGGTGGTACGACTCTGGCCGAAGCCCTTCGGGCGCACCCACTCTATTTTGATGACCTGTACT
GCAACCTGGTCAGCTCCGGTGAACAGTCCGGAGCACTGGAAACATTGCTGGACAGAATTGCAACCTATAAAGAGAAGTCT
GAAGCCCTTAAAGCCAAAATCAAAAAAGCGATGAACTACCCGGTTGCGGTTGTCTGTGTTGCTTTCATTGTTACCGGCAT
TCTGCTGGTAAAAGTGGTGCCACAGTTTGAAGAAGTCTTTCAGGGATTCGGAGCTGAACTCCCGGCCTTCACCCAGATGG
TTATTCATATTTCCAACTTTGTTCAGCAATGGTGGCTGGCGGCTATTCTCGGACTGGCGGCCTTTGGTTTTATGATTAAA
AAACTGATGCTGCGCTCAAAAGCCGCAAGAGATAAAAAAGACAGGCTAGTGCTAAAGCTGCCTGTTATTGGTCCAATACT
CGAAAAGTCTGCCGTCGCACGTTTTGCCCGAACCCTTTCAACGACATTTGCTGCCGGTGTTCCATTAGTCGACGCACTGG
ACTCAGTATCAGGCGCTGCGGGCAATGTTGTCTTTGCGGATGCCACCAACCAAATCAAAGAAGATGTTTCCACAGGTCAG
CAGCTGCAATTTGCTATGAAGAGTTCAGGCATTTTTCCGGCGATGGCGATTCAGATGGTTTCCATCGGAGAAGAGTCTGG
CTCTCTGGATGAAATGCTGGATAAAGTCGCCACCTTCTACGAAAACGAAGTGGATAACGCCGTTGATGGCCTGACCAGCC
TGATGGAACCATTGATTATGTCGGTACTCGGGGTGCTGGTTGGAGGCCTGATTGTGGCTATGTACCTGCCTATCTTTCAG
ATGGGGTCTGTTGTGTAA
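
The nucleotide record is given in coding orientation (it starts with ATG and ends with TAA), so it can be translated directly and compared with the protein record. The sketch below assumes the standard codon assignments and, for brevity, translates only the first and last few codons of the sequence; the `translate` helper is illustrative and not a tool used by the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCCAAAAAATCAGCCAAATCC"))  # MAKKSAKS (start of the record)
print(translate("ATGGGGTCTGTTGTGTAA"))        # MGSVV (end of the record)
```

Both fragments agree with the corresponding ends of the protein sequence listed above.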


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.

[Figure: predicted transmembrane helix probability]


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  pilC   Pseudomonas stutzeri DSM 10701   67.16   100   0.672
  pilC   Acinetobacter baylyi ADP1   60.049   100   0.605
  pilC   Acinetobacter baumannii D1279779   59.753   100   0.598
  pilC   Legionella pneumophila strain ERS1305867   55.637   100   0.56
  pilG   Neisseria gonorrhoeae MS11   45.771   99.259   0.454
  pilG   Neisseria meningitidis 44/76-A   45.522   99.259   0.452
  pilC   Vibrio cholerae strain A1552   44.584   98.025   0.437
  pilC   Vibrio campbellii strain DS40M4   42.857   98.519   0.422
  pilC   Thermus thermophilus HB27   38.596   98.519   0.38