Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   HIV01_RS09815 Genome accession   NZ_CP071517
Coordinates   2100619..2101875 (+) Length   418 a.a.
NCBI ID   WP_200606764.1    Uniprot ID   -
Organism   Lysobacter arenosi strain R7     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 2090962..2104331 2100619..2101875 within 0


Gene organization within MGE regions


Location: 2090962..2104331
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  HIV01_RS09775 (HIV01_009775) - 2090962..2091363 (+) 402 WP_200606753.1 pilin -
  HIV01_RS09780 (HIV01_009780) - 2091439..2093343 (+) 1905 WP_200606754.1 hypothetical protein -
  HIV01_RS09785 (HIV01_009785) - 2093370..2094218 (+) 849 WP_200606755.1 hypothetical protein -
  HIV01_RS09790 (HIV01_009790) - 2094220..2095518 (+) 1299 WP_207526915.1 oligosaccharide flippase family protein -
  HIV01_RS09795 (HIV01_009795) - 2095555..2096619 (-) 1065 WP_200606757.1 glycosyltransferase family 4 protein -
  HIV01_RS09800 (HIV01_009800) - 2096623..2097711 (-) 1089 WP_200606758.1 UDP-N-acetylglucosamine 2-epimerase -
  HIV01_RS09805 (HIV01_009805) - 2097708..2098580 (-) 873 WP_200606760.1 glycosyltransferase family 2 protein -
  HIV01_RS09810 (HIV01_009810) pilB 2098753..2100477 (+) 1725 WP_200606762.1 type IV-A pilus assembly ATPase PilB Machinery gene
  HIV01_RS09815 (HIV01_009815) pilC 2100619..2101875 (+) 1257 WP_200606764.1 type II secretion system F family protein Machinery gene
  HIV01_RS09820 (HIV01_009820) pilD 2101913..2102776 (+) 864 WP_200606766.1 A24 family peptidase Machinery gene
  HIV01_RS09825 (HIV01_009825) - 2102805..2103554 (-) 750 WP_200606767.1 class I SAM-dependent methyltransferase -
  HIV01_RS09830 (HIV01_009830) coaE 2103708..2104331 (+) 624 WP_200606768.1 dephospho-CoA kinase -

Sequence


Protein


Download         Length: 418 a.a.        Molecular weight: 45306.92 Da        Isoelectric Point: 10.2685

>NTDB_id=545897 HIV01_RS09815 WP_200606764.1 2100619..2101875(+) (pilC) [Lysobacter arenosi strain R7]
MSATRTATKPATVRRTDAMPLFVWQGADKRGKIMKGETPAKNANSLRAELRKQGITPKVVKPKGKPLFGAAGKRITPLDI
AIFSRQIATMMKSGVPIVGAMEIIANGNKNPRMVTLVNAVRADVESGSSLYEALCKHPVQFDELYRNLVRAGEAAGVLET
VLDTVATYKENIEALKGKIKKAMFYPAMVMAVAMIVSSILLVFVVPQFEEVFKGFGADLPAFTKMIVALSRFMVAWWFVV
LLIVVGTIVAFIMAKNRSVAFARLLDRLILKLPVIGQIMHNSAIARFSRTLAVTFKAGVPLVEALDTVAGATGNVLYEDA
VRNIRDDVAVGYPVNVAMKQVNLFPHMVTQMTAIGEEAGALDAMLFKVAEFYEQEVNNAVDALASLLEPLIMVFLGVIVG
GMVIAMYLPIFKLAAAVG

Nucleotide


Download         Length: 1257 bp        

>NTDB_id=545897 HIV01_RS09815 WP_200606764.1 2100619..2101875(+) (pilC) [Lysobacter arenosi strain R7]
ATGTCCGCGACCCGCACCGCCACCAAGCCCGCCACCGTGCGCCGCACCGATGCCATGCCCTTGTTCGTCTGGCAGGGCGC
GGACAAGCGCGGCAAGATCATGAAGGGCGAAACACCGGCCAAGAATGCCAACTCGCTGCGCGCCGAACTGCGCAAGCAAG
GCATCACCCCGAAGGTGGTCAAGCCCAAGGGCAAGCCGCTGTTCGGCGCCGCCGGCAAGCGCATCACCCCGCTCGACATC
GCGATCTTCAGTCGTCAGATCGCCACGATGATGAAGTCCGGCGTGCCCATCGTCGGCGCCATGGAAATCATCGCCAACGG
CAACAAGAATCCCCGCATGGTGACGCTGGTCAATGCGGTGCGCGCCGATGTCGAAAGCGGCTCCTCGCTGTATGAAGCCC
TGTGCAAGCATCCGGTCCAGTTCGACGAGCTGTACCGCAACCTGGTCAGGGCCGGCGAAGCAGCCGGTGTGCTCGAGACG
GTTCTCGATACGGTCGCGACCTACAAGGAAAACATCGAAGCCCTGAAGGGCAAGATCAAGAAGGCGATGTTCTATCCGGC
AATGGTCATGGCCGTTGCGATGATCGTCAGTTCGATCCTCCTTGTCTTCGTGGTGCCGCAATTCGAGGAAGTCTTCAAGG
GTTTCGGCGCGGACCTGCCGGCCTTCACCAAGATGATCGTGGCGCTCAGCCGCTTCATGGTTGCCTGGTGGTTCGTGGTA
CTGCTGATCGTCGTGGGAACGATCGTCGCGTTCATCATGGCCAAGAACCGCTCGGTGGCCTTTGCGCGGCTGCTCGACCG
CCTGATCCTGAAGTTGCCGGTCATCGGCCAGATCATGCATAACTCGGCGATCGCGCGTTTCTCGCGCACGCTGGCCGTCA
CCTTCAAGGCCGGCGTGCCGCTGGTCGAGGCGCTGGACACTGTCGCAGGCGCCACCGGCAACGTCCTGTACGAGGACGCC
GTGCGCAATATTCGCGATGACGTCGCGGTCGGCTACCCGGTCAACGTGGCGATGAAGCAGGTCAACCTGTTCCCGCACAT
GGTCACCCAGATGACCGCGATCGGCGAAGAGGCCGGCGCGCTCGATGCCATGTTGTTCAAGGTCGCCGAGTTCTACGAGC
AGGAGGTCAACAACGCGGTCGACGCACTGGCGAGCCTGCTGGAACCGCTGATCATGGTGTTCCTGGGCGTGATCGTCGGC
GGCATGGTCATCGCGATGTACCTTCCGATCTTCAAACTGGCGGCGGCGGTCGGCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Legionella pneumophila strain ERS1305867

56.171

94.976

0.533

  pilC Pseudomonas stutzeri DSM 10701

54.66

94.976

0.519

  pilC Acinetobacter baumannii D1279779

53.202

97.129

0.517

  pilC Acinetobacter baylyi ADP1

52.099

96.89

0.505

  pilG Neisseria gonorrhoeae MS11

42.714

95.215

0.407

  pilG Neisseria meningitidis 44/76-A

42.462

95.215

0.404

  pilC Vibrio campbellii strain DS40M4

42.065

94.976

0.4

  pilC Vibrio cholerae strain A1552

40.686

97.608

0.397

  pilC Thermus thermophilus HB27

40.199

96.411

0.388