Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   LOY37_RS23015 Genome accession   NZ_CP087197
Coordinates   5033122..5034330 (+) Length   402 a.a.
NCBI ID   WP_258715811.1    Uniprot ID   -
Organism   Pseudomonas sp. B21-012     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 5010017..5032186 5033122..5034330 flank 936


Gene organization within MGE regions


Location: 5010017..5034330
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LOY37_RS22860 (LOY37_22840) - 5010017..5010385 (+) 369 WP_249744399.1 YcgJ family protein -
  LOY37_RS22865 (LOY37_22845) - 5010452..5010982 (-) 531 WP_258715787.1 hypothetical protein -
  LOY37_RS22870 (LOY37_22850) - 5011778..5012011 (-) 234 WP_258715788.1 hypothetical protein -
  LOY37_RS22875 (LOY37_22855) - 5012529..5014859 (-) 2331 WP_258715789.1 YecA family protein -
  LOY37_RS22880 (LOY37_22860) - 5015851..5016693 (-) 843 WP_309475536.1 competence protein CoiA family protein -
  LOY37_RS22885 (LOY37_22865) - 5016789..5017211 (-) 423 WP_258715790.1 hypothetical protein -
  LOY37_RS22890 (LOY37_22870) - 5017208..5017690 (-) 483 WP_258715791.1 ImmA/IrrE family metallo-endopeptidase -
  LOY37_RS22895 (LOY37_22875) - 5018043..5018270 (-) 228 WP_258715792.1 hypothetical protein -
  LOY37_RS22900 (LOY37_22880) - 5018502..5019227 (+) 726 WP_258715793.1 hypothetical protein -
  LOY37_RS22905 (LOY37_22885) - 5020085..5020696 (-) 612 WP_258715794.1 tail assembly protein -
  LOY37_RS22910 (LOY37_22890) - 5020696..5021046 (-) 351 WP_258715795.1 hypothetical protein -
  LOY37_RS22915 - 5021185..5021316 (-) 132 WP_258715796.1 hypothetical protein -
  LOY37_RS22920 (LOY37_22895) - 5021530..5022129 (-) 600 WP_258715797.1 hypothetical protein -
  LOY37_RS22925 (LOY37_22900) - 5022133..5022732 (-) 600 WP_258715798.1 hypothetical protein -
  LOY37_RS22930 (LOY37_22905) - 5022729..5023031 (-) 303 WP_258715799.1 hypothetical protein -
  LOY37_RS22935 (LOY37_22910) - 5023368..5025167 (-) 1800 WP_258715800.1 DUF927 domain-containing protein -
  LOY37_RS22940 (LOY37_22915) - 5025154..5026044 (-) 891 WP_258715801.1 toprim domain-containing protein -
  LOY37_RS22945 (LOY37_22920) - 5026041..5026268 (-) 228 WP_258715802.1 hypothetical protein -
  LOY37_RS22950 (LOY37_22925) - 5026261..5026548 (-) 288 WP_258715803.1 hypothetical protein -
  LOY37_RS22955 (LOY37_22930) - 5026553..5026897 (-) 345 WP_254285660.1 hypothetical protein -
  LOY37_RS22960 (LOY37_22935) - 5026894..5027109 (-) 216 WP_210014866.1 hypothetical protein -
  LOY37_RS22965 (LOY37_22940) - 5027106..5027339 (-) 234 WP_236066298.1 hypothetical protein -
  LOY37_RS22970 (LOY37_22945) - 5027408..5027740 (-) 333 WP_258715804.1 hypothetical protein -
  LOY37_RS22975 (LOY37_22950) - 5027886..5028113 (-) 228 WP_232531808.1 hypothetical protein -
  LOY37_RS22980 (LOY37_22955) - 5028431..5029078 (-) 648 WP_258715805.1 KilA-N domain-containing protein -
  LOY37_RS22985 (LOY37_22960) - 5029075..5029377 (-) 303 WP_258715806.1 helix-turn-helix domain-containing protein -
  LOY37_RS22990 (LOY37_22965) - 5029377..5029637 (-) 261 WP_258715807.1 AlpA family transcriptional regulator -
  LOY37_RS22995 (LOY37_22970) - 5029802..5030695 (-) 894 WP_258715808.1 DUF6387 family protein -
  LOY37_RS23000 (LOY37_22975) - 5030873..5032186 (-) 1314 WP_258715809.1 phage integrase central domain-containing protein -
  LOY37_RS23010 (LOY37_22985) - 5032494..5032904 (-) 411 WP_258715810.1 pilin -
  LOY37_RS23015 (LOY37_22990) pilC 5033122..5034330 (+) 1209 WP_258715811.1 type II secretion system F family protein Machinery gene

Sequence


Protein


Download         Length: 402 a.a.        Molecular weight: 43035.58 Da        Isoelectric Point: 10.4634

>NTDB_id=627909 LOY37_RS23015 WP_258715811.1 5033122..5034330(+) (pilC) [Pseudomonas sp. B21-012]
MTESNRMYSWQGTNAQGALISGQSAARSPALIRAQLRQRGIRPGRIASVRPPLWQWRRPLDGKALSQFSRQLATLVKAGV
ALLQALDVVAQSSANPALASLLGAIKADIAAGSSLATALRRHPQYFDRLYCNLVDAGEQSGSLETVLAQVAALQEQRQAL
RMRLKKAMTYPLLVLLVGFGVSALLLLEVVPRFQSLFGGFGAELPAFTQQVIGLSQWLGAHLGWLLITFGGACGALVWAY
RHKPGVRLWLVRVALRLPVLGSLLRHAALARFARTLSTSFAAGVPLVEALGPVAGACGNPLYEQAVHKVRDDLASGLALN
VALRTSGLFPALVVQMCAIGETAGTLDDMLARIASHYEQAIDHLLDTLTALIEPLIVLILGLLVGSLVIAMYLPIFQLGN
VI

Nucleotide


Download         Length: 1209 bp        

>NTDB_id=627909 LOY37_RS23015 WP_258715811.1 5033122..5034330(+) (pilC) [Pseudomonas sp. B21-012]
ATGACCGAAAGCAACCGGATGTACAGCTGGCAAGGCACCAACGCCCAAGGTGCCTTGATCAGCGGCCAGAGTGCGGCGCG
CAGCCCGGCGTTGATCAGGGCGCAGTTGCGCCAGCGCGGGATTCGCCCGGGGCGTATCGCCAGTGTGCGCCCGCCGTTGT
GGCAATGGCGGCGGCCACTGGATGGCAAGGCGTTGAGCCAGTTCAGCCGCCAGCTGGCCACCCTGGTCAAGGCCGGGGTG
GCGCTGTTGCAGGCGCTGGATGTAGTGGCCCAGAGCAGCGCCAACCCGGCGCTGGCCAGCCTGCTGGGGGCGATCAAGGC
CGATATCGCCGCTGGCAGCAGCCTGGCCACTGCGCTGCGCCGCCACCCGCAGTACTTCGATCGGCTGTACTGCAACCTGG
TGGACGCCGGTGAGCAGTCCGGCAGCCTGGAGACGGTGCTGGCCCAGGTGGCGGCGCTGCAGGAGCAACGCCAGGCCCTG
CGCATGCGGCTGAAAAAAGCCATGACCTATCCTCTGCTGGTGTTGCTGGTGGGCTTTGGGGTGTCGGCGTTGTTATTGCT
GGAGGTGGTGCCGCGCTTTCAGTCGTTGTTCGGCGGTTTTGGTGCCGAGCTGCCGGCGTTTACTCAACAGGTGATCGGCT
TGTCGCAGTGGTTGGGCGCGCACCTTGGGTGGCTGTTGATAACGTTCGGCGGCGCTTGTGGTGCGCTGGTTTGGGCTTAT
CGGCACAAGCCTGGGGTGCGTTTGTGGCTGGTGCGCGTGGCCCTGCGCTTGCCGGTGCTGGGCAGCCTGCTCCGGCACGC
CGCCCTCGCCCGCTTTGCCCGCACCCTGTCGACCTCGTTTGCCGCCGGGGTGCCGCTGGTGGAGGCGCTGGGGCCGGTTG
CCGGGGCCTGTGGCAATCCGCTGTATGAGCAGGCGGTGCACAAGGTGCGCGACGACCTTGCCAGTGGCCTGGCCTTGAAC
GTGGCGTTGCGCACCAGCGGGCTGTTCCCGGCGCTGGTGGTGCAGATGTGCGCCATTGGCGAGACAGCCGGCACGCTGGA
CGATATGCTCGCCAGGATCGCCAGCCACTATGAACAGGCCATCGACCACCTGCTCGACACCCTCACGGCCCTGATTGAGC
CGTTGATCGTGCTGATTCTGGGGTTGCTGGTGGGCAGCCTGGTGATCGCCATGTACCTGCCAATCTTCCAGCTGGGCAAT
GTGATCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Pseudomonas stutzeri DSM 10701

49.624

99.254

0.493

  pilC Legionella pneumophila strain ERS1305867

45.477

99.005

0.45

  pilC Acinetobacter baylyi ADP1

44.888

99.751

0.448

  pilC Acinetobacter baumannii D1279779

44.081

98.756

0.435

  pilC Vibrio cholerae strain A1552

39.698

99.005

0.393

  pilG Neisseria meningitidis 44/76-A

39.055

100

0.391

  pilG Neisseria gonorrhoeae MS11

38.806

100

0.388

  pilC Vibrio campbellii strain DS40M4

38.272

100

0.386

  pilC Thermus thermophilus HB27

37.656

99.751

0.376