Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilB   Type   Machinery gene
Locus tag   INQ40_RS10115 Genome accession   NZ_CP063772
Coordinates   2212465..2214195 (+) Length   576 a.a.
NCBI ID   WP_194340330.1    Uniprot ID   -
Organism   Lysobacter sp. H21R4     
Function   power the assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 2196233..2228294 2212465..2214195 within 0


Gene organization within MGE regions


Location: 2196233..2228294
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  INQ40_RS10020 (INQ40_10020) - 2196233..2197336 (+) 1104 WP_228481988.1 M23 family metallopeptidase -
  INQ40_RS10025 (INQ40_10025) - 2198211..2198705 (-) 495 WP_194340318.1 hypothetical protein -
  INQ40_RS10030 (INQ40_10030) - 2199132..2199620 (-) 489 WP_194340319.1 hypothetical protein -
  INQ40_RS10035 (INQ40_10035) - 2199671..2199862 (-) 192 Protein_1954 IS5/IS1182 family transposase -
  INQ40_RS10040 (INQ40_10040) - 2199925..2200710 (-) 786 WP_194340320.1 hypothetical protein -
  INQ40_RS13085 - 2200704..2203904 (-) 3201 WP_345776520.1 peptidoglycan DD-metalloendopeptidase family protein -
  INQ40_RS10055 (INQ40_10055) - 2204408..2207002 (-) 2595 WP_194340321.1 type VI secretion system Vgr family protein -
  INQ40_RS10060 (INQ40_10060) - 2207254..2207556 (-) 303 WP_194340322.1 type II toxin-antitoxin system RelE/ParE family toxin -
  INQ40_RS10065 (INQ40_10065) - 2207544..2207801 (-) 258 WP_194340323.1 type II toxin-antitoxin system Phd/YefM family antitoxin -
  INQ40_RS10070 (INQ40_10070) - 2207922..2208275 (+) 354 WP_194340324.1 hypothetical protein -
  INQ40_RS10075 (INQ40_10075) - 2208236..2208796 (+) 561 WP_194340325.1 hypothetical protein -
  INQ40_RS10080 (INQ40_10080) pilA2 2209029..2209526 (+) 498 WP_194340326.1 pilin Machinery gene
  INQ40_RS10085 (INQ40_10085) - 2209691..2209987 (+) 297 WP_194340327.1 hypothetical protein -
  INQ40_RS10090 (INQ40_10090) - 2210134..2210448 (+) 315 WP_194340328.1 type II toxin-antitoxin system RelE/ParE family toxin -
  INQ40_RS10095 (INQ40_10095) nadS 2210451..2210741 (+) 291 WP_194340329.1 NadS family protein -
  INQ40_RS13090 - 2211206..2211532 (+) 327 WP_228482201.1 Fic family protein -
  INQ40_RS13095 - 2211590..2211916 (+) 327 WP_228481989.1 hypothetical protein -
  INQ40_RS13180 - 2211975..2212055 (+) 81 Protein_1968 hypothetical protein -
  INQ40_RS10110 (INQ40_10110) - 2212236..2212418 (-) 183 WP_228481990.1 hypothetical protein -
  INQ40_RS10115 (INQ40_10115) pilB 2212465..2214195 (+) 1731 WP_194340330.1 type IV-A pilus assembly ATPase PilB Machinery gene
  INQ40_RS10120 (INQ40_10120) - 2214293..2214595 (+) 303 WP_228481991.1 helix-turn-helix transcriptional regulator -
  INQ40_RS10125 (INQ40_10125) - 2214592..2215821 (+) 1230 WP_194340331.1 type II toxin-antitoxin system HipA family toxin -
  INQ40_RS10130 (INQ40_10130) pilC 2216043..2217302 (+) 1260 WP_194340332.1 type II secretion system F family protein Machinery gene
  INQ40_RS10135 (INQ40_10135) pilD 2217316..2218179 (+) 864 WP_194340333.1 A24 family peptidase Machinery gene
  INQ40_RS10140 (INQ40_10140) coaE 2218265..2218885 (+) 621 WP_194340334.1 dephospho-CoA kinase -
  INQ40_RS10145 (INQ40_10145) - 2219008..2220000 (-) 993 WP_194340335.1 Nudix family hydrolase -
  INQ40_RS10150 (INQ40_10150) secA 2220076..2222820 (-) 2745 WP_194340336.1 preprotein translocase subunit SecA -
  INQ40_RS10155 (INQ40_10155) - 2222980..2223813 (-) 834 WP_228482202.1 M23 family metallopeptidase -
  INQ40_RS10160 (INQ40_10160) - 2223896..2224354 (+) 459 WP_194340338.1 DUF721 domain-containing protein -
  INQ40_RS10165 (INQ40_10165) lpxC 2224547..2225464 (-) 918 WP_194340339.1 UDP-3-O-acyl-N-acetylglucosamine deacetylase -
  INQ40_RS10170 (INQ40_10170) ftsZ 2225699..2226934 (-) 1236 WP_194340340.1 cell division protein FtsZ -
  INQ40_RS10175 (INQ40_10175) ftsA 2227044..2228294 (-) 1251 WP_043958191.1 cell division protein FtsA -

Sequence


Protein


Download         Length: 576 a.a.        Molecular weight: 62983.23 Da        Isoelectric Point: 5.8951

>NTDB_id=496701 INQ40_RS10115 WP_194340330.1 2212465..2214195(+) (pilB) [Lysobacter sp. H21R4]
MNAVATANLVGITGIARRLVQDGALSESAAREAMAAATEQRKPLAAYIIEKRLVAPAHLAAANSVEFGVPIFDAAAMDPQ
QSAIKLVSEELLRKHTVLPLFKRGNRLFVGISEPTNTHALSEIKFQTNFTVEAILVDEESIKRHLDRWLENADALGDAMG
EDEEGLENLDVGGGDDELSADSGIDAKTDDTPVVKFINRVLVDAIRRGASDIHFEPYETEYRVRLRIDGLLKQSARVPIK
LQPRISARLKVMAQLDIAERRVPQDGRIKLNLSKTRQIDFRVSTLPTLFGEKIVLRILDGSAAKLGIEKLGYEDDQRDIF
LAAVKRPYGMVLVTGPTGSGKTVSLYTALNILNDETRNISTVEDPVEIRVPGINQVQMNVKRGMTFAAALRSFLRQDPDV
IMVGEIRDLETAEIGIKAAQTGHMVLSTLHTNDAPQTIARLMNMGVAPFNITSSVTLVIAQRLARRLHDCKHPVELPEHA
LLAEGFTAEEIASPDFRIFEAVGCGDCTEGYKGRTGIYQVMPMTDEIQGIVLAGGNAMQIAEAAQKSGVRDLRQSALMKV
RNGVTSLAEVNRVTKD

Nucleotide


Download         Length: 1731 bp        

>NTDB_id=496701 INQ40_RS10115 WP_194340330.1 2212465..2214195(+) (pilB) [Lysobacter sp. H21R4]
ATGAACGCCGTAGCCACTGCCAACCTTGTCGGCATCACCGGCATCGCCCGCCGCCTCGTGCAGGACGGCGCGCTCTCCGA
GTCCGCCGCCCGCGAGGCCATGGCCGCCGCGACCGAGCAGCGCAAACCGCTGGCCGCTTACATTATCGAGAAGCGACTGG
TCGCGCCCGCGCACCTGGCGGCCGCGAACTCGGTCGAGTTCGGTGTGCCGATCTTCGATGCCGCGGCGATGGACCCCCAA
CAGTCGGCGATCAAGCTGGTCAGCGAAGAACTGCTGCGCAAACACACGGTGCTGCCGCTGTTCAAGCGCGGCAACCGTTT
GTTTGTCGGCATCTCCGAGCCGACCAACACCCACGCGCTGTCGGAGATCAAGTTCCAGACCAACTTCACCGTCGAGGCGA
TCCTGGTCGATGAGGAAAGCATCAAGCGTCACCTGGACCGCTGGCTGGAGAACGCGGACGCGCTCGGCGACGCCATGGGC
GAGGACGAGGAAGGGCTGGAGAATCTGGACGTCGGGGGGGGAGACGATGAGCTGTCGGCGGACTCCGGCATCGACGCCAA
GACCGACGACACCCCCGTCGTCAAGTTCATCAACAGGGTGCTGGTGGACGCGATCCGCCGCGGCGCCTCCGACATCCACT
TCGAGCCCTACGAGACCGAGTACCGCGTGCGCCTGCGCATCGACGGCCTGCTGAAGCAGTCGGCCAGGGTGCCGATCAAG
CTGCAGCCGCGCATCTCCGCGCGCCTGAAGGTGATGGCGCAGCTGGACATCGCCGAGCGCCGGGTGCCGCAGGACGGGCG
CATCAAGCTCAACCTGAGCAAGACCAGGCAGATCGACTTCCGTGTCAGCACGCTGCCGACCCTGTTCGGCGAGAAGATCG
TGCTGCGTATCCTCGACGGCAGCGCCGCCAAGCTGGGCATCGAGAAGCTCGGCTACGAGGACGACCAGCGCGATATCTTC
CTCGCCGCGGTCAAGCGCCCCTACGGCATGGTGCTGGTCACCGGCCCGACCGGGTCCGGCAAGACGGTCTCGCTGTACAC
CGCGCTGAACATCCTCAACGACGAGACCCGCAACATCTCCACGGTCGAGGACCCGGTGGAAATCCGCGTGCCGGGCATCA
ATCAGGTGCAGATGAACGTCAAGCGCGGCATGACCTTCGCCGCCGCGCTGCGCAGCTTCCTGCGCCAGGATCCCGACGTG
ATCATGGTCGGCGAGATCCGCGACCTGGAAACCGCCGAGATCGGCATCAAGGCCGCGCAGACCGGTCACATGGTGCTGTC
CACCCTGCACACCAACGACGCGCCGCAGACCATCGCGCGTCTGATGAACATGGGCGTGGCGCCGTTCAACATCACCTCAT
CGGTGACGCTGGTGATCGCCCAGCGACTGGCGCGCCGGCTGCATGACTGCAAGCACCCGGTGGAACTGCCCGAGCACGCG
CTGCTGGCCGAAGGCTTTACCGCCGAGGAGATCGCCTCGCCCGACTTCAGGATCTTCGAGGCGGTCGGCTGCGGGGACTG
CACCGAAGGCTACAAGGGCCGCACCGGCATTTACCAGGTCATGCCGATGACCGACGAGATCCAGGGCATCGTGCTGGCCG
GCGGCAACGCGATGCAGATCGCCGAGGCCGCGCAGAAGTCCGGCGTGCGCGACCTGCGCCAGTCGGCGCTGATGAAGGTC
CGCAACGGCGTCACCAGCCTGGCGGAGGTCAACCGGGTGACCAAGGACTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilB Acinetobacter baumannii D1279779

55.282

98.611

0.545

  pilB Acinetobacter baylyi ADP1

52.86

100

0.53

  pilB Legionella pneumophila strain ERS1305867

50.352

98.611

0.497

  pilB Vibrio cholerae strain A1552

52.918

89.236

0.472

  pilF Neisseria gonorrhoeae MS11

46.737

98.438

0.46

  pilB Vibrio parahaemolyticus RIMD 2210633

50.503

86.285

0.436

  pilB Vibrio campbellii strain DS40M4

50.607

85.764

0.434

  pilB Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539

36.745

100

0.38

  pilF Thermus thermophilus HB27

36.61

100

0.375