Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilA   Type   Machinery gene
Locus tag   H5647_RS05195 Genome accession   NZ_CP060092
Coordinates   1165643..1166650 (+) Length   335 a.a.
NCBI ID   WP_045861087.1    Uniprot ID   -
Organism   Teredinibacter purpureus strain Bs12     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 1138381..1194723 1165643..1166650 within 0


Gene organization within MGE regions


Location: 1138381..1194723
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  H5647_RS05060 - 1138981..1139541 (+) 561 WP_045856816.1 hypothetical protein -
  H5647_RS05065 - 1139665..1140099 (+) 435 WP_045859537.1 hypothetical protein -
  H5647_RS05070 - 1140236..1140694 (+) 459 WP_045856819.1 hypothetical protein -
  H5647_RS05075 - 1141294..1142271 (-) 978 WP_045856823.1 hypothetical protein -
  H5647_RS05080 - 1142394..1143617 (+) 1224 WP_045856826.1 ISL3 family transposase -
  H5647_RS05085 - 1143796..1144659 (-) 864 WP_045856828.1 patatin-like phospholipase family protein -
  H5647_RS05090 - 1144880..1145323 (-) 444 WP_045856830.1 hypothetical protein -
  H5647_RS05095 - 1145310..1145984 (-) 675 WP_236074775.1 response regulator transcription factor -
  H5647_RS05100 - 1145984..1147222 (-) 1239 WP_045856835.1 sensor histidine kinase -
  H5647_RS05105 - 1147395..1147640 (+) 246 WP_045856836.1 hypothetical protein -
  H5647_RS05110 - 1147765..1148589 (+) 825 WP_052691878.1 hypothetical protein -
  H5647_RS05115 - 1148591..1149646 (+) 1056 WP_045856838.1 lipopolysaccharide assembly protein LapB -
  H5647_RS05120 - 1149643..1150053 (+) 411 WP_045856840.1 hypothetical protein -
  H5647_RS05125 - 1150276..1150737 (-) 462 WP_052691879.1 ATP-binding protein -
  H5647_RS05130 - 1151510..1154089 (+) 2580 WP_045856845.1 cellulose binding domain-containing protein -
  H5647_RS05135 - 1154541..1155227 (+) 687 WP_045856847.1 TorF family putative porin -
  H5647_RS05140 glnK 1155321..1155659 (+) 339 WP_045856849.1 P-II family nitrogen regulator -
  H5647_RS05145 - 1155708..1157006 (+) 1299 WP_045856851.1 ammonium transporter -
  H5647_RS05150 - 1157302..1157640 (+) 339 WP_045856854.1 P-II family nitrogen regulator -
  H5647_RS05155 - 1157963..1158178 (+) 216 WP_236075080.1 hypothetical protein -
  H5647_RS05160 - 1158227..1158916 (-) 690 WP_045856858.1 DUF484 family protein -
  H5647_RS05165 dapF 1158913..1159743 (-) 831 WP_045856860.1 diaminopimelate epimerase -
  H5647_RS05170 lysA 1159743..1161005 (-) 1263 WP_045856863.1 diaminopimelate decarboxylase -
  H5647_RS05175 lptM 1161040..1161240 (-) 201 WP_045856865.1 LPS translocon maturation chaperone LptM -
  H5647_RS05180 - 1161389..1164277 (-) 2889 WP_045856868.1 class I adenylate cyclase -
  H5647_RS05185 coaD 1164370..1164885 (-) 516 WP_045861086.1 pantetheine-phosphate adenylyltransferase -
  H5647_RS05190 rsmD 1164992..1165621 (-) 630 WP_328286796.1 16S rRNA (guanine(966)-N(2))-methyltransferase RsmD -
  H5647_RS05195 pilA 1165643..1166650 (+) 1008 WP_045861087.1 signal recognition particle-docking protein FtsY Machinery gene
  H5647_RS05200 ftsE 1166679..1167362 (+) 684 WP_045856872.1 cell division ATP-binding protein FtsE -
  H5647_RS05205 ftsX 1167362..1168330 (+) 969 WP_045856874.1 permease-like cell division protein FtsX -
  H5647_RS05210 rpoH 1168436..1169290 (+) 855 WP_045856876.1 RNA polymerase sigma factor RpoH -
  H5647_RS05215 - 1169394..1169772 (+) 379 Protein_1040 DUF423 domain-containing protein -
  H5647_RS05220 trmB 1169772..1170458 (+) 687 WP_045856878.1 tRNA (guanosine(46)-N7)-methyltransferase TrmB -
  H5647_RS05225 - 1170507..1171130 (+) 624 WP_121495351.1 TIGR02444 family protein -
  H5647_RS05230 - 1171277..1172094 (+) 818 Protein_1043 DUF4372 domain-containing protein -
  H5647_RS05235 - 1172162..1173103 (-) 942 WP_045856880.1 hypothetical protein -
  H5647_RS05240 - 1173618..1174415 (+) 798 WP_045856882.1 FKBP-type peptidyl-prolyl cis-trans isomerase -
  H5647_RS05245 - 1174432..1175466 (-) 1035 WP_045856884.1 WD40 repeat domain-containing protein -
  H5647_RS05250 - 1175581..1176042 (-) 462 WP_045856887.1 Rsd/AlgQ family anti-sigma factor -
  H5647_RS05255 - 1176327..1176782 (+) 456 WP_121495352.1 flagellar basal body-associated FliL family protein -
  H5647_RS05260 gshA 1176842..1178449 (-) 1608 WP_045856889.1 glutamate--cysteine ligase -
  H5647_RS05265 - 1178454..1178843 (-) 390 WP_052691881.1 hypothetical protein -
  H5647_RS05270 gspC 1179383..1180363 (+) 981 WP_045856891.1 type II secretion system protein GspC -
  H5647_RS05275 gspD 1180368..1182320 (+) 1953 WP_236074777.1 type II secretion system secretin GspD -
  H5647_RS05280 gspE 1182326..1183795 (+) 1470 WP_045856896.1 type II secretion system ATPase GspE -
  H5647_RS05285 gspF 1183797..1185026 (+) 1230 WP_045856898.1 type II secretion system inner membrane protein GspF -
  H5647_RS05290 gspG 1185054..1185506 (+) 453 WP_045856900.1 type II secretion system major pseudopilin GspG -
  H5647_RS05295 - 1185525..1186115 (+) 591 WP_045856902.1 prepilin-type N-terminal cleavage/methylation domain-containing protein -
  H5647_RS05300 gspI 1186126..1186548 (+) 423 WP_045861091.1 type II secretion system minor pseudopilin GspI -
  H5647_RS05305 gspJ 1186545..1187159 (+) 615 WP_236074779.1 type II secretion system minor pseudopilin GspJ -
  H5647_RS05310 gspK 1187164..1188237 (+) 1074 WP_045856904.1 type II secretion system minor pseudopilin GspK -
  H5647_RS05315 gspL 1188254..1189486 (+) 1233 WP_045856905.1 type II secretion system protein GspL -
  H5647_RS05320 gspM 1189486..1189965 (+) 480 WP_045856907.1 type II secretion system protein GspM -
  H5647_RS05325 - 1189974..1190741 (+) 768 WP_045856909.1 type II secretion system protein N -
  H5647_RS05330 - 1190897..1191808 (+) 912 WP_408034021.1 FAD:protein FMN transferase -
  H5647_RS05335 - 1191771..1192838 (-) 1068 WP_082086958.1 integron integrase -
  H5647_RS05340 - 1193450..1193965 (+) 516 WP_408034022.1 AraC family transcriptional regulator -

Sequence


Protein


Download         Length: 335 a.a.        Molecular weight: 35906.16 Da        Isoelectric Point: 6.0703

>NTDB_id=473922 H5647_RS05195 WP_045861087.1 1165643..1166650(+) (pilA) [Teredinibacter purpureus strain Bs12]
MFFKRKKNTPEESSPKKAEIEQPQEKLSLLARMKRGLSRTSSQFSSGLATLLMGKKAINDELLEEIETLLLMADVGVDAT
TEIIDNLTERVARKQLADSDALFTALKDSLSALLQQVESPLVIDSEKKPYVILVVGVNGVGKTTTIGKLAKRLQNEGKSV
MLAAGDTFRAAAVEQLEVWGERNEVPVVAQHTGADSASVIYDAVQSAQSRGIDVVIADTAGRLHNKSNLMEELSKVKRVM
GKIDGTAPHEILLVLDAGTGQNAVSQTDHFLKAAGVTGLALTKLDGTAKGGIIFALSKKHQLPVRFIGVGEGIDDLQPFS
ASNFIDALFGEGKAE

Nucleotide


Download         Length: 1008 bp        

>NTDB_id=473922 H5647_RS05195 WP_045861087.1 1165643..1166650(+) (pilA) [Teredinibacter purpureus strain Bs12]
ATGTTTTTTAAGCGTAAGAAAAACACCCCTGAAGAATCTTCTCCTAAGAAGGCCGAGATCGAGCAGCCGCAAGAAAAACT
CAGCCTATTGGCTAGAATGAAACGCGGCTTGTCGCGTACCAGCAGCCAGTTCTCTTCCGGCTTGGCCACCCTACTAATGG
GCAAGAAGGCAATCAACGACGAATTGCTTGAAGAAATTGAAACCTTATTGTTAATGGCTGACGTGGGCGTAGATGCCACG
ACCGAAATTATCGATAATCTTACCGAGCGAGTCGCTCGCAAGCAGCTGGCCGATTCCGATGCCTTATTTACAGCGCTCAA
AGATTCCTTAAGCGCTTTATTGCAGCAGGTAGAATCACCGTTAGTAATTGATTCGGAGAAGAAGCCCTACGTTATTCTTG
TGGTAGGCGTAAATGGTGTGGGCAAAACAACAACTATTGGGAAGTTGGCAAAACGGCTTCAGAATGAAGGTAAATCGGTA
ATGTTGGCCGCAGGCGATACGTTTCGTGCAGCGGCTGTCGAACAGCTCGAAGTGTGGGGCGAGCGCAATGAGGTACCCGT
TGTTGCGCAGCATACAGGGGCCGATAGTGCATCGGTCATATATGATGCCGTGCAGTCTGCGCAGTCGCGTGGTATTGATG
TGGTAATAGCCGATACCGCCGGCCGCTTACACAATAAAAGTAACTTAATGGAAGAGCTCTCCAAAGTTAAACGCGTGATG
GGAAAGATCGATGGCACAGCGCCTCACGAAATATTATTGGTGCTGGATGCAGGAACGGGGCAAAATGCGGTGAGCCAAAC
CGATCATTTTCTTAAGGCCGCAGGTGTTACGGGATTAGCGTTAACCAAGCTAGACGGTACTGCCAAAGGCGGTATTATTT
TCGCCCTGAGTAAAAAACATCAATTACCGGTTCGCTTTATTGGCGTAGGCGAAGGCATTGATGACCTCCAACCTTTTTCC
GCCAGCAACTTTATAGATGCGTTGTTTGGCGAGGGAAAAGCAGAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilA Neisseria gonorrhoeae MS11

54.248

91.343

0.496