Detailed information    

In silico: bioinformatically predicted

Overview


Name   pilU
Type   Machinery gene
Locus tag   KFB96_RS14445
Genome accession   NZ_CP073760
Coordinates   3178124..3179281 (-)
Length   385 a.a.
NCBI ID   WP_213457781.1
UniProt ID   -
Organism   MAG: Thiocapsa sp. isolate M50B4
Function   assembly of type IV pilus (predicted from homology)
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates   Gene coordinates   Relative position   Distance (bp)
ICE   3179169..3222054   3178124..3179281   flank   -112
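
The Distance column appears to be signed. The ICE starts at 3179169 and pilU ends at 3179281, and 3179169 - 3179281 = -112, so a negative value would mean the gene runs past the ICE boundary rather than sitting outside it. The sketch below reproduces that arithmetic under this assumed reading. The function name `signed_gap` is hypothetical and is not part of VRprofile2.

```python
def signed_gap(gene, mge):
    """Signed distance (bp) between a flanking gene and an MGE.

    Both arguments are (start, end) tuples with start <= end.
    Assumed convention, inferred from the table above: a negative
    result means the two intervals overlap at the MGE boundary.
    """
    gene_start, gene_end = gene
    mge_start, mge_end = mge
    if gene_start <= mge_start:
        # Gene sits on the left flank of the MGE.
        return mge_start - gene_end
    # Gene sits on the right flank of the MGE.
    return gene_start - mge_end


pilU = (3178124, 3179281)
ice = (3179169, 3222054)
print(signed_gap(pilU, ice))  # -112
```

The result matches the listed value, which is consistent with the assumed convention but does not confirm how the database itself defines the column.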


Gene organization within MGE regions


Location: 3178124..3222054
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  KFB96_RS14445 (KFB96_14445) pilU 3178124..3179281 (-) 1158 WP_213457781.1 PilT/PilU family type 4a pilus ATPase Machinery gene
  KFB96_RS14450 (KFB96_14450) pilT 3179383..3180420 (-) 1038 WP_213457780.1 type IV pilus twitching motility protein PilT Machinery gene
  KFB96_RS14455 (KFB96_14455) - 3180856..3181638 (+) 783 WP_213457779.1 (Fe-S)-binding protein -
  KFB96_RS14460 (KFB96_14460) - 3181950..3182273 (+) 324 WP_213457778.1 type II toxin-antitoxin system RelE/ParE family toxin -
  KFB96_RS14465 (KFB96_14465) - 3182270..3182599 (+) 330 WP_093036041.1 addiction module antidote protein -
  KFB96_RS14470 (KFB96_14470) - 3183045..3184454 (+) 1410 WP_213457777.1 LutB/LldF family L-lactate oxidation iron-sulfur protein -
  KFB96_RS14475 (KFB96_14475) - 3184451..3185095 (+) 645 WP_213457776.1 LUD domain-containing protein -
  KFB96_RS14480 (KFB96_14480) - 3185206..3186105 (-) 900 WP_213457775.1 hypothetical protein -
  KFB96_RS14485 (KFB96_14485) - 3186330..3187142 (+) 813 WP_213457774.1 diguanylate cyclase -
  KFB96_RS14490 (KFB96_14490) - 3187420..3192882 (+) 5463 WP_213457773.1 RecQ family ATP-dependent DNA helicase -
  KFB96_RS14495 (KFB96_14495) - 3193368..3194234 (-) 867 WP_213457710.1 IS110 family transposase -
  KFB96_RS14500 (KFB96_14500) - 3194865..3196004 (-) 1140 WP_213457771.1 DUF4350 domain-containing protein -
  KFB96_RS14505 (KFB96_14505) - 3195994..3197580 (-) 1587 WP_213457770.1 DUF4129 domain-containing protein -
  KFB96_RS14510 (KFB96_14510) - 3197570..3198556 (-) 987 WP_213457769.1 stage II sporulation protein M -
  KFB96_RS14515 (KFB96_14515) - 3198553..3199251 (-) 699 WP_213457768.1 RDD family protein -
  KFB96_RS14520 (KFB96_14520) - 3199316..3199927 (-) 612 WP_213458226.1 chorismate pyruvate-lyase family protein -
  KFB96_RS14525 (KFB96_14525) - 3200365..3200565 (+) 201 WP_213457767.1 alanine-zipper protein -
  KFB96_RS14530 (KFB96_14530) - 3200544..3201509 (-) 966 WP_213457766.1 L,D-transpeptidase family protein -
  KFB96_RS14535 (KFB96_14535) - 3201543..3203591 (+) 2049 WP_213457765.1 translation factor GTPase family protein -
  KFB96_RS14540 (KFB96_14540) - 3203687..3204202 (-) 516 WP_213458225.1 histidine phosphatase family protein -
  KFB96_RS14545 (KFB96_14545) - 3204328..3205203 (-) 876 WP_213457764.1 fructosamine kinase family protein -
  KFB96_RS14550 (KFB96_14550) - 3205261..3205890 (-) 630 WP_213457763.1 HNH endonuclease -
  KFB96_RS14555 (KFB96_14555) - 3206157..3206339 (-) 183 WP_213457762.1 hypothetical protein -
  KFB96_RS14560 (KFB96_14560) - 3206497..3208776 (+) 2280 WP_213457761.1 primosomal protein N' -
  KFB96_RS14565 (KFB96_14565) - 3209403..3210101 (+) 699 WP_213465085.1 response regulator transcription factor -
  KFB96_RS14570 (KFB96_14570) - 3210351..3210824 (-) 474 WP_213457759.1 hypothetical protein -
  KFB96_RS14575 (KFB96_14575) - 3211227..3211538 (-) 312 WP_300970304.1 IS4 family transposase -
  KFB96_RS26835 - 3211870..3212991 (+) 1122 Protein_2940 recombinase family protein -
  KFB96_RS26840 - 3213845..3215041 (-) 1197 WP_213457758.1 IS4 family transposase -
  KFB96_RS14585 (KFB96_14585) - 3215038..3215880 (-) 843 WP_300970306.1 DUF4338 domain-containing protein -
  KFB96_RS14590 (KFB96_14590) - 3216018..3217463 (+) 1446 WP_213457757.1 hypothetical protein -
  KFB96_RS14595 (KFB96_14595) - 3217895..3218203 (-) 309 WP_213457756.1 helix-turn-helix domain-containing protein -
  KFB96_RS14600 (KFB96_14600) - 3218200..3218805 (-) 606 WP_213457755.1 type II toxin-antitoxin system RelE/ParE family toxin -
  KFB96_RS14605 (KFB96_14605) - 3218798..3219043 (-) 246 WP_213457754.1 type II toxin-antitoxin system ParD family antitoxin -
  KFB96_RS14610 (KFB96_14610) - 3219204..3220187 (-) 984 WP_213457753.1 DUF4351 domain-containing protein -
  KFB96_RS14615 (KFB96_14615) - 3220462..3221331 (+) 870 WP_213457752.1 Rpn family recombination-promoting nuclease/putative transposase -
  KFB96_RS14620 (KFB96_14620) - 3221348..3221803 (-) 456 WP_213457751.1 hypothetical protein -
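
In the rows above, the Size column equals end - start + 1, which is what inclusive, 1-based coordinates would give. A minimal sketch that parses the coordinate, strand and size fields from a row and checks that relation follows. The regular expression and the helper name `parse_span` are illustrative assumptions about the row layout, not a format defined by the database.

```python
import re

# Matches "start..end (strand) size" as it appears in the table rows.
SPAN = re.compile(r"(\d+)\.\.(\d+) \(([+-])\) (\d+)")


def parse_span(row):
    """Return (start, end, strand, size) parsed from one table row."""
    match = SPAN.search(row)
    if match is None:
        raise ValueError("no coordinate field found in row")
    start, end, strand, size = match.groups()
    return int(start), int(end), strand, int(size)


rows = [
    "KFB96_RS14445 (KFB96_14445) pilU 3178124..3179281 (-) 1158 WP_213457781.1",
    "KFB96_RS14450 (KFB96_14450) pilT 3179383..3180420 (-) 1038 WP_213457780.1",
]

for row in rows:
    start, end, strand, size = parse_span(row)
    # Inclusive coordinates: the span length is end - start + 1.
    print(row.split()[0], strand, end - start + 1 == size)
```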

Sequence


Protein


Length: 385 a.a.        Molecular weight: 42976.20 Da        Isoelectric point: 6.7823

>NTDB_id=560541 KFB96_RS14445 WP_213457781.1 3178124..3179281(-) (pilU) [MAG: Thiocapsa sp. isolate M50B4]
MDLNRAIEQLLSMVVERKASDLFITAGWPPSVKIDGAIYPATKHPLSAEQAHDMVLGLMNDRQREEFQATKECQFAIDRP
HLGRFRVSAFVQREATGMVLRRIETDIPTIEALKLPAILRELAMTKRGMMIFVGGTGTGKSTSLAALVGYRNRHSSGHII
TVEDPIEFVHEHHGCIITQREVGVDTESYEVALKNTLRQAPDVILIGEIRTRATMEYAIAFAETGHLVLATLHANNANQA
LDRIISFFPDDSRNQLLLDLSLNLKAVVAQQLAPRKSAQGRRAVVEILLNTPLASDLIRKGEVHKLKELMKKSNEQGMNT
FDQALFNLYQEGEISYEDALRYADSANEVRLVIKLQGGATQREATDRMIDGVSLVHDNEYSGRRP

Nucleotide


Length: 1158 bp

>NTDB_id=560541 KFB96_RS14445 WP_213457781.1 3178124..3179281(-) (pilU) [MAG: Thiocapsa sp. isolate M50B4]
ATGGACCTGAATCGTGCAATCGAGCAACTGCTGAGCATGGTGGTCGAGCGCAAGGCGTCCGATCTCTTCATCACTGCAGG
CTGGCCGCCCAGCGTCAAGATCGACGGGGCGATCTATCCCGCGACGAAGCATCCCTTGTCGGCCGAGCAGGCGCACGACA
TGGTGCTCGGTCTCATGAACGATCGCCAGCGCGAGGAGTTCCAGGCAACCAAGGAGTGTCAGTTCGCGATCGATCGCCCG
CATCTGGGGCGGTTCCGCGTCAGCGCATTCGTTCAGCGCGAGGCGACCGGGATGGTCCTGCGTCGGATCGAGACCGACAT
TCCGACCATCGAGGCACTGAAGCTTCCCGCGATCCTGCGCGAGCTGGCGATGACCAAGCGCGGCATGATGATCTTCGTCG
GCGGCACCGGGACGGGGAAATCGACCTCGCTCGCCGCGCTGGTGGGGTATCGCAACCGACACAGCTCGGGCCACATCATC
ACGGTCGAGGACCCGATCGAGTTCGTTCACGAGCATCACGGCTGCATCATCACCCAGCGCGAGGTGGGTGTGGATACCGA
GTCCTACGAGGTGGCGCTCAAGAACACCCTGCGCCAAGCCCCGGACGTTATCCTGATCGGCGAGATCCGCACGCGCGCGA
CCATGGAATACGCCATCGCCTTCGCCGAGACCGGCCATCTCGTTTTGGCGACGCTGCACGCCAACAACGCCAACCAGGCG
CTGGACCGCATCATCAGCTTCTTTCCGGACGACTCGCGCAACCAGCTCCTGCTCGATCTCTCGCTGAATCTCAAGGCCGT
CGTGGCACAACAGCTTGCCCCGCGCAAGAGCGCTCAGGGGCGTCGAGCCGTCGTCGAGATCCTGTTGAACACGCCCTTGG
CCTCCGATCTCATCCGCAAGGGCGAGGTGCACAAGCTCAAAGAGCTGATGAAGAAGTCCAACGAGCAGGGTATGAACACC
TTCGATCAGGCCCTCTTCAATCTCTATCAAGAGGGCGAGATCAGCTACGAGGATGCGCTGCGCTATGCCGACTCGGCCAA
CGAGGTGCGGCTCGTCATCAAGCTGCAGGGCGGCGCCACGCAGCGCGAGGCGACCGATCGCATGATCGACGGGGTTTCCC
TCGTGCACGACAACGAGTACAGCGGCCGCCGACCTTGA
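
The two records can be cross-checked against each other: 1158 bp is 386 codons, that is, 385 amino acids plus one stop codon. The nucleotide record is already given in coding orientation (it starts with ATG and ends with TGA) although the gene lies on the (-) strand. The sketch below translates the first and last few codons with the standard genetic code. Using the standard code is an assumption here, since the page does not state which translation table was applied.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a coding-strand DNA string; trailing partial codons are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_codons = "ATGGACCTGAATCGTGCA"  # first 18 nt of the record above
last_codons = "CGCCGACCTTGA"         # last 12 nt of the record above

print(translate(first_codons))  # MDLNRA
print(translate(last_codons))   # RRP*
print(1158 // 3 - 1)            # 385
```

Both fragments agree with the start (MDLNRA) and end (RRP) of the protein record.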


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
pilU   Pseudomonas stutzeri DSM 10701   65.517   97.922   0.642
pilU   Acinetobacter baylyi ADP1   62.155   94.026   0.584
pilU   Vibrio cholerae strain A1552   58.38   92.987   0.543
pilT   Acinetobacter baylyi ADP1   40.816   89.091   0.364
pilT   Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539   41.246   87.532   0.361
pilT   Vibrio cholerae O1 biovar El Tor strain E7946   41.867   86.234   0.361
pilT   Vibrio cholerae strain A1552   41.867   86.234   0.361
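
The Ha-values listed above are numerically consistent with the product of the identity and coverage fractions, rounded to three decimals (for example 0.65517 x 0.97922 = 0.642). That relation is an observation from these seven hits and not a definition taken from the database. The sketch below checks it, and the helper name `identity_coverage_product` is hypothetical.

```python
def identity_coverage_product(identity_pct, coverage_pct):
    """Product of identity and coverage, both given in percent, as a fraction."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


# (protein, organism, identities %, coverage %, listed Ha-value)
hits = [
    ("pilU", "Pseudomonas stutzeri DSM 10701", 65.517, 97.922, 0.642),
    ("pilU", "Acinetobacter baylyi ADP1", 62.155, 94.026, 0.584),
    ("pilU", "Vibrio cholerae strain A1552", 58.38, 92.987, 0.543),
    ("pilT", "Acinetobacter baylyi ADP1", 40.816, 89.091, 0.364),
    ("pilT", "Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539", 41.246, 87.532, 0.361),
    ("pilT", "Vibrio cholerae O1 biovar El Tor strain E7946", 41.867, 86.234, 0.361),
    ("pilT", "Vibrio cholerae strain A1552", 41.867, 86.234, 0.361),
]

for protein, organism, identity, coverage, listed in hits:
    computed = identity_coverage_product(identity, coverage)
    # Compare with a small tolerance rather than exact float equality.
    print(protein, organism, computed, abs(computed - listed) < 1e-9)
```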