Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilB   Type   Machinery gene
Locus tag   DQL22_RS04665 Genome accession   NZ_LS483485
Coordinates   911344..912753 (-) Length   469 a.a.
NCBI ID   WP_111711225.1    Uniprot ID   -
Organism   Aggregatibacter aphrophilus strain NCTC11096     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 906344..917753
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DQL22_RS04635 (NCTC11096_00931) - 906539..907912 (-) 1374 WP_111301454.1 alanine/glycine:cation symporter family protein -
  DQL22_RS04640 (NCTC11096_00933) - 908280..908546 (-) 267 WP_111301453.1 GNAT family N-acetyltransferase -
  DQL22_RS04645 (NCTC11096_00934) yacG 908546..908758 (-) 213 WP_005701268.1 DNA gyrase inhibitor YacG -
  DQL22_RS04650 (NCTC11096_00935) coaE 908751..909371 (-) 621 WP_111301452.1 dephospho-CoA kinase -
  DQL22_RS04655 (NCTC11096_00936) - 909424..910116 (-) 693 WP_111301451.1 prepilin peptidase -
  DQL22_RS04660 (NCTC11096_00937) pilC 910113..911351 (-) 1239 WP_111301450.1 type II secretion system F family protein Machinery gene
  DQL22_RS04665 (NCTC11096_00938) pilB 911344..912753 (-) 1410 WP_111711225.1 GspE/PulE family protein Machinery gene
  DQL22_RS04670 (NCTC11096_00939) pilA 912779..913228 (-) 450 WP_111301448.1 pilin Machinery gene
  DQL22_RS04675 (NCTC11096_00940) ampD 913354..913908 (+) 555 WP_111301458.1 1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD -
  DQL22_RS04680 (NCTC11096_00941) nanQ 914382..914852 (-) 471 WP_083014717.1 N-acetylneuraminate anomerase -
  DQL22_RS04685 (NCTC11096_00942) pnp 915122..917251 (+) 2130 WP_111301447.1 polyribonucleotide nucleotidyltransferase -

Sequence


Protein


Download         Length: 469 a.a.        Molecular weight: 52937.37 Da        Isoelectric Point: 6.3454

>NTDB_id=1142596 DQL22_RS04665 WP_111711225.1 911344..912753(-) (pilB) [Aggregatibacter aphrophilus strain NCTC11096]
MQHSMEQPYSICSQQGDIFTITPELWQRNQQQQTVLLRYLALPLKEEPQKLWLGLDSLTNLAACEAFSFLTGKNIEPVLI
ESAVLKTALQDLAPHQEKLDENQPLFYSVTPQEQQKQSSDEPTIQLLNQIFENATTKKASDIHLEPQADFLQVRFRIDGV
LQVQNTIAQTLANRLISRLKLLAKLDISETRLPQDGRFQFKTTFSDILDFRLSTLATHFGEKAVLRLQQNRPVQLAFSEL
GMTEQQQQTFRHALSQPQGLILVTGPTGSGKSISLYTALQWLNTPDKHIMTAEDPIEIQLPGIIQSQVNPQIGLDFSRLL
RTFLRQDPDIIMLGEIRDEESAAMALRAAQTGHLVLSTLHTNDAASAISRLQQLSIQQHEIDSSLLLVIAQRLVRKRCQK
CGGNATRFCDCHQGYKGRIGVYQFLQPDVSQNQSYQLDFQNLYASALEKVKSQVTDLAEVQRVLGQPNV

Nucleotide


Download         Length: 1410 bp        

>NTDB_id=1142596 DQL22_RS04665 WP_111711225.1 911344..912753(-) (pilB) [Aggregatibacter aphrophilus strain NCTC11096]
ATGCAACATTCGATGGAACAACCTTATTCTATCTGTAGCCAACAGGGCGACATATTTACGATTACGCCCGAGTTATGGCA
ACGAAACCAACAGCAGCAAACCGTCCTTTTGCGTTATTTGGCGCTACCATTAAAAGAAGAACCACAAAAATTATGGCTGG
GATTGGATTCCTTAACCAACCTTGCCGCTTGTGAAGCGTTTTCCTTTTTAACGGGGAAAAATATCGAGCCGGTACTCATT
GAAAGTGCCGTGTTAAAAACGGCACTGCAAGATTTGGCGCCACATCAAGAAAAACTCGATGAAAATCAACCGCTCTTTTA
TTCCGTCACGCCTCAGGAGCAACAAAAACAATCCTCTGATGAGCCGACGATTCAATTGCTTAACCAAATTTTTGAAAACG
CTACAACAAAAAAAGCCTCCGATATTCATTTAGAACCGCAAGCGGATTTTTTACAAGTCCGTTTTCGTATTGACGGCGTA
TTACAAGTGCAAAACACCATTGCGCAGACACTCGCCAATCGACTGATTTCCCGTCTGAAATTACTGGCCAAATTAGACAT
CAGCGAAACCCGCCTGCCGCAAGACGGACGGTTTCAGTTTAAAACGACGTTTTCCGATATTTTGGATTTTCGACTATCCA
CACTTGCGACGCATTTCGGTGAAAAAGCCGTATTGCGTTTACAACAAAACCGCCCCGTGCAACTGGCCTTTAGCGAACTA
GGTATGACGGAACAACAGCAACAAACATTTCGTCATGCGCTCAGCCAACCGCAAGGGTTAATTTTAGTCACCGGCCCCAC
GGGAAGTGGAAAAAGTATTTCTTTATATACGGCATTACAATGGCTCAATACCCCCGACAAGCATATTATGACGGCGGAAG
ATCCCATTGAAATTCAATTACCGGGCATTATCCAAAGCCAAGTGAATCCTCAAATCGGCTTGGATTTCAGTCGTTTGTTA
CGCACCTTTTTGCGTCAAGACCCCGACATTATTATGTTGGGTGAAATTCGTGACGAAGAAAGTGCCGCCATGGCCTTACG
GGCGGCGCAAACCGGGCATTTGGTATTATCAACGTTACATACCAATGACGCCGCCTCAGCCATCTCACGCCTGCAACAAC
TTAGCATTCAACAACACGAAATCGACAGCAGTTTATTATTGGTTATCGCCCAACGTTTGGTGCGTAAACGATGCCAAAAG
TGCGGTGGAAATGCCACGCGTTTTTGCGATTGTCATCAAGGCTACAAAGGTCGAATCGGTGTTTATCAATTCCTTCAGCC
TGATGTGTCACAAAACCAAAGTTATCAATTAGATTTTCAGAATTTATATGCCAGCGCATTAGAAAAAGTGAAGTCGCAAG
TGACGGATTTAGCAGAAGTGCAACGGGTATTGGGGCAACCGAATGTGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilB Haemophilus influenzae 86-028NP

73.638

97.868

0.721

  pilB Haemophilus influenzae Rd KW20

73.203

97.868

0.716

  pilB Glaesserella parasuis strain SC1401

58.785

98.294

0.578

  pilB Vibrio campbellii strain DS40M4

41.962

100

0.429

  pilB Vibrio parahaemolyticus RIMD 2210633

41.962

100

0.429

  pilB Vibrio cholerae strain A1552

41.458

100

0.424

  pilB Acinetobacter baumannii D1279779

40.249

100

0.414

  pilB Acinetobacter baylyi ADP1

38.632

100

0.409

  pilB Legionella pneumophila strain ERS1305867

38.323

100

0.409

  pilB Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539

40

90.618

0.362


Multiple sequence alignment