Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilB   Type   Machinery gene
Locus tag   EL144_RS10935 Genome accession   NZ_LR134327
Coordinates   2276428..2277837 (+) Length   469 a.a.
NCBI ID   WP_005704570.1    Uniprot ID   A0A3S4QTD0
Organism   Aggregatibacter aphrophilus ATCC 33389 strain NCTC 5906     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2271428..2282837
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EL144_RS10915 (NCTC5906_02195) pnp 2271932..2274061 (-) 2130 WP_032995329.1 polyribonucleotide nucleotidyltransferase -
  EL144_RS10920 (NCTC5906_02196) nanQ 2274331..2274801 (+) 471 WP_005704573.1 N-acetylneuraminate anomerase -
  EL144_RS10925 (NCTC5906_02197) ampD 2275274..2275828 (-) 555 WP_032995339.1 1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD -
  EL144_RS10930 (NCTC5906_02198) pilA 2275953..2276402 (+) 450 WP_005704571.1 prepilin-type N-terminal cleavage/methylation domain-containing protein Machinery gene
  EL144_RS10935 (NCTC5906_02199) pilB 2276428..2277837 (+) 1410 WP_005704570.1 GspE/PulE family protein Machinery gene
  EL144_RS10940 (NCTC5906_02200) pilC 2277830..2279068 (+) 1239 WP_005704569.1 type II secretion system F family protein Machinery gene
  EL144_RS10945 (NCTC5906_02201) - 2279065..2279757 (+) 693 WP_032995328.1 A24 family peptidase -
  EL144_RS10950 (NCTC5906_02202) coaE 2279810..2280430 (+) 621 WP_005704567.1 dephospho-CoA kinase -
  EL144_RS10955 (NCTC5906_02203) yacG 2280423..2280635 (+) 213 WP_005701268.1 DNA gyrase inhibitor YacG -
  EL144_RS10960 (NCTC5906_02204) - 2280635..2280901 (+) 267 WP_005704566.1 GNAT family N-acetyltransferase -
  EL144_RS10965 (NCTC5906_02206) - 2281269..2282642 (+) 1374 WP_032995327.1 sodium:alanine symporter family protein -

Sequence


Protein


Download         Length: 469 a.a.        Molecular weight: 52968.29 Da        Isoelectric Point: 5.8266

>NTDB_id=1121614 EL144_RS10935 WP_005704570.1 2276428..2277837(+) (pilB) [Aggregatibacter aphrophilus ATCC 33389 strain NCTC 5906]
MQHSMEQPYSICSQQGDIFTITPELWQQNQQQQTVLLRYLALPLKEEPQKLWLGLDSLTNLAACEAFSFLTGKNIEPVLI
ESAVLKTALQDLAPHREKLDENQPLFYSVTPQEQQEQSSDEPTIQLLNQIFENATTKKASDIHLEPQADFLQVRFRIDGV
LQVQNTIAQTLANRLISRLKLLAKLDISETRLPQDGRFQFKTTFSDILDFRLSTLATHFGEKAVLRLQQNRPVQLAFSEL
GMTEQQQQTFRHALSQPQGLILVTGPTGSGKSISLYTALQWLNTPDKHIMTAEDPIEIQLPGIIQSQVNPQIGLDFSRLL
RTFLRQDPDIIMLGEIRDEESAAMALRAAQTGHLVLSTLHTNDAASAISRLQQLGIQQHEIDSSLLLVIAQRLVRKRCQK
CGENSTRFCDCHQGYKGRIGVYQFLQPDVSQNQSYQLDFQNLYASALEKVKSQVTDLAEVQRVLGQPNA

Nucleotide


Download         Length: 1410 bp        

>NTDB_id=1121614 EL144_RS10935 WP_005704570.1 2276428..2277837(+) (pilB) [Aggregatibacter aphrophilus ATCC 33389 strain NCTC 5906]
ATGCAACATTCGATGGAACAACCTTATTCTATCTGTAGCCAACAGGGTGACATATTTACTATTACGCCCGAGTTATGGCA
ACAAAACCAACAGCAGCAAACCGTCCTTTTGCGTTATTTGGCGCTACCATTAAAAGAAGAACCACAAAAATTATGGCTGG
GATTGGATTCCTTAACCAACCTTGCCGCTTGTGAAGCGTTTTCCTTTTTAACAGGGAAAAATATCGAGCCGGTACTCATT
GAAAGTGCCGTGTTAAAAACGGCACTGCAAGATTTGGCGCCACATCGAGAAAAACTCGATGAAAATCAACCGCTCTTTTA
TTCCGTCACGCCTCAGGAGCAACAAGAACAATCCTCCGATGAGCCGACGATTCAATTGCTTAATCAAATTTTTGAAAACG
CCACAACAAAAAAAGCCTCCGATATTCATTTAGAACCGCAAGCGGATTTTTTACAAGTCCGTTTTCGTATTGACGGCGTA
TTACAAGTGCAAAACACCATTGCGCAGACACTCGCCAATCGACTGATTTCCCGTCTGAAATTACTGGCTAAATTAGACAT
CAGCGAAACCCGCCTGCCGCAAGACGGACGGTTTCAGTTTAAAACGACGTTTTCCGATATTTTGGATTTTCGCTTATCCA
CACTCGCTACGCATTTTGGTGAAAAAGCCGTATTGCGTTTACAACAGAACCGCCCCGTGCAACTGGCTTTTAGCGAACTG
GGCATGACGGAACAACAGCAACAAACATTTCGTCATGCGCTTAGCCAACCGCAAGGGTTAATTTTAGTCACCGGCCCCAC
GGGAAGCGGTAAAAGTATTTCTTTATATACGGCATTACAATGGCTCAATACCCCCGACAAGCATATTATGACGGCAGAAG
ATCCCATTGAAATTCAATTGCCGGGCATTATACAAAGCCAAGTGAATCCTCAAATCGGCTTGGATTTCAGTCGTTTGTTA
CGCACTTTTTTGCGTCAAGACCCCGACATTATTATGCTGGGTGAAATTCGTGACGAAGAAAGTGCCGCTATGGCCTTACG
GGCGGCACAAACCGGGCATCTGGTATTATCGACGTTACACACCAATGACGCTGCCTCAGCCATCTCACGCCTACAACAGC
TTGGCATTCAACAACACGAAATCGACAGCAGTTTATTATTGGTTATCGCCCAACGCTTGGTGCGTAAACGATGCCAAAAG
TGCGGTGAAAATTCCACGCGTTTTTGCGATTGTCATCAAGGTTACAAAGGTCGAATCGGTGTTTATCAATTCCTTCAGCC
TGATGTGTCACAAAACCAAAGTTATCAATTAGATTTTCAGAATTTATATGCCAGCGCCTTAGAAAAAGTGAAGTCGCAAG
TGACGGATTTAGCAGAAGTGCAACGGGTATTGGGGCAACCGAATGCGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A3S4QTD0

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilB Haemophilus influenzae 86-028NP

73.42

97.868

0.719

  pilB Haemophilus influenzae Rd KW20

72.985

97.868

0.714

  pilB Glaesserella parasuis strain SC1401

58.568

98.294

0.576

  pilB Vibrio parahaemolyticus RIMD 2210633

42.38

100

0.433

  pilB Vibrio campbellii strain DS40M4

41.545

100

0.424

  pilB Acinetobacter baumannii D1279779

41.286

100

0.424

  pilB Vibrio cholerae strain A1552

41.25

100

0.422

  pilB Legionella pneumophila strain ERS1305867

38.323

100

0.409

  pilB Acinetobacter baylyi ADP1

38.431

100

0.407

  pilB Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539

40.471

90.618

0.367


Multiple sequence alignment