Detailed information    

in silico: bioinformatically predicted

Overview


Name: pilC
Type: Machinery gene
Locus tag: DQL22_RS04660
Genome accession: NZ_LS483485
Coordinates: 910113..911351 (-)
Length: 412 a.a.
NCBI ID: WP_111301450.1
Uniprot ID: -
Organism: Aggregatibacter aphrophilus strain NCTC11096
Function: type IV pilus biogenesis and function (predicted from homology); DNA binding and uptake
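
The coordinates and the protein length given above agree with each other, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the span includes a single stop codon; the function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for a coding sequence.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    # One codon is the stop codon and does not encode an amino acid.
    return nt_len, nt_len // 3 - 1


nt_len, aa_len = cds_lengths(910113, 911351)
print(nt_len, aa_len)  # 1239 412
```

Under these assumptions the span is 1239 bp, that is 413 codons, of which 412 encode amino acids.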

Genomic Context


Location: 905113..916351
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
  DQL22_RS04630 (NCTC11096_00930) | raiA | 905989..906312 (-) | 324 | WP_109083655.1 | ribosome-associated translation inhibitor RaiA | -
  DQL22_RS04635 (NCTC11096_00931) | - | 906539..907912 (-) | 1374 | WP_111301454.1 | alanine/glycine:cation symporter family protein | -
  DQL22_RS04640 (NCTC11096_00933) | - | 908280..908546 (-) | 267 | WP_111301453.1 | GNAT family N-acetyltransferase | -
  DQL22_RS04645 (NCTC11096_00934) | yacG | 908546..908758 (-) | 213 | WP_005701268.1 | DNA gyrase inhibitor YacG | -
  DQL22_RS04650 (NCTC11096_00935) | coaE | 908751..909371 (-) | 621 | WP_111301452.1 | dephospho-CoA kinase | -
  DQL22_RS04655 (NCTC11096_00936) | - | 909424..910116 (-) | 693 | WP_111301451.1 | prepilin peptidase | -
  DQL22_RS04660 (NCTC11096_00937) | pilC | 910113..911351 (-) | 1239 | WP_111301450.1 | type II secretion system F family protein | Machinery gene
  DQL22_RS04665 (NCTC11096_00938) | pilB | 911344..912753 (-) | 1410 | WP_111711225.1 | GspE/PulE family protein | Machinery gene
  DQL22_RS04670 (NCTC11096_00939) | pilA | 912779..913228 (-) | 450 | WP_111301448.1 | pilin | Machinery gene
  DQL22_RS04675 (NCTC11096_00940) | ampD | 913354..913908 (+) | 555 | WP_111301458.1 | 1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD | -
  DQL22_RS04680 (NCTC11096_00941) | nanQ | 914382..914852 (-) | 471 | WP_083014717.1 | N-acetylneuraminate anomerase | -
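
Two things can be read off the coordinates in this table: adjacent genes in the pilus cluster overlap by a few base pairs, and the displayed location matches the pilC span extended by 5000 bp on each side. The sketch below checks both. The 5000 bp flank is inferred from the numbers rather than stated in the record, and the helper names are hypothetical.

```python
# (label, start, end) with 1-based inclusive coordinates from the table above.
genes = [
    ("prepilin peptidase", 909424, 910116),
    ("pilC", 910113, 911351),
    ("pilB", 911344, 912753),
    ("pilA", 912779, 913228),
]


def overlap_bp(a: tuple, b: tuple) -> int:
    """Number of positions shared by two inclusive intervals (0 if disjoint)."""
    return max(0, min(a[2], b[2]) - max(a[1], b[1]) + 1)


def flanked_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend an interval by `flank` bp on both sides (assumed window rule)."""
    return start - flank, end + flank


# Compare each gene with its downstream neighbour in the list.
for left, right in zip(genes, genes[1:]):
    print(f"{left[0]} / {right[0]}: {overlap_bp(left, right)} bp overlap")

print(flanked_window(910113, 911351))  # (905113, 916351)
```

With these coordinates, pilC shares 4 bp with the prepilin peptidase gene and 8 bp with pilB, while pilB and pilA do not overlap.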

Sequence


Protein


Length: 412 a.a.        Molecular weight: 46733.16 Da        Isoelectric Point: 10.0112

>NTDB_id=1142595 DQL22_RS04660 WP_111301450.1 910113..911351(-) (pilC) [Aggregatibacter aphrophilus strain NCTC11096]
MCKLKLFNWHGVNRLQQKQEGTIVAESAVIAQQQLMSRGLQHIKLQQNWQLNSKPKNAEVCSLLSQLATLLQAAVPLKNS
LQILLQHCTNIALNAWLRQLLKDIESGLAFSQALEKQNVEKQNQYLTYQDRQLIKVGEMTGKLPTVCHEIAQHKQQALAL
QRKIRKILLYPVLVLGISLILTALLLLFIVPQFATMYDNSSAQLPTFTQVLLTLSQGLQDYWLHLLICMALTTLFIRARL
KHSPWFNRQKSRLINAMPVLNRIVQLSRLVGFSRSLFLMLQAGVPLNQALQSFLPQNPSWQRSPNVQGDWLLIEEVQSIL
HWLQQGYAFSASVSGHIFPLAAQQMLQVGEQSGQLPKMLQFIANDHQQQLDYQIDLLSQMLEPLLMVIIGGLIGLIMLGM
YLPIFNMGALVQ
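
The FASTA header of this record packs several fields into one line. As a minimal sketch, assuming other records follow the same layout as the header shown here, the fields could be pulled apart with a regular expression. The pattern and the field names in the returned dictionary are illustrative choices, not a documented format.

```python
import re

# Layout assumed from the single header shown in this record.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>\S+)\) \[(?P<organism>.+)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; raise if the layout differs."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the assumed layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=1142595 DQL22_RS04660 WP_111301450.1 910113..911351(-) "
    "(pilC) [Aggregatibacter aphrophilus strain NCTC11096]"
)
record = parse_header(header)
print(record["gene"], record["strand"], record["end"] - record["start"] + 1)
```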

Nucleotide


Length: 1239 bp

>NTDB_id=1142595 DQL22_RS04660 WP_111301450.1 910113..911351(-) (pilC) [Aggregatibacter aphrophilus strain NCTC11096]
ATGTGTAAACTGAAATTATTTAACTGGCACGGGGTTAACCGTTTACAGCAAAAGCAGGAAGGCACTATTGTGGCGGAAAG
CGCCGTGATAGCACAACAACAGCTGATGTCACGCGGTTTACAACACATTAAACTGCAACAAAATTGGCAATTAAACAGCA
AACCTAAAAACGCGGAAGTCTGTTCGTTGCTTTCTCAGTTGGCGACCTTATTACAAGCGGCGGTTCCGTTAAAAAATAGT
CTGCAAATATTGTTACAACATTGCACCAATATTGCGCTAAACGCTTGGTTGCGCCAACTTTTAAAGGATATTGAAAGCGG
TTTGGCATTTTCCCAAGCCTTAGAAAAACAAAACGTAGAAAAACAAAATCAATATTTGACCTATCAAGATCGCCAATTGA
TTAAAGTCGGTGAAATGACGGGCAAGTTACCCACCGTATGCCATGAAATCGCGCAACACAAACAGCAGGCATTAGCGTTG
CAACGCAAGATTCGAAAAATTCTGCTTTACCCGGTGTTGGTGCTTGGCATATCGCTAATTTTAACCGCACTTTTACTGCT
GTTTATCGTGCCGCAATTCGCCACGATGTATGACAATAGTAGCGCGCAACTCCCGACTTTCACCCAAGTGCTACTCACAC
TCTCGCAAGGATTACAGGATTATTGGCTGCATCTGTTGATTTGTATGGCATTGACTACCCTATTTATTCGCGCCCGCCTA
AAGCACTCCCCTTGGTTTAATCGGCAAAAAAGTCGATTAATCAATGCCATGCCTGTACTGAATCGCATCGTGCAGCTTTC
CCGTTTGGTGGGATTTAGCCGCAGTTTATTTTTAATGTTACAGGCGGGCGTGCCGCTGAATCAGGCTTTGCAGTCGTTTT
TACCGCAAAATCCAAGTTGGCAAAGATCGCCAAACGTGCAAGGCGATTGGCTGTTAATCGAAGAAGTACAATCCATTTTA
CACTGGCTACAACAAGGTTATGCCTTCTCCGCCAGTGTGAGCGGTCATATTTTTCCGTTGGCGGCACAACAAATGCTACA
GGTGGGCGAACAAAGTGGTCAGCTCCCTAAAATGTTGCAATTTATCGCCAACGATCATCAGCAACAATTGGACTATCAGA
TCGATCTGCTGTCACAAATGCTGGAACCGCTGTTGATGGTCATTATCGGTGGGCTTATCGGCTTAATTATGCTCGGAATG
TATTTGCCGATTTTCAATATGGGCGCGTTAGTGCAATGA
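
Although the gene lies on the minus strand, the nucleotide sequence above is listed in coding orientation: it begins with ATG and ends with the stop codon TGA. The sketch below translates short stretches of it to show the correspondence with the protein sequence. It assumes the standard codon assignments, which the bacterial translation table shares for these codons; `translate` is a hypothetical helper, not a tool used by the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons and last three codons of the sequence above.
print(translate("ATGTGTAAACTGAAATTATTTAACTGG"))  # MCKLKLFNW
print(translate("GTGCAATGA"))  # VQ
```

The two outputs match the start (MCKLKLFNW) and the end (VQ) of the protein sequence listed in this record.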


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
  pilC | Haemophilus influenzae Rd KW20 | 59.804 | 99.029 | 0.592
  pilC | Haemophilus influenzae 86-028NP | 59.559 | 99.029 | 0.59
  pilC | Glaesserella parasuis strain SC1401 | 40.394 | 98.544 | 0.398
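
The record does not define the Ha-value, but the three values listed are numerically consistent with the product of the identity and coverage fractions. The sketch below reproduces them under that assumption. The formula is inferred from the numbers and should not be read as the database's documented definition.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Product of identity and coverage as fractions (assumed formula)."""
    return (identity_pct / 100) * (coverage_pct / 100)


# (organism, identities %, coverage %) as listed in the table above.
hits = [
    ("Haemophilus influenzae Rd KW20", 59.804, 99.029),
    ("Haemophilus influenzae 86-028NP", 59.559, 99.029),
    ("Glaesserella parasuis strain SC1401", 40.394, 98.544),
]

for organism, identity, coverage in hits:
    print(f"{organism}: {ha_value(identity, coverage):.3f}")
```

Rounded to three decimals, this gives 0.592, 0.590 and 0.398, matching the listed values.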


Multiple sequence alignment