Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   MQG37_RS00480 Genome accession   NZ_CP093911
Coordinates   96420..97643 (+) Length   407 a.a.
NCBI ID   WP_005547544.1    Uniprot ID   -
Organism   Aggregatibacter actinomycetemcomitans strain 16R     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 91420..102643
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  MQG37_RS00460 nanQ 92919..93389 (+) 471 WP_005547536.1 N-acetylneuraminate anomerase -
  MQG37_RS00465 ampD 93863..94417 (-) 555 WP_005567973.1 1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD -
  MQG37_RS00470 pilA 94539..94991 (+) 453 WP_005586498.1 prepilin-type N-terminal cleavage/methylation domain-containing protein Machinery gene
  MQG37_RS00475 pilB 95018..96427 (+) 1410 WP_005547542.1 GspE/PulE family protein Machinery gene
  MQG37_RS00480 pilC 96420..97643 (+) 1224 WP_005547544.1 type II secretion system F family protein Machinery gene
  MQG37_RS00485 - 97640..98329 (+) 690 WP_005547547.1 A24 family peptidase -
  MQG37_RS00490 coaE 98379..99002 (+) 624 WP_005547548.1 dephospho-CoA kinase -
  MQG37_RS00495 yacG 98992..99204 (+) 213 WP_005547550.1 DNA gyrase inhibitor YacG -
  MQG37_RS00500 - 99204..99476 (+) 273 WP_005547552.1 GNAT family N-acetyltransferase -
  MQG37_RS00505 - 99873..101246 (+) 1374 WP_005547554.1 sodium:alanine symporter family protein -
  MQG37_RS00510 raiA 101475..101798 (+) 324 WP_005540708.1 ribosome-associated translation inhibitor RaiA -
  MQG37_RS00515 - 101947..102501 (+) 555 WP_005547559.1 DUF5358 domain-containing protein -

Sequence


Protein


Download         Length: 407 a.a.        Molecular weight: 46032.52 Da        Isoelectric Point: 10.0745

>NTDB_id=665408 MQG37_RS00480 WP_005547544.1 96420..97643(+) (pilC) [Aggregatibacter actinomycetemcomitans strain 16R]
MIKLKLFQWRAVNRLQQKQKDLIVAETEAMAQQQLLSCGLQNIKLQQNWQFSHKPKNADICTLLSQLATLLQAAVPLKNS
LQILLQNCTNIALNRWLRCLLQEIESGLAFSQALEQQGLYLTYQERQLIQVGEMTGKLAAVCHEIAQHKQQALALQRKIQ
KILLYPLLVLGISLILTALLLLFIVPQFAAMYDNSNAQLPVFTQVLLALSQGLQGYWFALLMFIALTVALIRFRLKHSPW
LNRQKNRLINSIPLLNRIVQLSRLVGFSHSLFLMLQAGIPLNQALQSFLPKQQSWQTKPQQQGDWLLIAGVQSALHWIQQ
GYPFSASVSGQIFPPAAQQMLQVGEQSGQLPKMLQFIANDHQQQLDHQIDLLSQMLEPLLMVIIGGLIGLIMLGMYLPIF
NMGSLVQ

Nucleotide


Download         Length: 1224 bp        

>NTDB_id=665408 MQG37_RS00480 WP_005547544.1 96420..97643(+) (pilC) [Aggregatibacter actinomycetemcomitans strain 16R]
ATGATTAAATTGAAACTGTTTCAATGGCGCGCCGTTAATCGACTACAGCAAAAACAAAAAGACTTAATCGTGGCGGAAAC
GGAGGCGATGGCACAACAACAGCTGCTTTCATGCGGGTTGCAAAACATCAAATTACAACAAAACTGGCAATTTAGCCACA
AACCGAAAAATGCCGACATCTGCACGCTGCTTTCCCAACTGGCGACGCTGTTACAAGCCGCCGTGCCGCTGAAAAACAGC
CTGCAAATTTTATTGCAAAATTGTACCAATATTGCGTTGAATCGCTGGCTACGCTGCTTGCTACAGGAGATTGAAAGCGG
CTTAGCATTTTCTCAAGCATTGGAACAACAAGGTTTATACTTAACCTATCAGGAACGCCAACTGATTCAGGTGGGTGAAA
TGACAGGCAAACTTGCCGCCGTTTGTCATGAAATCGCACAACATAAACAACAGGCGCTGGCGTTGCAGCGTAAAATTCAG
AAAATCCTGCTTTATCCGTTGCTGGTGCTCGGAATCTCATTGATTTTGACCGCACTTTTATTGCTGTTCATCGTGCCGCA
ATTTGCCGCCATGTATGACAACAGCAACGCGCAACTGCCCGTGTTCACGCAGGTTTTACTAGCCCTTTCCCAAGGCTTGC
AGGGTTATTGGTTCGCGCTGCTCATGTTTATTGCGCTGACCGTGGCGTTAATTCGCTTTCGCCTCAAACATTCGCCTTGG
CTTAACCGACAAAAAAACCGGCTGATTAACAGTATTCCGCTACTTAATCGTATCGTACAACTCTCCCGTTTAGTGGGATT
CAGTCACAGCTTGTTTTTGATGTTACAGGCAGGTATTCCGCTGAATCAGGCGCTACAATCTTTCCTACCGAAACAACAAA
GCTGGCAAACTAAACCGCAGCAGCAAGGGGATTGGCTGCTTATTGCGGGAGTGCAATCGGCACTGCATTGGATTCAACAA
GGCTATCCGTTCTCCGCCAGCGTAAGCGGACAAATCTTCCCGCCGGCGGCACAACAAATGCTGCAAGTCGGCGAACAAAG
CGGTCAATTGCCGAAAATGCTGCAATTTATCGCCAACGATCATCAGCAACAATTGGATCACCAAATCGATCTACTGTCAC
AAATGCTGGAACCGTTGTTGATGGTAATTATCGGCGGGCTGATCGGCTTGATTATGCTCGGTATGTATTTACCGATTTTC
AACATGGGCTCGCTGGTACAATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Haemophilus influenzae Rd KW20

63.027

99.017

0.624

  pilC Haemophilus influenzae 86-028NP

62.283

99.017

0.617

  pilC Glaesserella parasuis strain SC1401

39.152

98.526

0.386