Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   EL144_RS10940 Genome accession   NZ_LR134327
Coordinates   2277830..2279068 (+) Length   412 a.a.
NCBI ID   WP_005704569.1    Uniprot ID   A0A448FBN8
Organism   Aggregatibacter aphrophilus ATCC 33389 strain NCTC 5906     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2272830..2284068
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EL144_RS10920 (NCTC5906_02196) nanQ 2274331..2274801 (+) 471 WP_005704573.1 N-acetylneuraminate anomerase -
  EL144_RS10925 (NCTC5906_02197) ampD 2275274..2275828 (-) 555 WP_032995339.1 1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD -
  EL144_RS10930 (NCTC5906_02198) pilA 2275953..2276402 (+) 450 WP_005704571.1 prepilin-type N-terminal cleavage/methylation domain-containing protein Machinery gene
  EL144_RS10935 (NCTC5906_02199) pilB 2276428..2277837 (+) 1410 WP_005704570.1 GspE/PulE family protein Machinery gene
  EL144_RS10940 (NCTC5906_02200) pilC 2277830..2279068 (+) 1239 WP_005704569.1 type II secretion system F family protein Machinery gene
  EL144_RS10945 (NCTC5906_02201) - 2279065..2279757 (+) 693 WP_032995328.1 A24 family peptidase -
  EL144_RS10950 (NCTC5906_02202) coaE 2279810..2280430 (+) 621 WP_005704567.1 dephospho-CoA kinase -
  EL144_RS10955 (NCTC5906_02203) yacG 2280423..2280635 (+) 213 WP_005701268.1 DNA gyrase inhibitor YacG -
  EL144_RS10960 (NCTC5906_02204) - 2280635..2280901 (+) 267 WP_005704566.1 GNAT family N-acetyltransferase -
  EL144_RS10965 (NCTC5906_02206) - 2281269..2282642 (+) 1374 WP_032995327.1 sodium:alanine symporter family protein -
  EL144_RS10970 (NCTC5906_02207) raiA 2282869..2283192 (+) 324 WP_005704563.1 ribosome-associated translation inhibitor RaiA -

Sequence


Protein


Download         Length: 412 a.a.        Molecular weight: 46748.29 Da        Isoelectric Point: 10.3395

>NTDB_id=1121615 EL144_RS10940 WP_005704569.1 2277830..2279068(+) (pilC) [Aggregatibacter aphrophilus ATCC 33389 strain NCTC 5906]
MRKLKLFNWHGVNRLQQKQKGTIVAESPVIAQQQLMSRGLQHIKLQQNWQLNSKPKNAEVCALLSQLATLLQAAVPLKNS
LQILLQHCTNIALNAWLRQLLKDIESGLAFSQALEKQNVEKQNQYLTYQDRQLLKVGEMTGKLPTVCHEIAQHKQQALAL
QRKIQKILLYPVLVLGISLILTALLLLFIVPQFAAMYDNSSAQLPTFTQVLLTLSQGLQDYWLHLLICMALTTLFIRARL
KHSPWFNRQKIRLINAMPVVNRIVQLSRLVGFSRSLFLMLQAGVPLNQALQSFLPQNPSWQRSPNVQGNWLLIEEVQSIL
HWLQQGYAFSASVSGHIFPLAAQQMLQVGEQSGQLPKMLQFIANDHQQQLDYQIDLLSQMLEPLLMVIIGGLIGLIMLGM
YLPIFNMGALVQ

Nucleotide


Download         Length: 1239 bp        

>NTDB_id=1121615 EL144_RS10940 WP_005704569.1 2277830..2279068(+) (pilC) [Aggregatibacter aphrophilus ATCC 33389 strain NCTC 5906]
ATGCGTAAACTGAAATTATTTAACTGGCACGGGGTTAACCGTTTACAGCAAAAGCAGAAAGGCACTATCGTGGCGGAAAG
CCCCGTGATAGCACAACAACAGCTGATGTCACGCGGTTTACAACATATTAAACTGCAACAAAATTGGCAATTAAACAGCA
AGCCTAAAAACGCCGAAGTCTGTGCGTTGCTTTCTCAGTTGGCGACCTTATTACAAGCGGCGGTTCCGTTAAAAAATAGT
CTGCAAATTTTATTACAACATTGCACCAATATTGCGCTAAACGCTTGGTTGCGCCAACTTTTAAAGGATATTGAAAGCGG
TTTGGCATTTTCCCAAGCCTTAGAAAAACAAAACGTAGAAAAACAAAATCAATATTTGACCTATCAAGATCGCCAATTGC
TTAAAGTCGGTGAAATGACGGGCAAGTTACCCACCGTATGCCATGAAATCGCGCAACACAAACAGCAGGCATTGGCGTTG
CAACGCAAGATTCAAAAAATTCTGCTTTACCCAGTGTTGGTACTTGGCATATCGCTGATTTTAACCGCACTTTTACTGCT
GTTTATCGTGCCGCAATTCGCCGCGATGTATGACAATAGCAGCGCGCAACTCCCGACTTTCACACAGGTGCTACTCACAC
TGTCGCAAGGATTACAGGATTATTGGCTGCATCTGTTGATTTGTATGGCATTGACTACTCTATTTATTCGCGCCCGCCTA
AAGCACTCCCCTTGGTTTAATCGGCAAAAAATTCGATTAATCAATGCCATGCCTGTAGTGAATCGCATCGTGCAACTTTC
CCGTTTGGTGGGATTTAGCCGCAGTTTATTTTTAATGTTACAGGCGGGCGTGCCGCTGAATCAGGCTTTGCAGTCGTTTT
TACCACAAAATCCAAGTTGGCAAAGATCGCCAAACGTGCAAGGCAATTGGCTGTTAATCGAAGAAGTGCAATCCATTTTA
CACTGGCTACAACAAGGTTATGCCTTCTCCGCCAGTGTGAGCGGTCATATTTTTCCGTTGGCGGCACAACAAATGCTACA
GGTGGGCGAACAAAGTGGTCAGCTCCCTAAAATGTTGCAATTTATTGCCAACGACCATCAACAACAATTGGACTATCAGA
TCGATCTGTTGTCACAAATGCTGGAGCCGCTATTGATGGTCATTATCGGTGGGCTTATCGGCTTAATTATGCTCGGAATG
TATTTGCCGATTTTCAATATGGGCGCGTTGGTACAATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A448FBN8

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Haemophilus influenzae Rd KW20

59.804

99.029

0.592

  pilC Haemophilus influenzae 86-028NP

59.559

99.029

0.59

  pilC Glaesserella parasuis strain SC1401

39.32

100

0.393


Multiple sequence alignment