Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilA   Type   Machinery gene
Locus tag   CEP49_RS04650 Genome accession   NZ_CP022012
Coordinates   1047284..1047751 (-) Length   155 a.a.
NCBI ID   WP_265473583.1    Uniprot ID   -
Organism   Mergibacter septicus strain 16309-1-I1     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1042284..1052751
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CEP49_RS04620 (CEP49_04595) - 1042684..1042956 (-) 273 WP_265475097.1 GNAT family N-acetyltransferase -
  CEP49_RS04625 (CEP49_04600) yacG 1042996..1043202 (-) 207 WP_265473578.1 DNA gyrase inhibitor YacG -
  CEP49_RS04630 (CEP49_04605) coaE 1043204..1043833 (-) 630 WP_265473579.1 dephospho-CoA kinase -
  CEP49_RS04635 (CEP49_04610) - 1043872..1044561 (-) 690 WP_265473580.1 prepilin peptidase -
  CEP49_RS04640 (CEP49_04615) - 1044603..1045829 (-) 1227 WP_265473581.1 type II secretion system F family protein -
  CEP49_RS04645 (CEP49_04620) pilB 1045844..1047262 (-) 1419 WP_265473582.1 GspE/PulE family protein Machinery gene
  CEP49_RS04650 (CEP49_04625) pilA 1047284..1047751 (-) 468 WP_265473583.1 prepilin-type N-terminal cleavage/methylation domain-containing protein Machinery gene
  CEP49_RS04655 (CEP49_04630) ampD 1047879..1048478 (+) 600 WP_409240660.1 1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD -
  CEP49_RS04660 (CEP49_04635) - 1048643..1049710 (+) 1068 WP_265473584.1 LysM-like peptidoglycan-binding domain-containing protein -
  CEP49_RS04665 (CEP49_04640) cgtA 1049774..1050934 (-) 1161 WP_265473585.1 Obg family GTPase CgtA -
  CEP49_RS04670 (CEP49_04645) - 1050978..1051910 (-) 933 WP_265473586.1 DMT family transporter -
  CEP49_RS04675 (CEP49_04650) rpmA 1052082..1052339 (-) 258 WP_261920339.1 50S ribosomal protein L27 -
  CEP49_RS04680 (CEP49_04655) rplU 1052360..1052671 (-) 312 WP_261920338.1 50S ribosomal protein L21 -

Sequence


Protein


Download         Length: 155 a.a.        Molecular weight: 16230.68 Da        Isoelectric Point: 7.5827

>NTDB_id=235867 CEP49_RS04650 WP_265473583.1 1047284..1047751(-) (pilA) [Mergibacter septicus strain 16309-1-I1]
MKRSTLTQAALSRVGLSQGFTLIELMIVIAIIAVLATVAIPSYQNYTKKAAMSELIQAASPYKAEVELCIYNSGSKTNCS
AGQNGIQAAVTESEKVKYLKTVGVNAGVISVTGKGTLENIGYTMTPNGGENGEVIDWAVNCTGDKGIFPVGFCAN

Nucleotide


Download         Length: 468 bp        

>NTDB_id=235867 CEP49_RS04650 WP_265473583.1 1047284..1047751(-) (pilA) [Mergibacter septicus strain 16309-1-I1]
ATGAAAAGATCAACCTTAACTCAAGCAGCTTTATCTCGAGTTGGATTATCTCAAGGCTTTACTTTAATTGAATTGATGAT
TGTCATCGCAATCATTGCGGTATTAGCCACGGTGGCTATTCCTTCCTATCAAAACTATACCAAGAAAGCTGCAATGTCAG
AATTGATTCAAGCTGCTTCGCCGTATAAGGCTGAAGTTGAATTATGTATTTATAATTCAGGCTCTAAGACAAATTGCTCA
GCAGGACAAAATGGTATTCAAGCGGCGGTTACTGAGAGTGAGAAAGTAAAATATTTGAAAACGGTCGGAGTAAATGCAGG
TGTTATTTCAGTAACAGGTAAAGGAACATTGGAGAATATTGGATATACCATGACGCCAAATGGTGGTGAAAATGGAGAAG
TCATTGACTGGGCTGTAAACTGTACGGGGGATAAAGGAATATTTCCAGTAGGTTTTTGTGCTAATTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilA Glaesserella parasuis strain SC1401

54.61

90.968

0.497

  pilA Haemophilus influenzae 86-028NP

54.676

89.677

0.49

  pilA Haemophilus influenzae Rd KW20

53.237

89.677

0.477


Multiple sequence alignment