Detailed information    

insolico Bioinformatically predicted

Overview


Name   comP   Type   Machinery gene
Locus tag   ACK3CJ_RS05055 Genome accession   NZ_CP176580
Coordinates   1085428..1085826 (+) Length   132 a.a.
NCBI ID   WP_409525338.1    Uniprot ID   -
Organism   Nitrincola sp. WZQS7-5     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 1085887..1100111 1085428..1085826 flank 61


Gene organization within MGE regions


Location: 1085428..1100111
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACK3CJ_RS05055 comP 1085428..1085826 (+) 399 WP_409525338.1 prepilin-type N-terminal cleavage/methylation domain-containing protein Machinery gene
  ACK3CJ_RS05060 - 1085887..1087818 (+) 1932 WP_409525339.1 hypothetical protein -
  ACK3CJ_RS05065 - 1087802..1088686 (+) 885 WP_409525340.1 glycosyltransferase family 2 protein -
  ACK3CJ_RS05070 - 1088703..1089482 (-) 780 WP_409525342.1 cephalosporin hydroxylase family protein -
  ACK3CJ_RS05075 - 1089490..1090182 (-) 693 WP_409525343.1 metallophosphoesterase family protein -
  ACK3CJ_RS05080 - 1090163..1091161 (-) 999 WP_409525344.1 ATP-grasp domain-containing protein -
  ACK3CJ_RS05085 - 1091158..1092039 (-) 882 WP_409525345.1 NAD-dependent epimerase/dehydratase family protein -
  ACK3CJ_RS05090 - 1092036..1092758 (-) 723 WP_409525346.1 class I SAM-dependent methyltransferase -
  ACK3CJ_RS05095 - 1092748..1093497 (-) 750 WP_409525347.1 WbqC family protein -
  ACK3CJ_RS05100 - 1093508..1094632 (-) 1125 WP_409525348.1 DegT/DnrJ/EryC1/StrS family aminotransferase -
  ACK3CJ_RS05105 - 1094629..1095852 (-) 1224 WP_409525350.1 methyltransferase domain-containing protein -
  ACK3CJ_RS05110 - 1095849..1096412 (-) 564 WP_409525351.1 dTDP-4-dehydrorhamnose 3,5-epimerase family protein -
  ACK3CJ_RS05115 rfbG 1096405..1097469 (-) 1065 WP_409525352.1 CDP-glucose 4,6-dehydratase -
  ACK3CJ_RS05120 rfbF 1097478..1098251 (-) 774 WP_409525353.1 glucose-1-phosphate cytidylyltransferase -
  ACK3CJ_RS05125 - 1098308..1099219 (-) 912 WP_409525354.1 hypothetical protein -
  ACK3CJ_RS05130 - 1099206..1100111 (-) 906 WP_409525355.1 glycosyltransferase family 2 protein -

Sequence


Protein


Download         Length: 132 a.a.        Molecular weight: 13207.17 Da        Isoelectric Point: 7.5932

>NTDB_id=1081051 ACK3CJ_RS05055 WP_409525338.1 1085428..1085826(+) (comP) [Nitrincola sp. WZQS7-5]
MKRQQGFTLIELMIVVAIIGILAAIALPAYQDYTTRARVSELVLSASAARTCVTEQSQLGGAVDAGGCVAPGTGGFVSAA
TLNSATGVVQVTGTAAAGSTVITLTPTWVSTLGSVTWACTGSPANFVPGSCR

Nucleotide


Download         Length: 399 bp        

>NTDB_id=1081051 ACK3CJ_RS05055 WP_409525338.1 1085428..1085826(+) (comP) [Nitrincola sp. WZQS7-5]
ATGAAAAGACAACAAGGTTTTACACTCATTGAGTTGATGATCGTCGTGGCGATCATCGGTATTCTGGCTGCGATCGCGCT
GCCGGCTTATCAGGATTATACAACCCGAGCACGCGTTTCTGAGCTTGTTTTATCAGCCTCAGCGGCCCGTACCTGCGTTA
CTGAACAGTCTCAGCTTGGTGGGGCTGTTGATGCTGGAGGGTGCGTAGCACCAGGAACAGGTGGTTTTGTATCGGCTGCT
ACGTTAAACAGTGCGACTGGTGTGGTTCAAGTAACAGGTACCGCTGCAGCAGGTAGTACTGTAATTACTTTAACACCTAC
ATGGGTATCGACCCTTGGAAGTGTTACGTGGGCTTGTACAGGATCTCCAGCTAACTTTGTTCCAGGCTCATGCCGCTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comP Acinetobacter baylyi ADP1

48.684

100

0.561

  pilA2 Legionella pneumophila str. Paris

52.174

100

0.545

  pilA2 Legionella pneumophila strain ERS1305867

52.174

100

0.545

  pilA Ralstonia pseudosolanacearum GMI1000

42.963

100

0.439

  pilA/pilA1 Eikenella corrodens VA1

38

100

0.432

  pilA Pseudomonas aeruginosa PAK

38.255

100

0.432

  pilA Haemophilus influenzae Rd KW20

36.364

100

0.424

  pilA Acinetobacter baumannii strain A118

36.62

100

0.394

  pilA Glaesserella parasuis strain SC1401

37.41

100

0.394

  pilA Haemophilus influenzae 86-028NP

37.956

100

0.394


Multiple sequence alignment