Detailed information    

insolico Bioinformatically predicted

Overview


Name   comC   Type   Machinery gene
Locus tag   N5P16_RS03640 Genome accession   NZ_CP104734
Coordinates   806290..810642 (+) Length   1450 a.a.
NCBI ID   WP_004923840.1    Uniprot ID   A0A7G2SAG8
Organism   Acinetobacter baylyi strain JAT2044     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 801290..815642
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  N5P16_RS03605 (N5P16_03605) rpoZ 801475..801756 (-) 282 WP_004923811.1 DNA-directed RNA polymerase subunit omega -
  N5P16_RS03610 (N5P16_03610) gmk 801825..802448 (-) 624 WP_004923822.1 guanylate kinase -
  N5P16_RS03615 (N5P16_03615) ispH 802583..803533 (+) 951 WP_004923824.1 4-hydroxy-3-methylbut-2-enyl diphosphate reductase -
  N5P16_RS03620 (N5P16_03620) - 803660..804106 (+) 447 WP_004923829.1 GspH/FimT family pseudopilin -
  N5P16_RS03625 (N5P16_03625) pilV 804103..804639 (+) 537 WP_004923832.1 type IV pilus modification protein PilV Machinery gene
  N5P16_RS03630 (N5P16_03630) comB 804640..805608 (+) 969 WP_004923834.1 PilW family protein Machinery gene
  N5P16_RS03635 (N5P16_03635) pilX 805605..806273 (+) 669 WP_004923837.1 PilX N-terminal domain-containing pilus assembly protein Machinery gene
  N5P16_RS03640 (N5P16_03640) comC 806290..810642 (+) 4353 WP_004923840.1 PilC/PilY family type IV pilus protein Machinery gene
  N5P16_RS03645 (N5P16_03645) comE 810704..811213 (+) 510 WP_004923843.1 type IV pilin protein Machinery gene
  N5P16_RS03650 (N5P16_03650) comF 811213..811647 (+) 435 WP_004923844.1 type IV pilin protein Machinery gene
  N5P16_RS03655 (N5P16_03655) rpsP 811795..812052 (+) 258 WP_004923845.1 30S ribosomal protein S16 -
  N5P16_RS03660 (N5P16_03660) rimM 812072..812620 (+) 549 WP_004923846.1 ribosome maturation factor RimM -
  N5P16_RS03665 (N5P16_03665) trmD 812668..813417 (+) 750 WP_004923847.1 tRNA (guanosine(37)-N1)-methyltransferase TrmD -
  N5P16_RS03670 (N5P16_03670) rplS 813559..813930 (+) 372 WP_004923849.1 50S ribosomal protein L19 -
  N5P16_RS03675 (N5P16_03675) - 814000..814968 (-) 969 WP_004923853.1 esterase/lipase family protein -

Sequence


Protein


Download         Length: 1450 a.a.        Molecular weight: 157837.56 Da        Isoelectric Point: 6.3258

>NTDB_id=731695 N5P16_RS03640 WP_004923840.1 806290..810642(+) (comC) [Acinetobacter baylyi strain JAT2044]
MKKSIYTQYLNSLYLVLQHKFSIFSISVFSCMLIMTSVVQASDVEVYQQGAQFDKRLMLMVDQSRSMGGAGALDLLKEYP
ICVGKGVSNVLGSGGGLGILGDKDALELVTDTVGAVDQVVLQGLLGGSLDPLLKITTESPSSKYNYARNYCTVVTTDLVV
KTLDGLLKPLLGATGLDAKSYIENTCDFQANIQLLNSVSVVGVYRCYDKLSRVKNALTDVLLGNDKTGLKPLPDNVSVGL
SAMPVDIMTKDRAGRILAPACKLSTTKASDTSNQCFEKGYAGSATTYRGYLVEKVANGIQSKGEFNLQNILLIVPKLVAA
LVGDVFNKLIGILTNPIRGLTDLLEFENTANVINDLLKTLGLSGNIPTASTYAETGAYLLGTSTKGTGARQIAMRIIPVK
ILFIPLLGWYDYKCDKYGKDSWQADGVTCKDDAWQYKLGGISTTNLIQVNDDLVSSLISDGVVGGLLNGVVGNILGIKVS
EKHLYYGYDSQADIEYSGFNYRDNSVPTRNNNQFYQAPANQPQCNASGIMVITGGVPNITPTTTDVLLQPSGKEGLGTKN
AIERLMGRSLDTQATDLSRLDLQSVFQCDESAGLKSTTLSSRDYATWSCIGNYSKKLLDKNVSTGVIGVGREFITIPSTK
DSAQLEASMSNLNNSLLGQTVPNLVKDTLNVLLGGPLVNGLTNLLGNLFPTDAEDVKNLARWGVLGKGGWYNSASSENIA
NSVYSFYSDLGVTKRETFLGAPVIPTDPLTPYNLNNYVFQNMFVPDDKQTWFGNVKKYMTDTGETIQTKTFQDVWSNQNI
DQTNILSGGVIDKLPIASGNNASTRQLYINRTCDTKAKKYDVSNAISAVGNNYYTQLCAGQTTDPRRNDLMNLLGYQIQS
VNNQETLVAKPEYRKIGMVLHSTPIKITQSATVKDDGSLTRDDYLVFGSAEGLLHVVNADTGVEKLAFLPNEMLENSKQR
KAFTGNGLEGHDSFSNMQYGVDGPWTAYTEYVWNSKDNKLTVGKSTTDAVCIKDGVFTGACGKQYLYGGLRMGGRSYYAL
DLNDLAAPKLKFYIDPASGRVYSDAYPGGKSFDAIKNMGQSWSKPTIAWINWQGKRKLVMFVGGGYDAGGTDGNSNNGGY
ELANYNQTNKRGAGVYMFDAENGDLLWWANNLATTSNDVSNSALNTNMQFSVVGRINTVDRDGDGLIDHLYFGDLGGQLW
RVDINNRVDAKQFASAALLLDLQSNRPTDLKDTNVRFYEAPVFSIYGYGSESLAVLSIASSNRSLPISDKSAGAIFNIFD
KDVTQVSFTTRDTYTDNKNLVAYSQLPKWGALTNENKPQYGWYVKLIDQQKVMDETAVINKSLYVSIYDPIAQDGRVADC
SIGIQGLSSIRRYCLPYGVCEKQTDLSGMLKLGKGILPVTIGSGSTDNKSTRQLIGGFSKDERTNAANVLGQNTLRRQIV
PLKWYEQSTP

Nucleotide


Download         Length: 4353 bp        

>NTDB_id=731695 N5P16_RS03640 WP_004923840.1 806290..810642(+) (comC) [Acinetobacter baylyi strain JAT2044]
ATGAAAAAATCGATATACACTCAGTATCTCAACTCTCTTTATTTGGTCTTGCAGCATAAGTTCAGTATTTTCAGTATTTC
AGTATTTAGCTGTATGCTAATCATGACTTCTGTTGTGCAGGCAAGCGATGTAGAGGTTTACCAGCAGGGAGCGCAGTTTG
ATAAACGACTGATGCTTATGGTCGATCAGTCGCGAAGCATGGGAGGAGCTGGAGCACTTGATCTATTAAAAGAATATCCA
ATTTGTGTGGGCAAAGGGGTATCCAATGTTTTGGGCTCTGGTGGCGGGTTAGGCATACTCGGCGACAAAGACGCGCTGGA
ATTGGTGACCGATACAGTAGGTGCGGTCGATCAGGTTGTTTTACAAGGCTTACTGGGTGGAAGTCTTGATCCGCTTTTAA
AAATCACTACTGAATCACCCAGTAGTAAATATAACTATGCACGTAATTACTGTACTGTTGTTACAACCGATTTGGTAGTG
AAAACATTAGATGGTTTATTAAAACCTTTACTTGGGGCAACAGGGTTAGATGCAAAAAGTTATATTGAAAATACTTGCGA
CTTTCAGGCGAATATCCAATTATTAAACAGTGTTTCTGTTGTAGGCGTTTATCGTTGCTATGATAAATTGTCCCGTGTAA
AAAATGCTTTGACAGATGTGTTGCTGGGGAATGATAAAACTGGCTTAAAACCTTTGCCAGATAATGTCAGTGTTGGATTA
TCAGCGATGCCTGTAGATATCATGACTAAAGATCGAGCAGGTCGGATTCTGGCACCAGCATGTAAGTTATCTACAACAAA
AGCCAGCGATACATCAAATCAATGTTTTGAAAAGGGATATGCGGGTTCGGCAACGACCTATCGTGGTTATTTAGTGGAAA
AAGTTGCTAATGGTATTCAATCTAAAGGTGAATTTAATCTTCAGAATATATTGTTGATTGTTCCCAAGCTAGTTGCTGCA
TTGGTAGGGGATGTATTCAACAAGCTAATAGGAATTCTTACCAATCCAATTCGAGGGTTAACAGATCTTCTTGAATTTGA
AAACACAGCAAATGTTATTAATGACTTGTTAAAAACACTGGGTTTATCTGGGAATATTCCAACAGCAAGTACCTACGCAG
AAACGGGGGCGTACCTGCTTGGAACATCGACCAAGGGTACAGGCGCTAGACAAATAGCAATGAGAATTATTCCAGTAAAA
ATTCTATTTATTCCGCTGTTAGGCTGGTACGATTATAAATGTGATAAATATGGAAAAGACAGCTGGCAAGCTGATGGTGT
GACGTGTAAGGATGATGCCTGGCAGTATAAGCTTGGTGGAATATCAACAACCAATCTTATACAGGTAAATGATGATCTGG
TCAGCTCCCTTATCAGTGATGGTGTGGTGGGGGGACTCTTAAATGGTGTGGTTGGTAACATCTTGGGAATCAAGGTCTCA
GAGAAACATCTTTACTATGGCTATGATTCACAAGCCGATATTGAATATAGTGGTTTTAATTATCGAGATAATTCTGTTCC
AACCAGAAATAATAATCAGTTTTATCAGGCACCAGCAAATCAGCCACAATGTAATGCCAGCGGAATTATGGTGATTACTG
GAGGTGTTCCAAATATTACCCCAACTACAACGGATGTTTTACTACAACCATCAGGTAAAGAAGGTTTGGGAACAAAAAAT
GCAATAGAGCGTTTAATGGGACGTTCATTGGATACCCAAGCAACAGATCTTTCCAGATTGGATCTGCAAAGTGTCTTTCA
ATGCGATGAGAGTGCTGGTTTAAAATCTACAACTCTCAGTAGTCGCGATTATGCTACATGGAGTTGTATTGGAAATTACA
GCAAAAAATTGCTGGATAAAAATGTGTCCACTGGCGTGATTGGAGTGGGGCGGGAGTTTATTACCATTCCGAGCACAAAA
GATTCTGCACAACTTGAAGCATCAATGAGTAACTTGAATAATTCACTATTAGGACAGACTGTTCCTAATCTAGTTAAAGA
TACTTTGAATGTGCTTTTGGGTGGACCATTGGTGAATGGATTGACTAACTTACTTGGAAATTTATTTCCAACAGATGCAG
AAGATGTTAAAAATTTGGCACGTTGGGGTGTGCTGGGCAAGGGTGGATGGTACAACTCTGCAAGTTCAGAGAATATTGCG
AATAGTGTTTATTCATTTTACTCAGATTTAGGGGTTACGAAACGAGAGACTTTCTTGGGAGCACCAGTAATTCCAACAGA
TCCTTTAACACCATACAATTTAAATAATTACGTTTTTCAAAATATGTTTGTTCCAGATGATAAACAAACATGGTTTGGTA
ACGTTAAAAAATATATGACAGATACTGGTGAAACAATCCAGACTAAAACTTTTCAAGATGTCTGGAGTAATCAGAATATT
GATCAGACTAATATTTTGAGTGGTGGCGTGATTGATAAATTACCTATTGCTTCTGGCAACAACGCATCTACGCGTCAGCT
TTATATCAATCGGACTTGTGATACAAAAGCCAAAAAATACGATGTCTCTAATGCGATTAGTGCAGTTGGTAATAATTACT
ATACTCAGCTCTGTGCAGGCCAGACTACGGATCCGCGCCGTAATGATCTGATGAATCTGTTGGGCTACCAGATTCAAAGT
GTCAATAATCAGGAAACTTTAGTTGCCAAGCCTGAGTACCGCAAGATTGGTATGGTTTTACATTCAACACCTATCAAAAT
CACACAGTCTGCCACAGTTAAGGATGATGGTTCTTTGACACGAGATGACTATCTTGTATTCGGTTCAGCAGAAGGTCTAT
TACACGTTGTAAATGCAGATACAGGCGTAGAAAAGCTAGCATTTTTACCAAATGAAATGCTGGAAAATTCTAAACAACGT
AAAGCATTTACAGGCAATGGTCTTGAAGGCCATGATAGTTTTAGTAACATGCAATATGGTGTAGATGGGCCATGGACTGC
CTATACCGAATATGTCTGGAATAGCAAAGACAATAAATTAACTGTAGGTAAATCTACTACTGATGCTGTTTGTATTAAAG
ATGGTGTATTTACAGGAGCATGTGGAAAACAATATCTTTATGGTGGTTTGCGTATGGGAGGGCGTAGTTATTATGCGCTT
GATCTGAATGATCTTGCTGCACCAAAACTCAAATTCTATATCGATCCAGCAAGTGGTCGGGTTTATTCCGATGCCTATCC
TGGTGGCAAAAGTTTTGATGCAATTAAAAATATGGGACAAAGTTGGTCAAAACCGACCATTGCTTGGATTAACTGGCAAG
GCAAACGTAAGTTGGTTATGTTTGTAGGTGGTGGCTACGACGCTGGCGGTACAGATGGCAATAGTAATAACGGTGGCTAT
GAACTTGCGAATTATAATCAGACCAATAAGCGTGGTGCAGGCGTTTACATGTTTGATGCCGAAAATGGTGATTTGCTTTG
GTGGGCAAATAATTTGGCAACGACATCAAATGACGTTTCGAATAGTGCCTTAAATACAAATATGCAATTCAGTGTTGTCG
GTAGAATCAATACAGTAGATCGAGATGGAGATGGTTTAATAGATCATCTTTACTTTGGTGATTTGGGGGGGCAGTTATGG
CGAGTCGATATTAATAATCGTGTAGATGCCAAGCAGTTTGCTTCGGCTGCGCTTCTTCTCGATTTACAATCTAACAGACC
TACAGATCTTAAAGATACAAATGTCCGTTTCTATGAAGCACCTGTATTTAGTATTTATGGTTATGGTAGTGAGTCGCTTG
CAGTTTTGAGCATTGCTTCAAGCAATCGTAGTTTGCCAATTAGTGATAAAAGTGCGGGTGCTATTTTTAATATCTTTGAC
AAGGATGTAACTCAAGTCAGTTTCACCACACGGGATACCTATACAGATAATAAAAATCTGGTTGCATATTCTCAATTACC
TAAATGGGGGGCTTTGACCAATGAAAATAAACCGCAATATGGTTGGTATGTGAAACTCATTGATCAGCAAAAAGTAATGG
ATGAAACAGCAGTAATCAATAAGAGTTTGTATGTCAGTATCTATGATCCGATTGCTCAGGATGGGCGTGTTGCAGATTGC
TCAATTGGTATACAAGGCTTAAGTAGCATCCGTCGTTATTGTTTACCTTATGGCGTATGTGAAAAACAGACCGATTTATC
AGGAATGCTAAAACTCGGAAAAGGGATTTTACCTGTAACAATTGGCTCAGGCTCAACGGATAATAAATCGACGAGACAAT
TGATCGGTGGATTTAGCAAAGACGAGCGTACCAACGCTGCCAACGTACTTGGGCAAAATACCCTACGCCGTCAGATCGTA
CCATTAAAATGGTATGAGCAAAGCACCCCTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comC Acinetobacter baylyi ADP1

100

100

1