Detailed information    

in silico: Bioinformatically predicted

Overview


Name: comC
Type: Machinery gene
Locus tag: AAC899_RS01450
Genome accession: NZ_CP151168
Coordinates: 331936..336297 (+)
Length: 1453 a.a.
NCBI ID: WP_343316464.1
UniProt ID: -
Organism: Acinetobacter soli strain HY24
Function: assembly of type IV pilus (predicted from homology); DNA binding and uptake
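
The coordinates and the protein length in this overview are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(331936, 336297)   # coordinates from the overview
print(cds_bp, protein_length(cds_bp))
```

With the values listed for comC this yields 4362 bp and 1453 residues, in agreement with the lengths reported in the Sequence section.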

Genomic Context


Location: 326936..341297
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  AAC899_RS01415 (AAC899_01415) rpoZ 327125..327406 (-) 282 WP_004934284.1 DNA-directed RNA polymerase subunit omega -
  AAC899_RS01420 (AAC899_01420) gmk 327475..328098 (-) 624 WP_004945015.1 guanylate kinase -
  AAC899_RS01425 (AAC899_01425) ispH 328234..329184 (+) 951 WP_004934289.1 4-hydroxy-3-methylbut-2-enyl diphosphate reductase -
  AAC899_RS01430 (AAC899_01430) - 329305..329751 (+) 447 WP_111811938.1 GspH/FimT family pseudopilin -
  AAC899_RS01435 (AAC899_01435) pilV 329748..330284 (+) 537 WP_025095723.1 type IV pilus modification protein PilV Machinery gene
  AAC899_RS01440 (AAC899_01440) comB 330285..331253 (+) 969 WP_025095722.1 PilW family protein Machinery gene
  AAC899_RS01445 (AAC899_01445) pilX 331250..331918 (+) 669 WP_025095721.1 PilX N-terminal domain-containing pilus assembly protein Machinery gene
  AAC899_RS01450 (AAC899_01450) comC 331936..336297 (+) 4362 WP_343316464.1 PilC/PilY family type IV pilus protein Machinery gene
  AAC899_RS01455 (AAC899_01455) comE 336363..336836 (+) 474 WP_119689994.1 type IV pilin protein Machinery gene
  AAC899_RS01460 (AAC899_01460) comF 336836..337270 (+) 435 WP_055414914.1 type IV pilin protein Machinery gene
  AAC899_RS01465 (AAC899_01465) rpsP 337412..337669 (+) 258 WP_004934313.1 30S ribosomal protein S16 -
  AAC899_RS01470 (AAC899_01470) rimM 337687..338235 (+) 549 WP_317657585.1 ribosome maturation factor RimM -
  AAC899_RS01475 (AAC899_01475) trmD 338284..339018 (+) 735 WP_343316465.1 tRNA (guanosine(37)-N1)-methyltransferase TrmD -
  AAC899_RS01480 (AAC899_01480) rplS 339228..339599 (+) 372 WP_004934323.1 50S ribosomal protein L19 -
  AAC899_RS01485 (AAC899_01485) - 339666..340637 (-) 972 WP_048764474.1 triacylglycerol lipase -
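
The table above can be checked programmatically. The sketch below computes the spacing between adjacent machinery genes and reproduces the displayed location window. It assumes 1-based inclusive coordinates and a 5000 bp flank on each side of comC; the flank size is not stated in the record and is inferred from the numbers (331936 - 326936 = 5000 and 341297 - 336297 = 5000). The function names are hypothetical.

```python
# (gene, start, end) copied from the genomic context table, all on the + strand.
GENES = [
    ("pilV", 329748, 330284),
    ("comB", 330285, 331253),
    ("pilX", 331250, 331918),
    ("comC", 331936, 336297),
    ("comE", 336363, 336836),
    ("comF", 336836, 337270),
]


def spacing(prev_end: int, next_start: int) -> int:
    """Bases between two features; a negative value means they overlap."""
    return next_start - prev_end - 1


def neighbour_spacings(genes):
    """Spacing for each pair of consecutive genes, in table order."""
    return [(a[0], b[0], spacing(a[2], b[1])) for a, b in zip(genes, genes[1:])]


def context_window(start: int, end: int, flank: int = 5000):
    """Window around a gene; the 5000 bp default is an inferred assumption."""
    return max(1, start - flank), end + flank


for left, right, gap in neighbour_spacings(GENES):
    print(f"{left} -> {right}: {gap} bp")
print(context_window(331936, 336297))
```

Small or negative spacings, such as the 4 bp overlap between comB and pilX, are consistent with a tightly packed gene cluster, although spacing alone does not establish co-transcription.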

Sequence


Protein


Length: 1453 a.a.; Molecular weight: 157087.74 Da; Isoelectric point: 7.9077

>NTDB_id=975568 AAC899_RS01450 WP_343316464.1 331936..336297(+) (comC) [Acinetobacter soli strain HY24]
MKNKKSKRFNPLLGTATPQHHRMLCTAMMSVSLIIACNAQASDVEVYHQGAQFDKRLMLMVDQSRTMGGAGALDLLKEYP
ICVGKGVSNILGSGGGLGLLGDKDALELVTNTVGAVDQVALQGLLGGSLDPLLKITTESPSSKYNYARNYCTVVTTDLVV
KTLDGLLKPLLGATGLDAKSYIENTCDFQANVKLLNSVSAVGVYRCYDKLSRVKNALTDVLLGNSSTGLKPLPDNVSVGI
SVMPVDVMTNDRAGRILAPACKLSTTRASDTNFRCFEKTYSGSATTYRNYLVEKIAGNIQSKGDFNLQNVLVIVPKLIAA
LVGDIFSKLWSILSNPVKGLSELLEFKGTASTVNDLLKTLGLAGNVPTASTYAETGAYLMGTTTKGTGARQIAMRIIPVK
ILFLPLLGWYDYKCDKYGTDSWQSDGVTCKDDAWQYKSGGISTTNLIQVNDDLVSSLISDGVVGGLLNGVVGNILGIKVS
EKHLYYGYDSQADIEYSGFSNSVDSSKRSTTFYRAPTDTPQCNANGIMVITGGVPNITPTVTDALLKQGAEGLGTQNAIE
RLMGRSLNTSATDLSKLDLSNVFQCDASAGLKSTTLSSRDYATWSCIGNYTKKLLDKNITTGVIGVGREFTTIPSTTDSA
QLEASMTGVNNSLLGQIVPNLLKDTLNVLLGGTLVNGLTSLLGNVFPTDAEDVKNLARWGVLGKGGWYNSASSENIANSI
YNFYANLGISNRETFLGVPVVPTDPLTPYNLDNYVFQNMFVPDDKQTWFGNVKKYLADTGETIQTKTITDVWSNSNIQST
KTSILTGGFADKLPTATSSSASSRQLYVNRACITKDKKYDVSASIGLIGNDYYTKLCESQSKDPRRSDLMNLLGYQIQTT
NNQESLIAKPEYKKVGMVLHSSPIKITQSATIKNDGSLTRDDYLVFGTVQGLLHVVNASTGVEKFAFLPNELLENAKQRK
AFTGTNLEGHDNFNNMQYGIDAPWTAYSEYVWNTKDGKLTVGKSATDATCIKDGVSTGACGKQYLYGGLRMGGRSYYALD
LSLWDDQTKPSLKFYIDPASGKVYSSTTPEGKSFDAIKYMGQSWSKPTVAWVNWQGKRKLVMFVGGGYDAGGNDGNANNG
GYEQVSYAQSNKIGSGVYMFDAENGDLLWWASNLATTANTGATMALKTNMQYSVVSRINAVDRNGDGLVDHLYFGDLGGQ
LWRVDINNGSNPQQFAQAAQLLDLQNNKPSDLTDANLRFYEAPIFSVYGYGANSLAVLSIATTNRSLPISDQSAGVVFNL
FDNDVTQVSFSTRDMAANNKNLVAYAQLPKYGELTMVNRPQYGWYAKLSNQQKVMDEPVVINKNLYVSIFDPIAQEGRVA
DCSIGVQGLSSVRRFCLPYGVCEKSADLTTTLKLGKGILPVTIGAGTTSNGVTTRQILGGSSTDRDSTKDVLSSTQTTRR
QIVPLKWYEQYTP
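
The FASTA header of this record packs several fields into one line. The sketch below splits such a header into named fields with a regular expression. The pattern is an assumption derived only from the header shown in this entry; other records in the database may use a different layout, and the field names are illustrative.

```python
import re

# Pattern inferred from the single header in this record (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) "
    r"(?P<locus_tag>\S+) "
    r"(?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) "
    r"\[(?P<organism>[^\]]+)\]$"
)


def parse_header(line: str) -> dict:
    """Return the header fields as a dict; start and end become integers."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not follow the expected layout")
    record = match.groupdict()
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record


header = (">NTDB_id=975568 AAC899_RS01450 WP_343316464.1 "
          "331936..336297(+) (comC) [Acinetobacter soli strain HY24]")
print(parse_header(header))
```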

Nucleotide


Length: 4362 bp

>NTDB_id=975568 AAC899_RS01450 WP_343316464.1 331936..336297(+) (comC) [Acinetobacter soli strain HY24]
ATGAAAAACAAGAAATCTAAAAGATTTAATCCCTTACTAGGTACCGCTACACCTCAGCATCATCGGATGCTGTGTACAGC
TATGATGAGCGTTTCACTGATAATTGCATGCAACGCTCAGGCCAGTGATGTCGAGGTTTATCATCAAGGAGCACAATTTG
ATAAGCGTTTAATGCTGATGGTCGATCAGTCCCGCACCATGGGGGGAGCAGGAGCACTCGATTTATTAAAAGAATATCCT
ATTTGTGTGGGGAAAGGTGTCTCTAATATTTTAGGTTCAGGGGGTGGGCTGGGGCTGCTAGGCGATAAAGATGCGCTTGA
GTTGGTCACCAATACCGTAGGTGCAGTCGATCAGGTGGCTTTACAGGGCTTACTGGGCGGGAGTCTGGATCCGCTGTTAA
AGATCACCACAGAGTCACCGAGCAGTAAATACAACTATGCACGTAATTACTGTACCGTAGTCACCACAGATTTGGTGGTA
AAAACACTCGATGGCTTATTAAAACCGCTGCTGGGTGCAACCGGGCTAGACGCCAAAAGTTATATCGAAAATACCTGTGA
TTTTCAGGCCAATGTTAAACTGCTGAACAGTGTTTCTGCGGTGGGTGTATATCGATGCTATGACAAACTTTCGCGGGTCA
AAAATGCCTTAACGGACGTATTGCTTGGAAATAGTAGTACAGGGCTCAAACCTCTGCCAGACAATGTCAGTGTCGGGATA
TCGGTCATGCCGGTTGATGTTATGACCAATGATCGTGCGGGACGTATTTTGGCGCCTGCTTGTAAGCTGTCGACCACTCG
AGCCAGTGATACCAATTTCCGTTGTTTTGAAAAGACCTACTCAGGGAGTGCCACCACGTATCGTAACTATCTAGTCGAAA
AAATTGCAGGAAATATTCAGTCCAAAGGCGATTTTAATCTACAAAATGTATTGGTTATTGTGCCGAAACTGATTGCAGCG
TTGGTGGGTGATATCTTTAGTAAACTCTGGAGCATTCTTTCAAATCCAGTAAAAGGTTTGTCTGAGTTATTAGAATTTAA
GGGTACAGCCAGCACCGTCAATGATTTACTCAAAACTCTTGGACTCGCAGGGAATGTTCCAACCGCCAGTACCTATGCAG
AGACTGGTGCCTATTTAATGGGAACCACTACCAAGGGTACTGGTGCTAGACAAATAGCAATGAGAATTATTCCAGTAAAA
ATTCTATTTCTTCCACTATTAGGCTGGTACGATTATAAATGTGATAAATACGGAACAGATAGCTGGCAATCTGATGGTGT
AACGTGTAAGGATGATGCTTGGCAGTATAAGTCTGGTGGAATATCAACAACTAATCTTATACAGGTAAATGATGATCTGG
TCAGCTCCCTTATCAGTGATGGAGTGGTAGGGGGGCTTTTAAATGGTGTGGTTGGTAACATCTTGGGAATCAAGGTCTCA
GAGAAACATCTCTACTATGGCTATGATTCACAAGCTGATATTGAATATAGCGGCTTTAGTAACTCGGTCGATTCCAGCAA
ACGTTCTACTACATTTTATCGCGCACCGACAGACACACCGCAATGTAATGCTAACGGTATTATGGTGATTACTGGTGGCG
TACCAAATATCACCCCAACGGTTACCGATGCACTGCTTAAGCAAGGCGCGGAAGGGCTGGGCACACAAAATGCGATTGAG
CGTTTGATGGGCCGTTCACTCAATACTTCGGCCACAGACTTGTCGAAGCTCGACCTCAGTAACGTTTTTCAGTGTGATGC
ATCGGCAGGATTAAAATCGACCACTTTAAGTAGCCGTGATTATGCCACGTGGAGCTGCATTGGTAATTACACCAAAAAGC
TACTCGATAAAAATATCACCACGGGTGTGATTGGCGTTGGGCGTGAATTTACCACGATTCCAAGCACTACAGATTCAGCC
CAGCTTGAAGCATCTATGACGGGTGTAAATAATTCGCTATTGGGTCAGATTGTTCCGAATTTATTAAAGGATACCTTAAA
CGTATTGCTTGGTGGTACGTTGGTGAATGGTTTAACCAGTTTGCTTGGCAATGTGTTTCCAACCGATGCCGAGGATGTCA
AAAATTTGGCACGGTGGGGCGTATTGGGTAAAGGGGGCTGGTATAACTCAGCCAGCTCAGAAAATATTGCCAACAGTATT
TATAACTTCTATGCCAATTTGGGCATTAGTAACCGGGAAACTTTTTTAGGCGTACCTGTGGTGCCAACAGATCCACTTAC
CCCATACAATCTTGATAACTATGTTTTTCAAAACATGTTTGTTCCCGACGACAAACAAACTTGGTTTGGGAATGTAAAAA
AATATTTGGCCGATACAGGTGAAACCATACAGACCAAAACCATCACCGATGTCTGGAGTAATAGCAACATACAATCCACC
AAAACCAGTATTTTAACGGGTGGGTTTGCAGACAAATTACCTACAGCGACCAGTAGCAGTGCATCTAGCCGGCAGTTATA
CGTGAACCGCGCGTGTATCACGAAAGATAAAAAATATGATGTTTCCGCTTCAATTGGGCTGATTGGAAACGACTACTACA
CCAAGCTCTGCGAAAGTCAAAGTAAAGACCCGCGTCGTAGCGATTTGATGAACCTGCTGGGCTATCAAATTCAAACCACC
AATAATCAAGAAAGTCTGATTGCCAAACCTGAATATAAAAAAGTCGGGATGGTGCTTCATTCATCGCCAATCAAAATTAC
CCAGTCGGCTACCATCAAAAATGATGGTTCATTGACACGTGATGATTATTTGGTATTTGGTACCGTCCAAGGGCTTTTAC
ATGTGGTCAATGCCAGTACAGGTGTAGAAAAATTTGCATTTTTACCCAATGAGCTACTTGAAAATGCCAAGCAGCGTAAA
GCTTTTACAGGTACAAATCTTGAGGGACACGACAACTTTAACAATATGCAATACGGTATTGATGCGCCGTGGACCGCCTA
CAGTGAATACGTGTGGAACACTAAAGATGGCAAACTCACTGTTGGTAAGTCGGCCACAGATGCAACCTGTATAAAAGATG
GCGTCTCGACGGGCGCCTGCGGTAAGCAATACCTTTATGGTGGATTACGTATGGGTGGACGCAGTTATTATGCCCTGGAT
TTGAGCCTATGGGATGATCAAACCAAACCAAGCTTAAAGTTCTATATTGATCCTGCTTCTGGCAAAGTTTATTCATCGAC
CACTCCAGAAGGAAAAAGCTTCGATGCCATCAAGTACATGGGGCAAAGCTGGTCGAAACCAACCGTGGCATGGGTAAACT
GGCAGGGCAAACGCAAATTGGTGATGTTTGTGGGTGGTGGCTACGATGCAGGTGGAAACGATGGTAATGCCAATAATGGC
GGATACGAACAGGTCAGTTACGCTCAGAGCAACAAGATTGGTTCGGGTGTGTATATGTTTGATGCCGAAAATGGCGATCT
GCTCTGGTGGGCCAGCAATTTGGCCACCACAGCCAATACCGGTGCCACCATGGCCCTGAAAACCAACATGCAATATAGCG
TCGTCAGCCGGATCAATGCAGTAGATCGAAATGGTGATGGTCTGGTCGATCATTTGTACTTTGGTGATTTGGGCGGACAA
CTCTGGCGTGTCGATATTAATAACGGCAGTAATCCTCAACAGTTTGCACAGGCAGCACAACTGCTCGATCTGCAAAACAA
TAAACCAAGTGACTTAACCGATGCCAATTTACGTTTTTATGAAGCACCTATTTTTAGTGTGTATGGATATGGCGCCAATT
CACTTGCTGTGTTGAGTATTGCAACCACTAACCGCAGTTTACCGATTAGTGATCAAAGTGCCGGGGTGGTATTTAATCTT
TTCGACAACGATGTCACACAGGTGAGTTTTAGCACCCGTGATATGGCAGCAAACAATAAAAATTTAGTCGCTTATGCACA
ATTACCAAAATATGGGGAGCTGACCATGGTCAATCGCCCGCAATATGGTTGGTATGCCAAATTAAGCAACCAGCAAAAGG
TGATGGATGAACCTGTGGTCATCAATAAAAACCTGTATGTGAGTATTTTTGATCCGATTGCACAAGAAGGACGTGTGGCA
GATTGCTCAATTGGTGTGCAAGGTTTGAGTAGTGTCCGTCGCTTTTGTCTGCCATATGGTGTGTGCGAAAAGTCGGCTGA
CCTGACAACAACACTTAAACTTGGAAAAGGTATTTTACCTGTGACGATTGGCGCCGGTACCACCAGCAATGGCGTAACGA
CCCGACAAATCTTGGGGGGAAGTTCAACCGATCGTGACTCGACCAAAGATGTGCTCTCAAGTACCCAAACCACGCGGCGT
CAAATTGTGCCATTAAAATGGTATGAACAGTACACCCCTTAA
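
The nucleotide record can be checked against the protein record by translating it. The following is a minimal sketch, assuming the standard codon assignments (which the bacterial translation table 11 shares for all sense codons) and a sequence that is already in frame. It builds the codon table in the conventional TCAG order and does not handle ambiguous bases beyond marking them as X.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop, 'X' an unknown codon."""
    seq = "".join(dna.split()).upper()   # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First 24 nt and last 6 nt of the comC nucleotide record above.
print(translate("ATGAAAAACAAGAAATCTAAAAGA"))
print(translate("CCTTAA"))
```

The first eight codons give MKNKKSKR, matching the start of the protein sequence, and the final two codons give P followed by a stop, matching its last residue.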


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comC Acinetobacter baylyi ADP1 79.375 99.105 0.787
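
The record does not define the Ha-value. The sketch below assumes it is the product of fractional identity and fractional coverage, which is consistent with the numbers listed for the Acinetobacter baylyi ADP1 comC hit but is not confirmed by this page. The function name is illustrative.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: (identity / 100) * (coverage / 100)."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Values from the similar-proteins entry for comC of A. baylyi ADP1.
print(round(ha_value(79.375, 99.105), 3))
```

Under this assumption the result rounds to 0.787, the value shown in the table.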


Multiple sequence alignment