Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilY1   Type   Machinery gene
Locus tag   FDQ49_RS11215 Genome accession   NZ_CP040105
Coordinates   2380173..2384357 (+) Length   1394 a.a.
NCBI ID   WP_022575990.1    Uniprot ID   -
Organism   Acinetobacter nosocomialis M2     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2375173..2389357
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  FDQ49_RS11185 (FDQ49_11185) gmk 2375465..2376094 (-) 630 WP_002051739.1 guanylate kinase -
  FDQ49_RS11190 (FDQ49_11190) ispH 2376217..2377167 (+) 951 WP_002051768.1 4-hydroxy-3-methylbut-2-enyl diphosphate reductase -
  FDQ49_RS11195 (FDQ49_11195) fimU 2377336..2377809 (+) 474 WP_022575992.1 Tfp pilus assembly protein FimT/FimU Machinery gene
  FDQ49_RS11200 (FDQ49_11200) pilV 2377803..2378363 (+) 561 WP_079377028.1 type IV pilus modification protein PilV Machinery gene
  FDQ49_RS11205 (FDQ49_11205) pilW 2378363..2379361 (+) 999 WP_022575991.1 PilW family protein Machinery gene
  FDQ49_RS11210 (FDQ49_11210) pilX 2379358..2380173 (+) 816 WP_004711686.1 PilX N-terminal domain-containing pilus assembly protein Machinery gene
  FDQ49_RS11215 (FDQ49_11215) pilY1 2380173..2384357 (+) 4185 WP_022575990.1 PilC/PilY family type IV pilus protein Machinery gene
  FDQ49_RS11220 (FDQ49_11220) pilY2 2384370..2384852 (+) 483 WP_022575989.1 type IV pilin protein Machinery gene
  FDQ49_RS11225 (FDQ49_11225) pilE 2384849..2385274 (+) 426 WP_017392607.1 type IV pilin protein Machinery gene
  FDQ49_RS11230 (FDQ49_11230) rpsP 2385421..2385672 (+) 252 WP_000260334.1 30S ribosomal protein S16 -
  FDQ49_RS11235 (FDQ49_11235) rimM 2385692..2386240 (+) 549 WP_004711694.1 ribosome maturation factor RimM -
  FDQ49_RS11240 (FDQ49_11240) trmD 2386287..2387042 (+) 756 WP_002051760.1 tRNA (guanosine(37)-N1)-methyltransferase TrmD -
  FDQ49_RS11245 (FDQ49_11245) rplS 2387258..2387626 (+) 369 WP_002051668.1 50S ribosomal protein L19 -
  FDQ49_RS11250 (FDQ49_11250) - 2387678..2388619 (-) 942 WP_080660964.1 triacylglycerol lipase -

Sequence


Protein


Download         Length: 1394 a.a.        Molecular weight: 149123.08 Da        Isoelectric Point: 7.3912

>NTDB_id=362242 FDQ49_RS11215 WP_022575990.1 2380173..2384357(+) (pilY1) [Acinetobacter nosocomialis M2]
MGNIMKLHFKPNKLWYAIYSTSMTFTWLMSSSVVQASDLQIYASPTAGKKTIVMMLDTSGSMGRTIQSGYSIYDDYGITS
CSTTYVNSTTTPSYRRYYCGVSSNTTNSKVTNLATGCEKQADNSYRCYDRLTRLKDGMFSFLENNNPIFNNVSVGLGHFS
TYSSGTTGDGSSGEILVPAANLGVVGSAQRVALKNAVAGLEASGGTPTANAYAEAAAYLMGTRTLSSTTANVAMYFKYYP
VLTQTSQTVYYACGNNSASGSTDNTKCTYTPTTIPSQSNLSNYSSNTTSSTNGNITTYTTYYYKTTGWFYPTTTYYYKTT
YSETKVTSSTPNYQSCTAYNSNGCTTWSAASLTNPTTDTYDTQCTVNTVAGTCVYQTKTILGTNSYSGFNSSVQSSKDPN
TNYTTYKSPLPPVSQRQSCDGQGVYILSDGEPTNNVNSSVLASALNLSSFSNSGGLSGGTNWDYMSNFAKALFNGGVIQS
DANNATNPANVSIQTAFVGFGSALNALSTTDAKNACKMSSRTQIDRSGDDACSPNQGTNAVSAPGYGNGGFFPTQSAQGV
TDSVIAFINNLGKVPLEPLTTGAISVPYDALNPKNLQEYGYLRAFEPDPANAYLTWRGNLKKYHVVLSGSNAGAFEANSG
GLVYDANGAFRSGTKDYWNSSSYNDGGKVFLGGAYSNVPLPTLGQPEEVNSQGNITKYYYAAKNKIRNLFTDVSTVATDG
SLTKISTTNTNLLKIPAAPAVNTNPFDTAANTASYVLGKFNSSTGQDVLKAFPVSLKLKLLNYLGYPTDITATALPSTLT
TSNAPYLSMGGSIHSFPVQLTYSGTLDQNGNLTSAREQSILYGSMEGGLHIVDASSGVEQMVFIPADILNDTVASKALVV
GQSDSTAPAQGMDGAWVSDPAYNITTTGSGSSAVSKVTAKQMNIYGGMRMGGSSYYGLNVLNPSSPQLMFRIGADQSDYS
RMGQSWSKPVLANIRYNGAIRRVMIVGGGYDQCYEKPNITLSDSCFSNGKAKGNAVYIIDAKTGERLWWASDTGSNNDNA
NMKHSIVSRISTLDRDGDGLVDHLYFGDLGGQVFRVDLNNNQTQTNSTYSGFGIRVVRLANLATNDTANDSGNDYTGGNA
PRFYEPPTVTIHDYGVRSFITVGIASGDRSTPLDVYPIIGREGMSPSTALSGRPVNNVYGIIDRDFIKKNLMSLSDSQLE
TKDLIRSNLRKNPQILRAGETSVGQVFFPSTGTGQAGWYRSLSSTSNGTEKADNSFRIKGGMKAFEEPIAITGNLIVPVY
DPQGTGIVASDPCLPRVVGETDRQTYCLPFGVCLNTDGSINHNKEDNSGFGTDGTKNLNVIGSGVRNITFVPSEDNPSPQ
NSCGKLKLSGNEQGTGQWQCTSYLIPARWYERYR

Nucleotide


Download         Length: 4185 bp        

>NTDB_id=362242 FDQ49_RS11215 WP_022575990.1 2380173..2384357(+) (pilY1) [Acinetobacter nosocomialis M2]
ATGGGGAATATCATGAAACTACATTTCAAACCAAATAAATTATGGTATGCCATCTATTCAACTTCGATGACATTTACATG
GCTTATGTCAAGTTCTGTAGTGCAGGCAAGTGATTTACAAATTTATGCCTCACCTACAGCAGGTAAAAAAACAATTGTTA
TGATGCTAGATACTTCTGGATCAATGGGGCGTACGATCCAGTCCGGTTATAGTATTTATGATGATTACGGTATTACAAGT
TGTTCAACTACTTATGTAAACTCAACAACTACTCCTAGTTATCGTCGGTATTATTGCGGGGTTTCCTCAAATACCACTAA
TAGCAAAGTAACTAATTTAGCAACCGGATGTGAGAAACAGGCAGATAATAGTTATCGTTGTTATGACCGGTTAACGCGCC
TAAAAGATGGTATGTTTTCTTTTTTAGAAAATAATAACCCGATATTTAATAATGTAAGTGTTGGATTGGGGCATTTCTCT
ACATACAGTAGTGGTACTACTGGTGATGGCAGTAGCGGCGAAATTTTGGTTCCAGCAGCAAATTTGGGCGTTGTAGGATC
TGCACAAAGAGTTGCATTAAAAAATGCTGTAGCTGGATTAGAAGCAAGTGGAGGAACGCCTACTGCTAATGCTTATGCTG
AGGCAGCAGCTTACTTAATGGGAACACGAACATTAAGCTCGACTACAGCAAATGTGGCAATGTATTTTAAATATTATCCA
GTTTTAACTCAGACGTCACAAACCGTTTACTATGCCTGTGGTAATAATTCAGCTAGTGGTTCAACAGATAATACAAAGTG
TACATATACGCCGACCACAATTCCAAGCCAGTCTAATTTATCTAACTATAGTAGTAATACTACAAGCTCAACAAATGGAA
ATATAACGACTTACACCACTTATTATTATAAAACGACTGGATGGTTTTATCCTACAACGACCTATTACTATAAAACTACT
TATAGTGAAACTAAAGTAACCAGTTCAACACCTAATTATCAAAGCTGTACAGCATATAACTCTAATGGATGTACTACTTG
GAGTGCGGCTTCTTTAACTAATCCAACAACAGATACCTATGATACGCAATGTACAGTTAATACAGTGGCTGGTACATGTG
TCTATCAAACTAAAACAATTTTAGGTACAAACAGTTATAGTGGATTTAATAGCTCCGTTCAAAGTTCAAAAGATCCAAAT
ACCAACTACACTACATATAAATCACCATTACCTCCCGTTAGTCAGCGTCAAAGTTGTGATGGACAAGGGGTTTATATTTT
GTCTGATGGTGAACCAACGAATAACGTAAACTCTTCTGTTTTAGCCTCTGCATTGAACCTTAGCTCGTTTAGTAATAGTG
GTGGACTGAGCGGTGGTACAAACTGGGATTATATGAGTAACTTCGCTAAGGCCTTATTTAATGGCGGGGTAATTCAAAGT
GATGCTAATAATGCTACAAACCCAGCTAATGTATCTATACAAACTGCTTTTGTTGGTTTTGGTTCAGCATTAAATGCCTT
GAGCACTACAGATGCGAAGAACGCTTGTAAAATGAGTTCTCGTACTCAAATAGATAGAAGTGGAGATGATGCATGTTCTC
CAAATCAGGGAACAAATGCTGTATCTGCCCCAGGTTATGGTAATGGTGGATTTTTTCCTACCCAAAGTGCGCAGGGAGTA
ACCGATAGCGTAATTGCATTTATTAATAACTTAGGGAAGGTGCCTTTAGAGCCTCTTACTACTGGTGCGATTTCTGTACC
TTATGATGCATTAAATCCTAAAAACCTACAGGAATATGGTTATTTACGGGCTTTTGAGCCAGATCCGGCAAATGCTTATC
TGACTTGGCGAGGTAACTTAAAAAAATATCATGTGGTTTTATCTGGTTCAAATGCTGGTGCATTTGAAGCGAACTCTGGT
GGGCTGGTCTATGATGCAAATGGAGCATTCCGTAGTGGTACAAAAGATTATTGGAATAGTTCTAGTTATAATGATGGCGG
AAAAGTTTTCTTAGGAGGAGCCTATAGCAATGTACCATTACCAACTTTAGGTCAGCCAGAAGAGGTAAATTCTCAAGGTA
ATATTACTAAATATTATTATGCTGCAAAAAATAAAATCCGAAATCTGTTTACTGATGTTTCTACCGTGGCTACAGATGGT
AGTTTAACTAAAATATCAACTACAAATACTAACTTACTGAAAATTCCTGCTGCACCTGCTGTAAATACAAATCCTTTTGA
TACAGCGGCTAATACTGCAAGTTATGTGCTAGGAAAATTTAATAGCTCTACGGGACAAGATGTTTTGAAGGCATTTCCTG
TGAGCTTAAAATTAAAATTATTAAATTACTTGGGTTATCCGACAGATATTACGGCGACAGCTCTACCAAGCACATTAACC
ACATCAAATGCCCCTTATTTATCGATGGGCGGGAGTATCCACTCTTTTCCGGTACAGCTGACTTATAGTGGTACTTTAGA
TCAAAATGGAAATTTAACCAGTGCACGTGAGCAATCTATTCTTTACGGAAGTATGGAAGGAGGATTGCATATCGTAGATG
CCTCTTCTGGGGTTGAGCAAATGGTATTTATCCCAGCCGATATATTAAATGATACGGTAGCGTCTAAAGCTTTAGTGGTA
GGGCAAAGTGATTCAACAGCCCCTGCTCAAGGTATGGATGGAGCATGGGTATCAGATCCAGCATATAACATTACTACTAC
AGGTTCAGGTAGTTCAGCAGTATCAAAAGTTACTGCTAAACAAATGAATATTTATGGTGGAATGCGTATGGGGGGCAGTA
GCTATTATGGATTAAATGTATTAAATCCAAGTTCCCCACAATTAATGTTTAGAATAGGTGCAGATCAATCTGACTATAGC
CGTATGGGGCAAAGCTGGTCTAAGCCTGTACTTGCAAATATTCGTTATAACGGTGCGATTAGACGTGTCATGATTGTTGG
TGGTGGTTATGATCAATGCTACGAGAAACCAAATATCACGTTGAGTGACTCATGCTTTAGCAATGGTAAGGCAAAAGGAA
ATGCTGTCTATATTATCGATGCTAAAACCGGAGAGCGTTTATGGTGGGCCAGTGATACAGGGTCCAATAATGATAACGCT
AATATGAAGCATAGTATTGTTAGCCGTATTAGTACTTTGGACCGAGATGGTGATGGCTTAGTAGATCATTTGTATTTTGG
CGATTTAGGTGGACAAGTCTTTCGTGTAGATCTTAATAATAATCAGACTCAAACTAACTCGACTTATAGTGGTTTCGGTA
TAAGAGTTGTGAGATTGGCAAATCTGGCAACAAACGATACAGCAAATGATAGTGGAAATGATTACACAGGTGGAAATGCC
CCTCGTTTTTATGAGCCGCCTACCGTAACAATTCATGATTATGGTGTTCGCTCATTTATTACTGTTGGTATCGCTTCTGG
TGATCGAAGTACACCACTAGATGTTTATCCAATTATTGGCCGGGAAGGTATGTCTCCTAGTACGGCGCTTAGTGGTCGAC
CTGTAAATAATGTTTATGGAATTATTGATAGAGACTTTATCAAAAAGAATTTAATGTCGTTAAGTGATAGTCAGCTTGAA
ACTAAGGATCTTATTCGCTCAAATCTGCGGAAAAACCCACAAATATTAAGAGCAGGCGAAACGAGCGTAGGGCAAGTTTT
CTTTCCAAGTACGGGTACAGGTCAAGCAGGTTGGTACCGATCACTTTCTAGTACGAGCAATGGTACTGAAAAAGCTGATA
ATAGCTTCCGTATTAAAGGGGGAATGAAAGCCTTTGAAGAACCAATAGCAATTACAGGTAATTTAATAGTTCCTGTCTAT
GATCCTCAAGGAACGGGTATTGTTGCGTCAGACCCTTGTTTACCACGTGTAGTAGGTGAAACAGATCGACAAACTTATTG
TCTGCCATTTGGAGTGTGTCTGAATACTGATGGATCGATTAATCATAATAAGGAAGATAATAGTGGTTTTGGGACAGATG
GAACAAAGAATCTAAATGTAATTGGTTCAGGGGTTCGCAATATTACGTTTGTGCCTAGTGAAGATAACCCATCTCCGCAA
AACAGTTGTGGAAAATTAAAGCTATCTGGTAATGAGCAAGGAACAGGGCAGTGGCAATGCACAAGCTATTTGATCCCTGC
ACGCTGGTATGAGCGTTATCGCTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilY1 Acinetobacter baumannii D1279779

76.4

78.121

0.597


Multiple sequence alignment