Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   EGH11_RS00325 Genome accession   NZ_CP034018
Coordinates   53009..56170 (-) Length   1053 a.a.
NCBI ID   WP_124693316.1    Uniprot ID   -
Organism   Neisseria gonorrhoeae strain FQ82     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 38561..52357 53009..56170 flank 652


Gene organization within MGE regions


Location: 38561..56170
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EGH11_RS00235 (EGH11_00235) orn 38940..39503 (-) 564 WP_003687260.1 oligoribonuclease -
  EGH11_RS00240 (EGH11_00240) prmA 39521..40408 (-) 888 WP_033910287.1 50S ribosomal protein L11 methyltransferase -
  EGH11_RS00245 (EGH11_00245) accC 40512..41873 (-) 1362 WP_003687264.1 acetyl-CoA carboxylase biotin carboxylase subunit -
  EGH11_RS00250 (EGH11_00250) accB 41985..42404 (-) 420 WP_003690508.1 acetyl-CoA carboxylase biotin carboxyl carrier protein -
  EGH11_RS00255 (EGH11_00255) - 42386..42643 (-) 258 WP_003687268.1 hypothetical protein -
  EGH11_RS00260 (EGH11_00260) queA 42760..43800 (+) 1041 WP_003687270.1 tRNA preQ1(34) S-adenosylmethionine ribosyltransferase-isomerase QueA -
  EGH11_RS00265 (EGH11_00265) carB 44001..47216 (-) 3216 WP_050303823.1 carbamoyl-phosphate synthase large subunit -
  EGH11_RS00270 (EGH11_00270) - 47223..47771 (-) 549 WP_003687274.1 hypothetical protein -
  EGH11_RS00275 (EGH11_00275) - 47774..48337 (-) 564 WP_002214615.1 phosphoribosyltransferase family protein -
  EGH11_RS00290 (EGH11_00290) - 49240..49629 (-) 390 WP_003687278.1 endonuclease domain-containing protein -
  EGH11_RS13970 - 49792..49959 (-) 168 WP_003687280.1 glyoxalase -
  EGH11_RS00295 (EGH11_00295) - 49887..50171 (-) 285 WP_003687282.1 VOC family protein -
  EGH11_RS00300 (EGH11_00300) carA 50225..51358 (-) 1134 WP_003687283.1 glutamine-hydrolyzing carbamoyl-phosphate synthase small subunit -
  EGH11_RS00315 (EGH11_00315) - 52073..52357 (-) 285 WP_003701816.1 GIY-YIG nuclease family protein -
  EGH11_RS00325 (EGH11_00325) pilC 53009..56170 (-) 3162 WP_124693316.1 PilC family type IV pilus tip adhesin Machinery gene

Sequence


Protein


Download         Length: 1053 a.a.        Molecular weight: 115427.59 Da        Isoelectric Point: 9.7969

>NTDB_id=328074 EGH11_RS00325 WP_124693316.1 53009..56170(-) (pilC) [Neisseria gonorrhoeae strain FQ82]
MNKTLKRRVFRHTALYAAILMFSHTGGGGAMAQTSNYAIIMNERKQPEVKWEGQYNQSALKDKSRERTFSHTSQKNSLGR
TSNFISFNNNDTLVSQQSGTAVFGTATYLPPYGKVSGFDTDSLKGRANAAGWIRTTRPGLAGYAYTGIRCGHARDCPKLT
YKTRFSFDNPNLAKTGGRLDRHTESSRENSPIYKLKDYPWLGVSFNLGAEGTAKDGRSSSRLISSFDENNSNSNQNLVYT
TEGRDISLGNWQSESTAVAYYLNAKLHLLDKKKIKDITGKTVQLGVLKPSIDVKTQNTGLAGLLNFWSKWDIKDNGQIPV
KLGLPEVKAGRCTNKPNPNNNTKAPSPALTAPALWFGPVQNGKVQMYSASVSTYPGSSSSRIFLQELKTQTDPARPGRHS
LAALNARDIKSREPNFNSRQTVIRLPGGVYRIAPTRDRIVGLNGNDGKNDTFGIYKERLVTPDDDEWAKVLLPWTVRYYG
NDDIFKTFNQPNNKKQSDKKQYSQKYRIRTKEDDNDKPRDLGDIVNSPIVAVDGYLATSANDGMVHLFKRNGTDQRGYEL
KLSYIPGTMPRQYFDNDTSALQDSDLAKELRTFAEKGYVGDRYGVDGGFVLRRITDDQDRQKHFFMFGAMGFGGRGAYAL
DLSKIDSSNLTGVSMFDVKDGDNNGKNRVEVKLGYTVGTPQIGKTQNGKYAAFLASGYAAKDIVSSDNTTALYVYDLKDT
LGTPIAKIEVQGGKGGLSSPTLVDKDLDGIVDIAYAGDRGGNMYRFDLSNSDPNKWSAKAIFEGTKPITSAPAVSRLADK
RVVIFGTGSDLSEQDVVGTDQQYIYGIFDDDKPTVNVKVTNGTGGGLLEQVLSEENKILFLINNKASGGSADKGWVVKLR
EGERVTVKPTVVLRTAFVTIRKYNDGGCGAETAILGINTADGGALTPRSARPIVPDHDSVAQYSGHKKTAGGKSVPIGCM
WKNSKTVCPNGYVYDKPVNVRYLDETETDGFSTTADGDAGGSGIDPAGRRPGKNNRCFSKKGVRTLLMNDLDSLDITGPM
CGIKRLSWREVFF

Nucleotide


Download         Length: 3162 bp        

>NTDB_id=328074 EGH11_RS00325 WP_124693316.1 53009..56170(-) (pilC) [Neisseria gonorrhoeae strain FQ82]
ATGAATAAAACTTTAAAAAGGCGGGTTTTCCGCCATACCGCGCTTTATGCCGCCATCTTGATGTTTTCCCATACCGGCGG
GGGGGGGGCGATGGCGCAAACCAGTAACTACGCTATTATCATGAACGAGCGAAAGCAGCCCGAGGTAAAGTGGGAGGGTC
AATATAATCAATCAGCATTAAAGGACAAAAGCAGGGAGCGGACATTTAGCCATACGAGCCAGAAAAACAGCCTCGGCAGG
ACAAGCAATTTTATCTCATTCAACAATAACGATACCCTTGTTTCTCAACAAAGCGGTACTGCCGTTTTTGGCACAGCCAC
CTACCTGCCGCCCTACGGCAAGGTTTCCGGTTTTGATACCGATAGTCTGAAAGGGCGCGCCAATGCCGCCGGTTGGATTC
GTACCACCCGGCCCGGGCTGGCAGGCTACGCCTACACCGGTATCCGTTGCGGACATGCCCGAGACTGTCCCAAACTTACC
TATAAAACCCGATTTTCCTTCGATAATCCCAACTTGGCAAAAACAGGAGGCAGGCTGGATAGGCACACAGAGTCAAGCCG
CGAAAATTCGCCCATTTACAAATTGAAGGATTATCCATGGTTGGGCGTGTCTTTCAATTTGGGCGCCGAGGGTACCGCCA
AAGATGGCAGATCATCCAGCAGATTGATATCTTCTTTTGATGAAAACAATAGTAATAGTAATCAAAACCTCGTCTATACC
ACGGAAGGCCGCGATATTTCCTTGGGCAACTGGCAGAGCGAAAGTACCGCCGTGGCCTATTATCTGAACGCCAAGCTGCA
CCTGCTGGACAAAAAAAAGATTAAAGATATCACCGGCAAAACAGTGCAGTTGGGTGTCTTGAAGCCGAGCATCGATGTGA
AGACACAAAATACGGGGCTTGCCGGCTTGCTAAATTTTTGGTCTAAGTGGGACATTAAAGATAACGGGCAGATTCCGGTC
AAGCTCGGCCTGCCGGAAGTCAAAGCCGGGCGCTGCACCAACAAACCGAACCCCAATAATAATACCAAAGCCCCTTCGCC
GGCACTGACCGCCCCCGCGCTGTGGTTCGGCCCTGTGCAAAATGGCAAGGTGCAGATGTATTCCGCTTCGGTTTCCACCT
ACCCCGGCAGCTCGAGCAGCCGCATCTTCCTCCAAGAGCTGAAAACTCAAACCGACCCCGCCCGGCCCGGCCGGCATTCC
CTCGCCGCTTTGAATGCGCGGGATATCAAATCCCGCGAGCCGAATTTCAACTCAAGGCAGACCGTGATCCGATTGCCGGG
CGGCGTGTACCGGATCGCCCCGACTCGCGACAGGATCGTGGGTTTGAATGGCAATGACGGCAAAAACGACACTTTCGGCA
TCTACAAGGAAAGGTTAGTCACACCTGATGACGACGAGTGGGCAAAAGTGCTGCTGCCTTGGACGGTCCGGTATTACGGT
AATGACGATATATTTAAAACATTCAACCAACCAAACAACAAAAAACAAAGCGACAAAAAACAATACAGCCAAAAATACCG
CATCCGCACAAAAGAAGATGACAATGACAAACCCCGCGATTTGGGCGACATCGTCAACAGCCCGATTGTCGCGGTCGACG
GGTATTTGGCAACTTCTGCCAACGACGGGATGGTGCACCTGTTTAAAAGAAACGGCACAGACCAACGAGGCTACGAACTG
AAGCTCAGCTACATCCCCGGCACGATGCCGCGCCAATATTTTGATAACGACACTTCCGCTCTCCAAGACTCCGACCTCGC
CAAAGAGCTGCGCACCTTTGCCGAAAAAGGCTATGTGGGCGACCGCTACGGCGTGGACGGCGGCTTTGTCTTGCGCCGCA
TTACAGATGACCAGGACAGGCAAAAACATTTCTTTATGTTTGGTGCGATGGGTTTTGGCGGCAGAGGCGCGTATGCCTTG
GATTTAAGCAAAATCGACAGCAGCAACCTGACCGGCGTTTCCATGTTTGATGTCAAAGATGGCGATAATAACGGCAAAAA
TCGCGTGGAAGTGAAATTAGGCTACACCGTCGGCACGCCGCAAATCGGCAAAACCCAAAACGGCAAATACGCCGCCTTCC
TCGCCTCCGGTTATGCGGCTAAAGATATTGTCAGCAGCGATAATACAACCGCGCTGTATGTGTATGATTTGAAAGACACC
TTAGGTACGCCGATTGCAAAAATCGAAGTGCAGGGCGGCAAAGGCGGGCTTTCGTCCCCCACGCTGGTGGATAAAGATTT
GGACGGCATTGTCGATATCGCCTATGCCGGCGACCGGGGCGGCAATATGTACCGCTTTGATTTGAGCAATTCCGATCCTA
ATAAATGGTCTGCAAAGGCTATTTTCGAAGGCACAAAACCGATTACTTCCGCGCCCGCCGTTTCCCGACTGGCAGACAAA
CGCGTCGTCATCTTCGGCACGGGCAGCGATTTGAGTGAACAGGATGTAGTCGGTACGGATCAACAATATATTTACGGTAT
CTTTGACGACGATAAGCCGACGGTTAATGTAAAGGTAACAAACGGCACGGGAGGCGGGCTGCTCGAGCAAGTGCTTAGCG
AGGAAAATAAAATCTTGTTCCTGATAAATAATAAGGCATCCGGCGGATCGGCCGATAAAGGCTGGGTAGTGAAATTGAGG
GAAGGAGAACGCGTTACCGTCAAACCGACCGTGGTATTGCGTACCGCCTTCGTAACCATCCGCAAATATAACGACGGCGG
CTGCGGCGCGGAAACCGCCATTTTGGGCATCAATACCGCCGACGGCGGCGCATTGACTCCGAGAAGCGCGCGCCCGATTG
TGCCGGATCACGATTCGGTTGCGCAATATTCCGGCCATAAGAAAACCGCCGGCGGCAAGTCCGTCCCCATAGGCTGCATG
TGGAAAAACAGCAAAACCGTCTGCCCGAACGGATATGTTTACGACAAACCGGTTAATGTGCGTTATCTGGATGAAACGGA
AACAGACGGATTTTCAACGACGGCGGACGGCGATGCGGGCGGCAGCGGTATAGACCCCGCCGGCAGGCGTCCCGGCAAAA
ACAACCGCTGCTTCTCCAAAAAAGGGGTGCGCACCCTGCTGATGAACGATTTGGACAGCTTGGATATTACCGGCCCGATG
TGCGGTATCAAACGCTTAAGCTGGCGCGAAGTCTTCTTCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Neisseria meningitidis A1493

69.303

100

0.699


Multiple sequence alignment