Detailed information    

In silico: bioinformatically predicted

Overview


Name: comP
Type: Machinery gene
Locus tag: G4V49_RS06020
Genome accession: NZ_CP048911
Coordinates: 1152196..1152645 (-)
Length: 149 a.a.
NCBI ID: WP_002214937.1
Uniprot ID: A0AA44U8B7
Organism: Neisseria gonorrhoeae strain SRRSH204
Function: DNA binding; DNA uptake; receptor of DNA uptake sequence (DUS) (predicted from homology)
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 1148671..1159925 1152196..1152645 within 0
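The "Relative position" and "Distance (bp)" columns can be reproduced from the two coordinate ranges. The following is a minimal sketch, not VRprofile2's actual logic: the function name and the "upstream", "downstream", and "overlapping" labels are assumptions made for illustration, and only the "within" case with distance 0 is confirmed by this record.

```python
def relative_position(gene, mge):
    """Classify a gene's position relative to an MGE region.

    Both arguments are (start, end) tuples using 1-based inclusive
    coordinates, as written in the record (e.g. 1152196..1152645).
    Labels other than "within" are hypothetical.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0                      # gene fully inside the MGE
    if g_end < m_start:
        return "upstream", m_start - g_end      # gap before the MGE
    if g_start > m_end:
        return "downstream", g_start - m_end    # gap after the MGE
    return "overlapping", 0                     # partial overlap


# comP versus the genomic island listed above
print(relative_position((1152196, 1152645), (1148671, 1159925)))
```

For this entry the gene range lies entirely inside the island range, so the sketch returns "within" with a distance of 0, matching the summary row.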


Gene organization within MGE regions


Location: 1148671..1159925
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  G4V49_RS12135 - 1148671..1149267 (-) 597 WP_012503916.1 TIGR02391 family protein -
  G4V49_RS12140 - 1149457..1150314 (-) 858 WP_012503917.1 ATP-binding protein -
  G4V49_RS06005 - 1150328..1150699 (-) 372 WP_012503918.1 DNA cytosine methyltransferase -
  G4V49_RS06010 - 1150696..1151304 (-) 609 WP_080229275.1 DNA cytosine methyltransferase -
  G4V49_RS06020 comP 1152196..1152645 (-) 450 WP_002214937.1 type IV pilin protein Machinery gene
  G4V49_RS06025 comE 1152733..1153128 (-) 396 WP_003703428.1 helix-hairpin-helix domain-containing protein Machinery gene
  G4V49_RS12145 - 1153230..1153349 (-) 120 Protein_1191 competence protein ComE -
  G4V49_RS06055 - 1158974..1159627 (-) 654 Protein_1192 TIGR01621 family pseudouridine synthase -
  G4V49_RS06060 - 1159581..1159904 (-) 324 WP_003689613.1 HAD hydrolase family protein -

Sequence


Protein


Length: 149 a.a.        Molecular weight: 16834.81 Da        Isoelectric point: 9.7951

>NTDB_id=424096 G4V49_RS06020 WP_002214937.1 1152196..1152645(-) (comP) [Neisseria gonorrhoeae strain SRRSH204]
MTDNRGFTLVELISVVLILSVLALIVYPSYRNYVEKAKINAVRAALLENAHFMEKFYLQNGRFKQTSTKWPSLPIKEAEG
FCIRLNGIARGALDSKFMLKAVAIDKDKNPFIIKMNENLVTFICKKSASSCSDGLDYFKGNDKDCKLLK

Nucleotide


Length: 450 bp

>NTDB_id=424096 G4V49_RS06020 WP_002214937.1 1152196..1152645(-) (comP) [Neisseria gonorrhoeae strain SRRSH204]
ATGACTGATAATCGGGGGTTTACGCTGGTTGAATTAATATCAGTGGTCTTGATATTGTCTGTACTTGCTTTAATTGTTTA
TCCGAGCTATCGCAATTATGTTGAGAAAGCAAAGATAAATGCAGTGCGGGCAGCCTTGTTAGAAAATGCACATTTTATGG
AAAAGTTTTATCTGCAGAATGGGAGATTTAAACAAACATCTACCAAATGGCCAAGTTTGCCGATTAAAGAGGCAGAAGGC
TTTTGTATCCGTTTGAATGGAATCGCGCGCGGGGCTTTAGACAGTAAATTCATGTTGAAGGCGGTAGCCATAGATAAAGA
TAAAAATCCTTTTATTATTAAGATGAATGAAAATCTAGTAACCTTTATTTGCAAGAAGTCCGCCAGTTCGTGTAGTGACG
GGCTGGATTATTTTAAAGGAAATGATAAGGACTGCAAGTTACTTAAGTAG
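The stated lengths are mutually consistent: the coordinate span 1152196..1152645 covers 450 bp, which equals 3 × (149 + 1), that is, 149 codons plus one stop codon. A minimal sketch of that sanity check follows. The function name `check_cds` is hypothetical, and the check assumes an ATG start and a standard stop codon, which holds for this entry but not for every bacterial gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_cds(nt_seq, protein_length, start, end):
    """Check a coding sequence against its annotated protein length
    and its 1-based inclusive genome coordinates."""
    seq = "".join(nt_seq.split()).upper()   # drop FASTA line breaks
    return {
        # span implied by the coordinates matches the sequence length
        "span_ok": end - start + 1 == len(seq),
        # one codon per residue plus a terminal stop codon
        "length_ok": len(seq) == 3 * (protein_length + 1),
        # assumption: canonical ATG start (GTG/TTG starts would fail here)
        "start_ok": seq.startswith("ATG"),
        "stop_ok": seq[-3:] in STOP_CODONS,
    }


# Toy example: Met-Lys followed by a stop codon, placed at positions 1..9
print(check_cds("ATGAAA\nTAG", protein_length=2, start=1, end=9))
```

Passing the nucleotide FASTA body above with `protein_length=149`, `start=1152196`, and `end=1152645` applies the same check to this record.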


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comP Neisseria gonorrhoeae MS11 100 100 1
  comP Neisseria meningitidis 8013 99.329 100 0.993
  comP Neisseria subflava NJ9703 49.66 98.658 0.49
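
The Ha-values listed above are numerically consistent with the product of the identity and coverage fractions, rounded to three decimals. This page does not define the Ha-value, so the formula below is an inference from the three listed rows rather than the database's documented method, and the function name `ha_value` is hypothetical.

```python
def ha_value(identity_pct, coverage_pct):
    """Assumed formula: (identity / 100) * (coverage / 100), rounded to 3 decimals."""
    return round(identity_pct * coverage_pct / 10000, 3)


# (organism, identity %, coverage %) as listed in the record
hits = [
    ("Neisseria gonorrhoeae MS11", 100, 100),
    ("Neisseria meningitidis 8013", 99.329, 100),
    ("Neisseria subflava NJ9703", 49.66, 98.658),
]

for organism, identity, coverage in hits:
    print(organism, ha_value(identity, coverage))
```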