Detailed information
Overview
| Name | comA | Type | Machinery gene |
| Locus tag | OGY80_RS03005 | Genome accession | NZ_OX336253 |
| Coordinates | 630472..632697 (-) | Length | 741 a.a. |
| NCBI ID | WP_263337382.1 | Uniprot ID | - |
| Organism | Neisseria sp. Marseille-Q5346 | ||
| Function | DNA processing; DNA transport into the cytoplasm (predicted from homology); DNA binding and uptake |
Related MGE
Note: This gene co-localizes with a putative mobile genetic element (MGE) that VRprofile2 predicts in the genome, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Genomic island | 625657..674734 | 630472..632697 | within | 0 |
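The relative position and distance in the summary table follow from comparing the two coordinate ranges. The sketch below is one way to reproduce that comparison, assuming 1-based inclusive coordinates. The function name `relate` and the labels other than "within" are hypothetical and are not taken from VRprofile2.

```python
def relate(gene, mge):
    """Classify a gene interval against an MGE interval.

    Both arguments are (start, end) tuples, assumed 1-based and inclusive.
    Returns (relative_position, distance_in_bp).
    Only the label "within" appears in the record; the other labels
    are illustrative assumptions.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        # Gene lies entirely before the MGE on the coordinate axis.
        return "before", m_start - g_end
    if g_start > m_end:
        # Gene lies entirely after the MGE on the coordinate axis.
        return "after", g_start - m_end
    # Partial overlap: the intervals share some positions.
    return "overlapping", 0


# comA (630472..632697) against the genomic island (625657..674734)
print(relate((630472, 632697), (625657, 674734)))  # → ('within', 0)
```

Applied to the values in the table, this gives the same "within" classification and a distance of 0 bp.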
Gene organization within MGE regions
Location: 625657..674734
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| OGY80_RS02990 | - | 625657..626259 (-) | 603 | WP_263337375.1 | DNA glycosylase | - |
| OGY80_RS02995 | - | 626296..626916 (-) | 621 | WP_049328503.1 | phosphoribosylanthranilate isomerase | - |
| OGY80_RS03000 | - | 627019..630156 (-) | 3138 | WP_263337380.1 | Mbeg1-like protein | - |
| OGY80_RS03005 | comA | 630472..632697 (-) | 2226 | WP_263337382.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| OGY80_RS03010 | - | 632755..633807 (-) | 1053 | WP_263337385.1 | RHS repeat protein | - |
| OGY80_RS03015 | - | 634085..634624 (-) | 540 | WP_263337388.1 | hypothetical protein | - |
| OGY80_RS03020 | - | 634621..636756 (-) | 2136 | WP_263337391.1 | RHS repeat-associated core domain-containing protein | - |
| OGY80_RS03025 | - | 636901..637482 (-) | 582 | WP_058948182.1 | Imm26 family immunity protein | - |
| OGY80_RS03030 | - | 637513..641637 (-) | 4125 | WP_263337394.1 | DUF6531 domain-containing protein | - |
| OGY80_RS03035 | tagF | 641971..642525 (+) | 555 | WP_263337396.1 | type VI secretion system-associated protein TagF | - |
| OGY80_RS03040 | tssA | 642530..643588 (+) | 1059 | WP_263337399.1 | type VI secretion system protein TssA | - |
| OGY80_RS03045 | tssB | 643665..644183 (+) | 519 | WP_070606732.1 | type VI secretion system contractile sheath small subunit | - |
| OGY80_RS03050 | tssC | 644215..645714 (+) | 1500 | WP_070606729.1 | type VI secretion system contractile sheath large subunit | - |
| OGY80_RS03055 | - | 645795..646277 (+) | 483 | WP_003761730.1 | type VI secretion system tube protein Hcp | - |
| OGY80_RS03060 | - | 646443..647258 (+) | 816 | WP_263337409.1 | type VI secretion system accessory protein TagJ | - |
| OGY80_RS03065 | tssE | 647292..647804 (+) | 513 | WP_263337410.1 | type VI secretion system baseplate subunit TssE | - |
| OGY80_RS03070 | - | 647801..649777 (+) | 1977 | WP_263337411.1 | type VI secretion system baseplate subunit TssF | - |
| OGY80_RS03075 | tssG | 649774..650814 (+) | 1041 | WP_263337412.1 | type VI secretion system baseplate subunit TssG | - |
| OGY80_RS03080 | tssH | 650899..653550 (+) | 2652 | WP_263337413.1 | type VI secretion system ATPase TssH | - |
| OGY80_RS03085 | tssJ | 653686..654309 (+) | 624 | WP_150538119.1 | type VI secretion system lipoprotein TssJ | - |
| OGY80_RS03090 | tssK | 654343..655686 (+) | 1344 | WP_263337415.1 | type VI secretion system baseplate subunit TssK | - |
| OGY80_RS03095 | tssL | 655711..656973 (+) | 1263 | WP_263337417.1 | type VI secretion system protein TssL, long form | - |
| OGY80_RS03100 | tssM | 657005..660562 (+) | 3558 | WP_263337420.1 | type VI secretion system membrane subunit TssM | - |
| OGY80_RS03105 | tssI | 660693..662966 (+) | 2274 | WP_150538115.1 | type VI secretion system tip protein TssI/VgrG | - |
| OGY80_RS03110 | - | 662983..664119 (+) | 1137 | WP_263337424.1 | DUF2169 domain-containing protein | - |
| OGY80_RS03115 | - | 664201..668346 (+) | 4146 | WP_263337426.1 | DUF6531 domain-containing protein | - |
| OGY80_RS03120 | - | 668795..672961 (+) | 4167 | WP_263341825.1 | RHS repeat-associated core domain-containing protein | - |
| OGY80_RS03125 | - | 672958..673251 (+) | 294 | WP_070695324.1 | hypothetical protein | - |
| OGY80_RS03130 | - | 673254..673646 (+) | 393 | WP_263337439.1 | hypothetical protein | - |
| OGY80_RS03135 | - | 673900..674385 (+) | 486 | WP_150538093.1 | hypothetical protein | - |
| OGY80_RS03140 | - | 674411..674734 (+) | 324 | WP_070588183.1 | sel1 repeat family protein | - |
Sequence
Protein
Length: 741 a.a.; Molecular weight: 80938.44 Da; Isoelectric point: 9.8394
>NTDB_id=1154746 OGY80_RS03005 WP_263337382.1 630472..632697(-) (comA) [Neisseria sp. Marseille-Q5346]
MAWRFVLPAWVVGVVLSFALPFVPPWWVWLAVIIGALALCRKFAVAWLVLAVCVGMAFGVWRTGLVLAAQWPVGEASAKV
LTVEVADMPRDDGRRVQFAARAWDERGQAFDLMLSDYQRRDWAVGSRWSVSARVRPVIGEVNARGLNRETWALANGIDGM
GTLGRDRKLMRQGGGFGLANMRDAVSRSWQGTADKYPEFSDGIGLMRALSIGEQSALRPPLWQAFRPLGLTHLVSISGLH
VTMVAVLFAWLIKRIFRYLPWIPAKPRVWILAGGVASALFYALLAGFSVPTQRSVLMLAAFAWAWWHGSGGSGWTAWWQA
LAVVLLLDPLAVLGVGTWLSFGLVAALIWASSGRLKESGWRLAVRGQWAATVLSVVLLGYLFASLPLLSPLVNALAIPWF
SWVLTPLALLGSVFSFELVQLVAVFLAEYTLRGLVWLATVSPEFAVAAAPVPLLVLAMMAALLLLLPKGMGLKPWACLVL
LSFVFYRPAKLEEGVARVTVMDAGQGLSVLIQTRNRNLLFDTGTEQVAQTGIVPSLNAMGVRRLDSLILSHHDIDHDGGF
QSVAAVGTDKLLAGQPEFYPNAEFCQEDKWQWDGVDFELLRPSENAGKEDNDRSCVLRVVANGKALLITGDLGVKGEAGL
IEKYGNALYSQVLVLGHHGSSTASSGSFLHTVSPQYAVASSGYANAYKHPTVAVQNRVRAHGITLLRTDLSGALVFALGQ
DDIFQGRLKKDKFYWQKKPFE
Nucleotide
Length: 2226 bp
>NTDB_id=1154746 OGY80_RS03005 WP_263337382.1 630472..632697(-) (comA) [Neisseria sp. Marseille-Q5346]
ATGGCGTGGCGGTTTGTGTTGCCTGCTTGGGTGGTGGGTGTAGTGCTTTCGTTTGCGTTGCCGTTTGTGCCGCCATGGTG
GGTTTGGTTGGCTGTAATTATTGGCGCGCTTGCGTTGTGTCGAAAATTTGCTGTAGCTTGGTTGGTGCTGGCCGTCTGCG
TGGGCATGGCGTTTGGAGTGTGGCGGACAGGTCTGGTTTTGGCTGCTCAATGGCCTGTGGGGGAGGCATCGGCTAAAGTT
TTGACGGTTGAAGTGGCGGATATGCCGCGTGATGATGGACGGCGTGTGCAGTTTGCGGCAAGGGCTTGGGATGAACGTGG
GCAGGCTTTTGATTTGATGTTGTCGGATTATCAGCGGCGCGATTGGGCGGTGGGCAGCAGGTGGTCTGTATCGGCGCGGG
TAAGGCCGGTGATCGGTGAAGTGAATGCGCGCGGTTTGAACCGTGAAACATGGGCTTTGGCCAATGGTATCGATGGCATG
GGTACTTTGGGACGCGATAGGAAACTTATGCGGCAAGGCGGCGGTTTCGGTTTGGCTAATATGCGTGATGCGGTAAGCCG
AAGTTGGCAGGGAACGGCGGACAAATATCCTGAGTTTTCAGACGGCATAGGGCTGATGCGTGCTTTGAGTATTGGCGAGC
AATCGGCGTTGCGGCCGCCTTTGTGGCAGGCATTCCGGCCTTTGGGGCTGACGCACTTGGTCAGTATTTCGGGCTTGCAT
GTAACGATGGTGGCGGTATTGTTTGCCTGGCTGATCAAGCGAATTTTCCGTTATTTGCCGTGGATTCCGGCGAAACCGCG
TGTGTGGATATTGGCAGGTGGTGTTGCTAGTGCTTTGTTTTATGCGCTTTTGGCCGGTTTTTCCGTACCGACGCAACGCA
GTGTGTTGATGTTGGCTGCGTTTGCGTGGGCATGGTGGCACGGAAGTGGCGGCTCGGGCTGGACGGCATGGTGGCAGGCT
TTGGCCGTTGTTTTGTTGTTGGATCCGTTGGCAGTATTGGGCGTGGGAACGTGGTTGTCTTTCGGCTTGGTTGCTGCTTT
GATTTGGGCTTCATCAGGCCGTCTGAAAGAGTCGGGTTGGCGTTTGGCTGTACGGGGGCAATGGGCGGCAACGGTGTTGT
CGGTGGTATTGCTGGGCTATTTGTTTGCTTCGTTACCTTTGCTCAGTCCGTTGGTTAATGCTCTGGCTATTCCTTGGTTT
TCTTGGGTGTTGACGCCGTTGGCTTTGCTGGGTTCGGTTTTTTCTTTTGAACTTGTCCAGTTGGTAGCGGTATTTTTGGC
GGAATATACTTTGCGTGGTTTGGTATGGCTGGCTACGGTATCGCCTGAATTTGCCGTCGCAGCGGCGCCTGTACCTTTGC
TGGTGTTGGCGATGATGGCGGCTTTGTTGTTATTGTTGCCTAAAGGAATGGGCTTGAAACCTTGGGCATGTCTGGTTTTA
TTGAGCTTTGTGTTTTACCGTCCTGCCAAGCTGGAGGAGGGGGTGGCAAGAGTTACGGTGATGGATGCAGGACAAGGTTT
GTCGGTGTTGATACAAACGCGTAACCGCAATCTTTTGTTTGATACGGGAACGGAGCAAGTGGCGCAAACAGGTATTGTGC
CGAGTTTGAACGCGATGGGCGTGCGCCGTTTGGACAGCCTGATTTTGTCGCACCATGATATTGACCATGACGGCGGTTTT
CAAAGTGTAGCGGCTGTCGGTACCGATAAATTGCTTGCCGGACAACCTGAGTTTTATCCGAATGCAGAGTTTTGCCAAGA
AGACAAATGGCAATGGGACGGCGTAGATTTCGAGTTGCTCAGGCCGTCTGAAAATGCTGGTAAGGAAGATAATGACCGAA
GCTGCGTATTGCGTGTCGTAGCAAACGGCAAAGCCTTGTTGATAACCGGCGATTTGGGCGTGAAGGGCGAGGCCGGTTTG
ATTGAGAAATACGGCAATGCACTGTATAGCCAAGTGTTGGTATTGGGACATCACGGCAGCAGCACGGCTTCGTCGGGCAG
CTTTCTGCATACGGTTTCGCCGCAATATGCCGTGGCATCCAGCGGCTATGCCAATGCCTACAAACATCCGACCGTTGCCG
TACAAAATCGTGTCCGCGCGCACGGCATTACTTTGTTGAGAACTGATTTGTCAGGCGCGTTGGTGTTTGCTTTGGGACAG
GACGATATATTTCAAGGCCGTCTGAAAAAGGATAAGTTTTATTGGCAGAAGAAACCGTTTGAGTAA
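The lengths reported for this record can be cross-checked against its coordinates. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon (the nucleotide sequence above ends in TAA). The function names are hypothetical helpers, not part of any database tool.

```python
def span_length(start, end):
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Number of amino acids encoded by a CDS of the given length,
    assuming it ends with one stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


nt_len = span_length(630472, 632697)
print(nt_len)                  # → 2226
print(protein_length(nt_len))  # → 741
```

Both values agree with the 2226 bp and 741 a.a. stated in the record.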
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comA | Neisseria gonorrhoeae MS11 | 69.783 | 99.595 | 0.695 |
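The record does not define the Ha-value. The listed figure is numerically consistent with the product of identity and coverage expressed as fractions, so the sketch below treats that formula as an assumption rather than the database's documented definition. The function name `ha_value` is hypothetical.

```python
def ha_value(identity_pct, coverage_pct):
    """Assumed formula: (identity / 100) * (coverage / 100).

    This is inferred from the numbers in the table, not taken
    from the database documentation.
    """
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# comA of Neisseria gonorrhoeae MS11: 69.783 % identity, 99.595 % coverage
print(round(ha_value(69.783, 99.595), 3))  # → 0.695
```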