Detailed information    

In silico (bioinformatically predicted)

Overview


Name   comA/comEC
Type   Machinery gene
Locus tag   U2S94_RS04960
Genome accession   NZ_CP140434
Coordinates   917297..919729 (+)
Length   810 a.a.
NCBI ID   WP_000472705.1
UniProt ID   A0A231SZ39
Organism   Acinetobacter baumannii strain 2023CK-01274
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 916525..932910 917297..919729 within 0
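
The "Relative position" and "Distance (bp)" columns can be reproduced from the two coordinate ranges alone. The following is a minimal sketch, assuming 1-based inclusive coordinates; it is not VRprofile2's actual procedure, and the function name and the labels other than "within" are hypothetical.

```python
def relative_position(gene, mge):
    """Classify a gene interval against an MGE interval.

    Both arguments are (start, end) tuples with 1-based inclusive
    coordinates. Labels other than "within" are illustrative
    assumptions, not terms taken from the database.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0                      # gene fully inside the MGE
    if g_end < m_start:
        return "upstream", m_start - g_end      # gene ends before the MGE starts
    if g_start > m_end:
        return "downstream", g_start - m_end    # gene starts after the MGE ends
    return "overlapping", 0                     # partial overlap with an MGE boundary


# Values from the association summary above.
print(relative_position((917297, 919729), (916525, 932910)))
```

For this record the gene range lies entirely inside the prophage range, which matches the "within" and 0 bp reported in the table.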


Gene organization within MGE regions


Location: 916525..932910
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  U2S94_RS04955 (U2S94_04960) lolD 916525..917211 (+) 687 WP_000049403.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  U2S94_RS04960 (U2S94_04965) comA/comEC 917297..919729 (+) 2433 WP_000472705.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  U2S94_RS04965 (U2S94_04970) - 919682..920665 (-) 984 WP_001984639.1 lysophospholipid acyltransferase family protein -
  U2S94_RS04970 (U2S94_04975) sppA 920802..921818 (+) 1017 WP_001286615.1 signal peptide peptidase SppA -
  U2S94_RS04975 (U2S94_04980) - 921837..922646 (+) 810 WP_000114076.1 alpha/beta fold hydrolase -
  U2S94_RS04980 (U2S94_04985) purN 922710..923339 (-) 630 WP_000975536.1 phosphoribosylglycinamide formyltransferase -
  U2S94_RS04985 (U2S94_04990) purM 923336..924406 (-) 1071 WP_000071984.1 phosphoribosylformylglycinamidine cyclo-ligase -
  U2S94_RS04990 (U2S94_04995) cxpE 924542..925735 (+) 1194 WP_001153980.1 chloramphenicol efflux transporter CxpE -
  U2S94_RS04995 (U2S94_05000) hda 925756..926454 (+) 699 WP_001249535.1 DnaA regulatory inactivator Hda -
  U2S94_RS05000 (U2S94_05005) rbtA 926771..928621 (+) 1851 WP_000920012.1 rhombotarget A -
  U2S94_RS05005 (U2S94_05010) - 928637..929575 (+) 939 Protein_823 CSLREA domain-containing protein -
  U2S94_RS05010 (U2S94_05015) - 929629..930561 (-) 933 WP_001043692.1 IS5-like element ISAba13 family transposase -
  U2S94_RS05015 (U2S94_05020) - 930615..932120 (+) 1506 Protein_825 CSLREA domain-containing protein -
  U2S94_RS05020 (U2S94_05025) - 932211..932600 (-) 390 WP_000225919.1 DUF3106 domain-containing protein -
  U2S94_RS05025 (U2S94_05030) - 932590..932910 (-) 321 WP_001015073.1 hypothetical protein -

Sequence


Protein


Length: 810 a.a.        Molecular weight: 92426.69 Da        Isoelectric Point: 8.6357

>NTDB_id=914278 U2S94_RS04960 WP_000472705.1 917297..919729(+) (comA/comEC) [Acinetobacter baumannii strain 2023CK-01274]
MFKIILLGWIGGIALMGIDFPLIMQYEKVGEALLLLAFIFYLYKRPMFVDRPFLKAVFCLLCTTSLFMVGYHYAEKALIE
RLEQRETDTRNLDIIVYINRLSEEKDNKVQQTAQVLNFSKEPVNWLLYLKSNNQNLLKNNQNLELGHYYRISGKTRPAHS
YATPGAFDQEKWFIQRNIMSGFNVRYIEPLSLDEIYRLGYQQHLKEQQSFSNSFRLNIEKLRLTFRQILNSSSLQQKGLI
LALLTGDESLLSDETQLQFKQLGISHLLAISGPHVLIFAIMLSWACHQFISRYYPQIYLWKPKQVLMAVPCCLGVLIYTA
FVGFEIPALRTLLSAFIFIGFLLLKQPIKPFTLLVYSASLLLLIDPFSVLSAGFWLSYGACFILLRIYQTIAQLPEQQFL
SLSSKMIFMGKVLIESQGKIFIALSPLTLLFFQQISWVAPLTNIIAVPIVGGVIVPLNIMAACAWFIVKPFGNMLFHFND
MLLSILLSCLGLLEKLSLPLQGISLTPLSLLAISFAIIILFLPKGILPKTWGILCCLPLVIMNKTSQQIQLNILDVGQGQ
AIFLQHSEQNWLIDTGGSYDEKIFSIGQNVVVPFLRQQGVRQLDHVVLSHLDQDHSGAFPLIQQEIPVKQLISNEQLPND
LKQPFQYCHQGQQWHYPELDIQILWPKEKDLAFVASNQNQYSCVVYLQFKKVGGYQNFLIMGDAGWEAEYELLKDYPNLK
IDVLVLGHHGSKHSSAYDFLATLKPKLAIASAGFDNRYGHPSQQVIARLKALHIPLKSTVEQGTLSFVLENHKIVLHDRR
LDRLWLSRGF
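
The protein and nucleotide FASTA headers in this record share one layout: database ID, locus tag, protein ID, coordinates with strand, gene name in parentheses, and organism in square brackets. A minimal sketch of a parser for that layout follows; the regular expression and the field names are assumptions inferred from the header shown here, not a documented format.

```python
import re

# Pattern inferred from the header in this record; field names are hypothetical.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line):
    """Return the header fields as a dict, or raise ValueError if the layout differs."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the expected layout")
    fields = match.groupdict()
    # Convert the numeric fields so they can be used in arithmetic.
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields


header = (">NTDB_id=914278 U2S94_RS04960 WP_000472705.1 917297..919729(+) "
          "(comA/comEC) [Acinetobacter baumannii strain 2023CK-01274]")
fields = parse_header(header)
print(fields["locus_tag"], fields["start"], fields["end"], fields["strand"])
```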

Nucleotide


Length: 2433 bp

>NTDB_id=914278 U2S94_RS04960 WP_000472705.1 917297..919729(+) (comA/comEC) [Acinetobacter baumannii strain 2023CK-01274]
ATGTTTAAGATTATTCTATTGGGGTGGATTGGCGGTATTGCATTGATGGGAATAGATTTCCCTTTAATCATGCAATATGA
AAAAGTGGGCGAGGCTCTACTGTTACTTGCCTTTATTTTTTATCTTTATAAACGACCAATGTTCGTTGATCGACCATTTT
TAAAGGCAGTGTTTTGCTTATTATGTACAACAAGTCTTTTTATGGTTGGTTACCACTATGCTGAAAAAGCACTGATTGAA
CGATTAGAACAAAGAGAAACGGATACCCGAAATCTCGACATTATTGTTTATATAAACCGTTTAAGTGAAGAAAAAGATAA
TAAGGTTCAACAAACTGCACAAGTTCTAAATTTTTCTAAAGAACCGGTGAATTGGTTGCTATATTTAAAAAGTAATAATC
AAAATTTATTAAAGAATAATCAGAATCTTGAATTAGGTCACTATTATCGAATATCTGGAAAAACAAGACCTGCGCATAGT
TATGCCACCCCAGGAGCTTTTGATCAGGAAAAATGGTTTATTCAGCGAAATATTATGTCTGGTTTTAATGTGAGATATAT
TGAGCCTTTAAGTCTCGATGAAATCTATCGATTGGGCTATCAGCAACATTTAAAAGAACAACAGTCTTTTTCCAATAGTT
TTCGTTTAAATATAGAAAAACTTCGCTTAACTTTTAGGCAAATATTAAACAGCTCATCTCTACAGCAAAAGGGTTTAATT
TTAGCTTTGCTGACAGGTGATGAAAGCCTTTTATCAGATGAAACTCAACTACAGTTCAAACAATTAGGAATTAGTCATTT
ATTGGCGATCTCAGGCCCACATGTGCTCATTTTTGCCATTATGTTATCTTGGGCATGTCATCAATTTATCAGTCGTTATT
ATCCTCAAATTTACTTATGGAAACCGAAACAGGTTTTGATGGCTGTACCATGCTGCCTTGGTGTTTTAATTTATACTGCA
TTTGTAGGTTTCGAAATTCCTGCACTACGAACATTATTATCAGCATTTATATTTATTGGATTTCTATTATTAAAACAACC
TATTAAACCTTTTACATTACTCGTATATAGCGCAAGTCTACTGTTGCTAATTGACCCGTTTAGTGTGCTTTCTGCAGGTT
TTTGGCTGTCTTATGGGGCATGCTTTATTTTATTAAGAATTTACCAAACTATAGCGCAGCTACCTGAGCAACAGTTTTTA
AGTCTGAGCTCAAAAATGATTTTTATGGGTAAGGTGTTAATTGAGTCTCAAGGCAAAATATTTATTGCATTGAGTCCCTT
AACCTTACTTTTCTTTCAACAAATTTCTTGGGTTGCTCCATTAACCAATATTATTGCCGTGCCTATTGTTGGTGGTGTTA
TTGTCCCTTTAAACATCATGGCTGCTTGCGCATGGTTTATAGTAAAACCGTTTGGAAATATGCTTTTTCATTTCAATGAT
ATGTTGCTCAGCATATTGCTGAGTTGTTTGGGCTTATTAGAAAAACTCTCTTTACCATTACAAGGTATAAGCTTGACGCC
ACTGTCTTTATTAGCGATTAGTTTTGCCATAATTATTTTATTTTTACCTAAAGGAATTCTGCCCAAAACTTGGGGGATAT
TATGTTGTTTACCCTTAGTTATTATGAACAAAACGAGTCAACAAATTCAGCTTAATATTTTAGATGTTGGACAGGGGCAA
GCCATTTTTTTACAACATTCCGAACAAAACTGGTTAATTGATACAGGCGGTTCTTACGATGAAAAAATATTTAGTATTGG
ACAAAATGTTGTAGTGCCTTTTCTACGTCAGCAAGGCGTAAGACAATTAGATCATGTTGTGCTATCCCATCTTGATCAAG
ACCATAGTGGCGCGTTTCCTCTTATTCAACAAGAGATTCCTGTAAAGCAGCTTATTTCGAATGAACAATTACCAAATGAT
TTAAAGCAACCATTCCAATATTGCCATCAAGGGCAACAATGGCATTATCCTGAGTTAGATATTCAAATTTTATGGCCTAA
AGAAAAAGATCTGGCTTTTGTTGCTTCTAACCAGAATCAATATTCTTGTGTTGTATATCTTCAATTTAAAAAAGTTGGTG
GTTACCAAAATTTTCTTATTATGGGCGATGCAGGATGGGAAGCCGAATACGAGTTATTAAAAGATTATCCTAACTTGAAG
ATAGATGTGTTGGTGCTAGGGCATCATGGAAGTAAGCACAGTTCGGCTTATGATTTCTTGGCGACCTTAAAACCTAAACT
GGCCATTGCATCGGCAGGGTTTGATAACCGTTATGGCCATCCTAGCCAACAAGTTATAGCACGTTTAAAAGCTCTACATA
TTCCGCTAAAAAGCACTGTGGAACAAGGGACCTTAAGTTTTGTGTTGGAAAATCACAAAATAGTTTTACATGACCGACGT
TTGGATAGGCTCTGGTTGAGTAGAGGTTTTTAA
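
The reported lengths are consistent with the coordinates: the span 917297..919729 covers 2433 bp, which is 811 codons, and the final codon (TAA) is a stop, leaving 810 amino acids. A minimal sketch of that check, assuming 1-based inclusive coordinates and a CDS that includes its stop codon (the function name is hypothetical):

```python
def cds_lengths(start, end):
    """Return (nucleotide length, protein length) for a CDS.

    Assumes 1-based inclusive coordinates and that the terminal
    stop codon is part of the annotated span.
    """
    nt_length = end - start + 1
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    aa_length = nt_length // 3 - 1  # subtract the stop codon
    return nt_length, aa_length


nt_length, aa_length = cds_lengths(917297, 919729)
print(nt_length, aa_length)  # 2433 810
```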


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A231SZ39

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA/comEC Acinetobacter baumannii D1279779 98.025 100 0.98
  comA/comEC Acinetobacter baumannii strain A118 97.037 100 0.97
  comA/comEC Acinetobacter baylyi ADP1 50.246 100 0.504