Detailed information    

In silico: bioinformatically predicted

Overview


Name: comA
Type: Machinery gene
Locus tag: G4G71_RS20215
Genome accession: NZ_CP048833
Coordinates: 4456918..4459128 (-)
Length: 736 a.a.
NCBI ID: WP_169942735.1
UniProt ID: A0A7Z3GTF1
Organism: Pseudomonas multiresinivorans strain populi
Function: ssDNA transport through the inner membrane (predicted from homology)
DNA binding and uptake
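
The coordinates, gene length, and protein length in this record are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch: it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Coordinates of comA (G4G71_RS20215) as listed in this record.
cds_bp = span_length(4456918, 4459128)
aa = protein_length(cds_bp)
print(cds_bp, aa)  # 2211 736
```

Both values agree with the 2211 bp and 736 a.a. lengths reported elsewhere in this record.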

Genomic Context


Location: 4451918..4464128
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
  G4G71_RS20180 (G4G71_20135)   murB   4452274..4453293 (-)   1020   WP_169939722.1   UDP-N-acetylmuramate dehydrogenase   -
  G4G71_RS20185 (G4G71_20140)   -   4453290..4453754 (-)   465   WP_169939723.1   low molecular weight protein-tyrosine-phosphatase   -
  G4G71_RS20190 (G4G71_20145)   kdsB   4453754..4454518 (-)   765   WP_065084902.1   3-deoxy-manno-octulosonate cytidylyltransferase   -
  G4G71_RS20195 (G4G71_20150)   -   4454523..4454708 (-)   186   WP_009621404.1   Trm112 family protein   -
  G4G71_RS20200 (G4G71_20155)   lpxK   4454767..4455765 (-)   999   WP_169939724.1   tetraacyldisaccharide 4'-kinase   -
  G4G71_RS20205 (G4G71_20160)   -   4455765..4456214 (-)   450   WP_045211644.1   ExbD/TolR family protein   -
  G4G71_RS20210 (G4G71_20165)   exbB   4456211..4456846 (-)   636   WP_017519262.1   MotA/TolQ/ExbB proton channel family protein   Machinery gene
  G4G71_RS20215 (G4G71_20170)   comA   4456918..4459128 (-)   2211   WP_169942735.1   DNA internalization-related competence protein ComEC/Rec2   Machinery gene
  G4G71_RS20220 (G4G71_20175)   -   4459291..4459821 (+)   531   WP_169939725.1   DUF2062 domain-containing protein   -
  G4G71_RS20225 (G4G71_20180)   -   4459970..4461217 (-)   1248   WP_169939726.1   lipoprotein-releasing ABC transporter permease subunit   -
  G4G71_RS20230 (G4G71_20185)   lolD   4461229..4461918 (-)   690   WP_024766679.1   lipoprotein-releasing ABC transporter ATP-binding protein LolD   -
  G4G71_RS20235 (G4G71_20190)   -   4461911..4463164 (-)   1254   WP_045211653.1   lipoprotein-releasing ABC transporter permease subunit   -
  G4G71_RS20240 (G4G71_20195)   -   4463289..4463864 (+)   576   WP_169939727.1   PilZ domain-containing protein   -
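
The displayed location (4451918..4464128) matches the comA coordinates extended by 5000 bp on each side. The source does not state this flank size, so the sketch below treats it as an assumption inferred from the numbers; `FLANK_BP` and the helper names are hypothetical.

```python
FLANK_BP = 5000  # assumption: inferred from the coordinates, not stated by the source


def context_window(start: int, end: int, flank: int = FLANK_BP) -> tuple[int, int]:
    """Extend a 1-based inclusive gene range by `flank` bp on each side."""
    return max(1, start - flank), end + flank


def within(window: tuple[int, int], feature: tuple[int, int]) -> bool:
    """True if the feature lies entirely inside the window."""
    return window[0] <= feature[0] and feature[1] <= window[1]


# comA coordinates from this record.
window = context_window(4456918, 4459128)

# First and last neighbouring genes listed in the genomic context table.
neighbours = {
    "G4G71_RS20180": (4452274, 4453293),
    "G4G71_RS20240": (4463289, 4463864),
}
all_inside = all(within(window, coords) for coords in neighbours.values())
print(window, all_inside)
```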

Sequence


Protein


Length: 736 a.a.   Molecular weight: 79449.76 Da   Isoelectric point: 7.9897

>NTDB_id=423406 G4G71_RS20215 WP_169942735.1 4456918..4459128(-) (comA) [Pseudomonas multiresinivorans strain populi]
MFALALGLLALRWLPALPSGWVLLLLALSALPLLFTRGYFIGLFLLGFAWACQSAQWALDDRLTPELDGRTLWLEGRVEG
LPDRSGPSLRFVLSEASSPRAQVPATLRLSWFGGPPVEGGERWRLAVKLKRPHGMANGAGFDYEAWLTAQRIGATGSVKD
GQRLEVSSGPRAWREAWRQRLLAVDAQGRSGALAALVLGDASGLTTADWQVLQDTGTLHLMVISGSHISLLAGLLYAVVA
GLARLGCWPSRLPWLSCACLLAAGGAWAYSLMAGFEVPLQRACVMVSIVLLWRLRYRHRGLWTPLLGALLAVLLAEPLVV
LLPGFWLSYAAVALLIFGFSGRLGRWTAWRTWLRAQWLMAIGLLPASIALGLPLSISGVAANLIAVPWVELVVVPLALLG
SLVLGVPVLGEALLWVSGGLLEWLFRLLGWMAELAPAWQPVAAPAWTLVMAMLGALLLLAPAGLPLRALGLAMFLPVFWP
TIPLPAPGMAEVRVLDVGQGLSVLIRTRSQAWLYDTGARNGDFDIGERVVVPTLRSLGVGRLDLLMLSHADNDHAGGAVA
VKRSLRPAQVISGEPERLAPELQAQPCRIEEWTVDDVHLSSWAWAGARESNDRSCALEIEANGERILLTGDLPQSAELAW
LAAHPDEHVDWLLAGHHGSRSSSGPAFLRAIRPSTAIMSRGANNPYGHPHPSVVERFRALGIRIQDTAERGALVLTLGAH
GELGGVREGAHFWQEN
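
The FASTA header for this entry packs several fields into one line. A minimal parsing sketch follows, assuming every header uses the same layout as the one in this record; the pattern is inferred from this single example, and `HEADER_RE` and `parse_header` are illustrative names rather than a documented format or API.

```python
import re

# Layout inferred from this record's header only (assumption):
# >NTDB_id=<id> <locus_tag> <protein_id> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]\s*$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; raise ValueError if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {line!r}")
    fields = match.groupdict()
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields


header = (
    ">NTDB_id=423406 G4G71_RS20215 WP_169942735.1 "
    "4456918..4459128(-) (comA) [Pseudomonas multiresinivorans strain populi]"
)
record = parse_header(header)
print(record["gene"], record["strand"], record["end"] - record["start"] + 1)
```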

Nucleotide


Length: 2211 bp

>NTDB_id=423406 G4G71_RS20215 WP_169942735.1 4456918..4459128(-) (comA) [Pseudomonas multiresinivorans strain populi]
ATGTTCGCTCTGGCCCTGGGGCTGCTGGCGCTGCGCTGGCTGCCGGCGCTTCCATCCGGTTGGGTGCTGCTGCTTCTCGC
GCTGTCCGCGTTACCTCTGTTGTTCACCCGCGGTTATTTCATCGGGCTGTTCCTGCTCGGCTTCGCCTGGGCCTGCCAAT
CGGCGCAGTGGGCGCTGGATGACCGGCTGACGCCAGAGCTGGACGGTCGCACGCTGTGGCTGGAGGGCAGGGTAGAGGGC
TTGCCGGATCGTTCGGGGCCTTCACTGCGCTTCGTGCTCTCCGAGGCTTCGAGCCCTCGCGCGCAGGTTCCGGCAACACT
GCGCCTGTCCTGGTTCGGCGGCCCGCCGGTGGAGGGCGGCGAACGCTGGCGGCTGGCGGTGAAGCTCAAGCGTCCCCATG
GCATGGCCAATGGGGCGGGCTTCGACTACGAGGCCTGGCTCACTGCCCAGCGTATCGGCGCCACCGGCAGCGTAAAGGAT
GGCCAGCGCCTGGAAGTCTCCAGTGGCCCGCGCGCCTGGCGTGAGGCCTGGCGACAACGTTTGCTGGCGGTGGACGCCCA
GGGTCGCTCCGGTGCGCTTGCTGCGCTGGTGCTGGGCGACGCTTCCGGGTTGACCACGGCGGACTGGCAGGTCCTGCAGG
ACACCGGAACCCTGCACCTGATGGTGATTTCCGGTTCGCACATTTCCCTGCTGGCGGGCCTGCTCTACGCCGTCGTGGCG
GGGCTGGCGCGCCTGGGCTGCTGGCCATCCCGTCTGCCCTGGTTGTCCTGCGCCTGCCTGTTGGCTGCCGGCGGCGCCTG
GGCGTATAGCCTGATGGCGGGCTTCGAAGTGCCGTTGCAGCGCGCGTGCGTGATGGTCTCCATCGTCCTGCTGTGGCGCC
TGCGTTATCGCCATCGCGGCCTGTGGACGCCATTGCTCGGCGCCTTGCTGGCAGTACTGCTGGCCGAGCCGCTGGTGGTG
TTGCTGCCGGGGTTCTGGTTGTCCTATGCGGCGGTGGCCTTGCTGATCTTCGGCTTCTCCGGCCGCCTGGGGCGCTGGAC
GGCCTGGCGCACCTGGCTGCGCGCGCAGTGGCTGATGGCGATCGGGCTGTTGCCGGCGTCCATCGCGCTGGGCCTGCCGC
TGAGCATTTCCGGAGTCGCCGCGAACCTGATTGCCGTGCCCTGGGTGGAGCTGGTCGTGGTTCCACTCGCCCTGTTGGGC
AGTCTGGTGCTGGGAGTTCCCGTCCTGGGGGAGGCGCTGCTGTGGGTGTCCGGCGGCCTGCTGGAGTGGCTGTTCCGCCT
GTTGGGCTGGATGGCCGAACTGGCACCCGCCTGGCAGCCGGTGGCGGCGCCGGCCTGGACGCTGGTGATGGCGATGCTTG
GCGCGCTTCTGCTACTGGCGCCGGCCGGCTTGCCATTGCGCGCGTTGGGGCTCGCGATGTTCCTGCCGGTGTTCTGGCCG
ACCATTCCGCTACCCGCGCCGGGCATGGCTGAAGTCCGTGTGCTGGACGTTGGGCAGGGGCTCTCGGTGTTGATCCGTAC
GCGCAGCCAGGCCTGGCTCTACGACACCGGGGCGCGCAACGGTGATTTCGATATCGGCGAGCGAGTGGTGGTGCCGACCT
TGCGCAGTCTCGGCGTTGGCAGGCTCGATCTGCTGATGCTCAGCCATGCGGACAATGACCATGCCGGCGGCGCGGTCGCG
GTGAAGCGTTCGCTCCGGCCTGCGCAGGTCATCAGTGGCGAGCCGGAAAGGCTGGCGCCAGAGCTGCAGGCGCAACCCTG
CCGAATCGAGGAATGGACGGTCGATGACGTTCACCTCTCCAGTTGGGCGTGGGCAGGCGCCAGGGAAAGCAACGACCGCT
CCTGCGCGCTGGAAATCGAAGCCAATGGTGAGCGCATCCTGCTCACTGGCGACCTGCCGCAATCCGCCGAACTGGCTTGG
CTGGCGGCGCATCCGGATGAACACGTCGATTGGTTGCTGGCCGGGCACCACGGCAGCCGCAGCTCCTCCGGGCCGGCCTT
CCTGCGAGCGATCCGACCGTCCACGGCGATCATGTCCCGCGGCGCCAACAACCCCTACGGGCACCCGCATCCGTCGGTCG
TCGAGCGCTTTCGCGCGCTGGGTATCCGGATTCAGGATACGGCGGAGCGGGGCGCGCTGGTCCTGACCCTCGGTGCCCAT
GGCGAGCTTGGGGGAGTGCGGGAAGGCGCACATTTCTGGCAGGAAAATTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source   ID
  AlphaFold DB   A0A7Z3GTF1

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comA   Pseudomonas stutzeri DSM 10701   57.521   97.554   0.561
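
The page does not define the Ha-value. For this hit, the listed value matches the product of the identity and coverage fractions. Treating that product as the definition is an assumption, shown here only as a consistency check; the function name is hypothetical.

```python
def identity_coverage_product(identity_pct: float, coverage_pct: float) -> float:
    """Product of identity and coverage, each converted from percent to a fraction.

    Assumption: this reproduces the Ha-value listed for this hit; the source
    does not give a formula.
    """
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Values listed for the comA hit from Pseudomonas stutzeri DSM 10701.
value = identity_coverage_product(57.521, 97.554)
print(round(value, 3))
```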


Multiple sequence alignment