Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Machinery gene
Locus tag   Q4S45_RS07915 Genome accession   NZ_CP131935
Coordinates   1740317..1742677 (+) Length   786 a.a.
NCBI ID   WP_305510726.1    Uniprot ID   A0AAT9WK79
Organism   Massilia sp. R2A-15     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1735317..1747677
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  Q4S45_RS07890 (Q4S45_07890) - 1735503..1736765 (+) 1263 WP_305510716.1 lipoprotein-releasing ABC transporter permease subunit -
  Q4S45_RS07895 (Q4S45_07895) lolD 1736758..1737450 (+) 693 WP_305510718.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  Q4S45_RS07900 (Q4S45_07900) - 1737459..1738262 (+) 804 WP_305510720.1 TatD family hydrolase -
  Q4S45_RS07905 (Q4S45_07905) - 1738251..1738973 (-) 723 WP_305510722.1 hypothetical protein -
  Q4S45_RS07910 (Q4S45_07910) - 1739144..1740130 (+) 987 WP_305510724.1 alpha/beta hydrolase -
  Q4S45_RS07915 (Q4S45_07915) comA 1740317..1742677 (+) 2361 WP_305510726.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  Q4S45_RS07920 (Q4S45_07920) - 1742801..1743085 (-) 285 WP_305510727.1 hypothetical protein -
  Q4S45_RS07925 (Q4S45_07925) - 1743344..1744147 (+) 804 WP_305510728.1 alpha/beta fold hydrolase -
  Q4S45_RS07930 (Q4S45_07930) dcd 1744211..1744780 (+) 570 WP_305510730.1 dCTP deaminase -
  Q4S45_RS07935 (Q4S45_07935) - 1744905..1746776 (+) 1872 WP_305510732.1 GspE/PulE family protein -

Sequence


Protein


Download         Length: 786 a.a.        Molecular weight: 83911.44 Da        Isoelectric Point: 9.1750

>NTDB_id=866376 Q4S45_RS07915 WP_305510726.1 1740317..1742677(+) (comA) [Massilia sp. R2A-15]
MRCAILGFVAGAAWLQTMGELPDRALMAVCAAVALALLLALRGAVRAAAAGALLGFCWAALIAHLALAPQLAKEDEGRDV
TLVGTIDSLPYRFEQGVRFNFAVERVVGEPMAVPPRVSLAWYAGFRDQVQQVGAVQPGERWQLTVRLQRPHGNANPYGFD
YEVWQLEQGVRATGYVRPAGEANRRLDSFVFSIGNLVEHCRATLRDRILAALPGKQYAGVIVALVVGDQRAIDQSDWTVF
NRTGISHLISISGLHITMIAGVFALLAGWLWRRSFFTGAQLPLMLPAQKVAALAGAGAALLYVLLAGFGVPAQRTLYMLA
VVAAALWSGRIASVSHVLCAALGLVVLLDPWAVLWPGFWLSFGAVAVILYATVGRVGARQAGFKAALRLGAHTQYVVTLG
LVPLTMLLFAQVSLASPIANAVAIPLVSFVVTPLALAGSMAPAPLSGALLALAHFAVEGLAGFLEWMSASPLAVWSAPTP
APWVFCFAFAGTLWMLAPRGWPHRWTGAAAWLPLLASLPSHPAQGEMAVTAFDVGQGMALLIETSGHRLLYDAGPLYSPD
ANGGNRVIGPYLKARGIDRLDGMVITHSDADHAGGALAVMAAVKVGWVASSLPPAHPIVRAAGTHSRCAAGQQWSWDGVQ
FEMLQPTLASYDNAALKPNARGCTLRISAGKHAVLLAADIEAAQEAQLVLASAGQLRADVLLAPHHGSGTSSTPGFLRAV
QPALGIFQVGYRNRYHHPKAEVYERYREMGIARMRTDESGAIQLQFGNVVVASEYRRDHARYWYEH

Nucleotide


Download         Length: 2361 bp        

>NTDB_id=866376 Q4S45_RS07915 WP_305510726.1 1740317..1742677(+) (comA) [Massilia sp. R2A-15]
ATGCGATGTGCGATCCTCGGATTCGTGGCGGGCGCTGCCTGGCTCCAGACCATGGGCGAGCTGCCCGATCGCGCGCTGAT
GGCGGTGTGCGCGGCAGTGGCGCTGGCGCTGCTCCTGGCCTTGCGCGGCGCGGTACGCGCGGCCGCAGCCGGCGCCTTGT
TGGGTTTCTGCTGGGCCGCGCTGATCGCCCACCTGGCGCTGGCGCCCCAGTTGGCGAAGGAAGACGAAGGCCGTGATGTC
ACGCTGGTGGGCACCATCGACAGCCTGCCATACCGGTTCGAGCAGGGCGTCCGGTTCAACTTCGCCGTCGAACGCGTGGT
GGGAGAGCCGATGGCGGTGCCGCCGCGCGTGTCGCTGGCCTGGTACGCCGGCTTTCGCGACCAGGTCCAGCAGGTCGGCG
CGGTGCAGCCCGGCGAACGCTGGCAACTGACGGTGCGGCTGCAGCGCCCGCACGGCAACGCCAACCCCTACGGTTTCGAC
TATGAAGTGTGGCAGCTCGAGCAGGGCGTGCGAGCGACCGGCTACGTGCGGCCTGCCGGCGAGGCCAATCGCCGGCTAGA
CAGCTTCGTTTTCAGCATCGGCAACCTGGTCGAGCATTGCAGGGCCACGCTGCGCGACCGCATTCTCGCGGCGCTGCCGG
GCAAGCAGTACGCCGGCGTGATCGTGGCGCTGGTTGTCGGCGACCAGCGCGCGATAGACCAATCCGACTGGACGGTATTC
AACCGCACCGGAATCAGCCACCTGATCTCGATCTCGGGCCTGCACATCACGATGATCGCCGGCGTGTTTGCCTTGTTGGC
GGGCTGGCTGTGGAGGCGCTCCTTCTTCACCGGCGCGCAGTTGCCGTTGATGCTGCCGGCGCAAAAGGTCGCTGCGCTCG
CGGGCGCCGGCGCCGCGCTGTTGTACGTGCTGCTGGCCGGCTTTGGCGTCCCCGCCCAGCGCACGCTCTATATGCTGGCG
GTGGTCGCCGCGGCGCTGTGGTCGGGCCGCATCGCCAGCGTCTCCCATGTGCTGTGCGCGGCGCTCGGCCTGGTGGTGCT
GCTCGATCCCTGGGCAGTGCTCTGGCCAGGCTTCTGGCTGTCGTTCGGCGCGGTGGCGGTGATCCTGTACGCGACCGTCG
GCCGGGTGGGCGCGCGGCAGGCCGGGTTCAAGGCCGCGCTGCGGCTGGGCGCGCACACGCAATACGTCGTCACGCTTGGG
CTGGTGCCGCTGACCATGCTGCTGTTCGCCCAGGTCTCGCTGGCCAGCCCAATTGCGAATGCGGTGGCGATCCCGCTGGT
GAGTTTCGTCGTGACGCCGCTGGCGCTGGCGGGCAGCATGGCGCCGGCGCCGCTGTCCGGCGCGCTGCTCGCACTTGCCC
ACTTTGCGGTCGAAGGCCTGGCTGGTTTTCTCGAATGGATGTCCGCAAGCCCGCTGGCGGTATGGAGCGCGCCGACGCCG
GCGCCGTGGGTGTTCTGCTTTGCGTTCGCCGGAACCCTGTGGATGCTGGCGCCGCGCGGATGGCCGCACCGTTGGACCGG
CGCCGCGGCATGGCTGCCGTTGCTGGCCAGCCTGCCGTCGCACCCGGCGCAAGGAGAGATGGCAGTCACCGCCTTCGATG
TCGGGCAGGGCATGGCGCTGCTGATCGAAACATCGGGCCACCGGCTGCTGTACGACGCCGGACCACTGTACTCGCCGGAC
GCGAACGGCGGCAACCGCGTGATCGGGCCTTACCTGAAGGCGCGCGGAATCGACCGGCTCGACGGCATGGTCATCACGCA
CAGCGACGCGGACCACGCCGGCGGCGCCCTGGCCGTGATGGCCGCCGTGAAGGTCGGCTGGGTAGCGTCGTCGCTGCCGC
CGGCGCATCCCATCGTGCGCGCCGCCGGCACGCACTCGCGCTGCGCGGCGGGCCAGCAATGGAGCTGGGACGGCGTGCAG
TTCGAAATGCTGCAGCCGACACTGGCCAGCTACGACAATGCCGCGCTCAAGCCGAACGCGCGCGGCTGCACATTGCGCAT
CAGCGCCGGCAAGCACGCGGTCCTGCTGGCGGCGGACATCGAAGCGGCGCAGGAAGCGCAGCTGGTGCTGGCGTCCGCCG
GCCAGCTGCGCGCCGACGTGCTGCTCGCGCCGCATCACGGCAGCGGCACATCGTCGACGCCGGGCTTCTTGCGGGCGGTG
CAGCCGGCGCTGGGCATATTCCAGGTCGGGTACCGCAATCGATATCACCATCCAAAGGCGGAGGTCTACGAGCGATACCG
CGAGATGGGCATCGCCCGCATGCGCACCGACGAATCGGGCGCAATTCAGCTGCAATTCGGAAATGTGGTCGTTGCCAGCG
AGTATCGGCGCGACCATGCACGTTACTGGTACGAACATTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Ralstonia pseudosolanacearum GMI1000

46.145

100

0.487