Detailed information    

In silico: bioinformatically predicted

Overview


Name: comA
Type: Machinery gene
Locus tag: R5H22_RS05935
Genome accession: NZ_CP137764
Coordinates: 1276663..1278873 (+)
Length: 736 a.a.
NCBI ID: WP_156483617.1
UniProt ID: -
Organism: Azotobacter sp. NL3
Function: ssDNA transport through the inner membrane (predicted from homology); DNA binding and uptake

Genomic Context


Location: 1271663..1283873
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  R5H22_RS05910 (R5H22_05910) - 1271817..1272539 (-) 723 WP_012700103.1 glycerophosphodiester phosphodiesterase -
  R5H22_RS05915 (R5H22_05915) - 1272787..1274037 (+) 1251 WP_012700104.1 lipoprotein-releasing ABC transporter permease subunit -
  R5H22_RS05920 (R5H22_05920) lolD 1274030..1274728 (+) 699 WP_012700105.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  R5H22_RS05925 (R5H22_05925) - 1274731..1275975 (+) 1245 WP_012700106.1 lipoprotein-releasing ABC transporter permease subunit -
  R5H22_RS05930 (R5H22_05930) - 1276000..1276527 (-) 528 WP_012700107.1 DUF2062 domain-containing protein -
  R5H22_RS05935 (R5H22_05935) comA 1276663..1278873 (+) 2211 WP_156483617.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  R5H22_RS05940 (R5H22_05940) - 1278945..1279577 (+) 633 WP_041807047.1 MotA/TolQ/ExbB proton channel family protein -
  R5H22_RS05945 (R5H22_05945) - 1279574..1280002 (+) 429 WP_012700110.1 biopolymer transporter ExbD -
  R5H22_RS05950 (R5H22_05950) lpxK 1280002..1281003 (+) 1002 WP_012700111.1 tetraacyldisaccharide 4'-kinase -
  R5H22_RS05955 (R5H22_05955) - 1281064..1281249 (+) 186 WP_012700112.1 Trm112 family protein -
  R5H22_RS05960 (R5H22_05960) kdsB 1281246..1282010 (+) 765 WP_012700113.1 3-deoxy-manno-octulosonate cytidylyltransferase -
  R5H22_RS05965 (R5H22_05965) - 1282010..1282474 (+) 465 WP_012700114.1 low molecular weight protein-tyrosine-phosphatase -
  R5H22_RS05970 (R5H22_05970) murB 1282471..1283490 (+) 1020 WP_012700115.1 UDP-N-acetylmuramate dehydrogenase -
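
The coordinates in this table are 1-based and inclusive, so the sizes follow directly from them. The following is a minimal Python sketch of that arithmetic for comA. It assumes that the displayed window is the gene extended by 5,000 bp on each side (inferred from the Location line, not stated in the record) and that the protein length excludes the stop codon. The helper name `span_length` is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


GENE_START, GENE_END = 1276663, 1278873  # comA (R5H22_RS05935), + strand
FLANK = 5000  # assumed flank size, inferred from the Location line

gene_bp = span_length(GENE_START, GENE_END)
protein_aa = gene_bp // 3 - 1  # one codon per residue, minus the stop codon
window = (GENE_START - FLANK, GENE_END + FLANK)

print(gene_bp)     # 2211
print(protein_aa)  # 736
print(window)      # (1271663, 1283873)
```

The same helper reproduces the Size (bp) column for the neighbouring genes, for example `span_length(1276000, 1276527)` for R5H22_RS05930.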

Sequence


Protein


Length: 736 a.a.        Molecular weight: 79919.77 Da        Isoelectric point: 10.2397

>NTDB_id=900600 R5H22_RS05935 WP_156483617.1 1276663..1278873(+) (comA) [Azotobacter sp. NL3]
MLALTAGLLVLRFLPALPPWWLWSPMALGAAILLAARRYSPALFLFGLGWACLSAHWALEERLPVELDGRTLWLEGLVVG
LPARIDGTLHFQLEEASSRRAELPGRLRLAWHAGPEVRAGERWRLAVSLKRPRGLVNPQGFDYEAWLLAQRIGATGTVKA
GERLGTPENADGWRDSLRQRLLQVDAHGREGALAALVMGDASGLSVADWKLLQDTGTVHLMVISGQHVGLLAGLVYGLVV
LLARFGLWPGFLPWLPCACGLAFATALGYGWLAGFGVPVQRACAMLAVVLFWRLRFRHLGLWLPILLALDGVLLLEPLAS
LQPGFWLSFGAVVILVLAFGGRLGAWSWRQTLWRAQWTSALGLLPLLLALGLPISLSGPLANLVAVPWVGFAVVPLALLG
TLLLPLPAMGEGLLWLAGALLETLFRLLGEIAGAVPAWLPHAVPVWGWLLALLGTLLILLPAGVPLRVPGLALLLPLAFP
PQERIPQARADVWLLDVGQGLAVLVRTRGHDLLYDAGPRFGDFDLGERVVLPSLRNLGVGRLDRLLLSHADGDHAGGALA
VRRALPVGEVVAGEAQAQSAALAAQPCARRAWQWDGVRFATWHWTAVQEGNRASCVLLVEAAGERLLLTGDIDAAAERAL
LDSHPEWRADWLLAPHHGSRSSSSPALLKALAPRAVLISRGWNNGFGHPHAQVVERYRKLPAVIHDTARQGALRFRLGDW
GRARGLREEPRFWREK
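
The FASTA header above packs several fields into one line. A minimal Python sketch for splitting such a record into fields and a single unwrapped sequence is shown here. The header layout is an assumption inferred from the two headers in this entry, and `parse_record` is a hypothetical helper, not part of any database tooling. The example record is deliberately truncated to the last sequence line.

```python
import re

# Header layout assumed from this entry:
# >NTDB_id=<id> <locus tag> <protein id> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) \[(?P<organism>[^\]]+)\]$"
)


def parse_record(text: str) -> dict:
    """Parse one FASTA record: header fields plus the joined sequence lines."""
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    match = HEADER.match(lines[0])
    if match is None:
        raise ValueError("unrecognized header: " + lines[0])
    record = match.groupdict()
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    record["sequence"] = "".join(lines[1:])  # undo the fixed-width line wrapping
    return record


# Truncated example: only the final sequence line of the protein record.
example = """>NTDB_id=900600 R5H22_RS05935 WP_156483617.1 1276663..1278873(+) (comA) [Azotobacter sp. NL3]
GRARGLREEPRFWREK
"""
rec = parse_record(example)
print(rec["locus_tag"], rec["gene"], rec["organism"])
```

With the full record as input, `len(rec["sequence"])` can be compared against the declared length of 736 a.a.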

Nucleotide


Length: 2211 bp

>NTDB_id=900600 R5H22_RS05935 WP_156483617.1 1276663..1278873(+) (comA) [Azotobacter sp. NL3]
ATGTTGGCGCTCACCGCAGGTCTGCTCGTCCTGCGTTTCCTTCCGGCCCTACCGCCCTGGTGGCTGTGGTCGCCGATGGC
GCTGGGCGCTGCGATCCTTCTGGCGGCGCGTCGCTATTCGCCGGCGCTCTTTCTCTTCGGTCTCGGCTGGGCCTGCCTGT
CGGCGCACTGGGCGCTGGAGGAACGGTTGCCTGTCGAACTCGATGGTCGCACCCTGTGGCTGGAGGGGCTGGTGGTCGGT
CTGCCGGCGCGCATCGACGGCACGCTGCATTTCCAACTGGAGGAGGCCTCTTCCCGGCGCGCCGAACTGCCCGGGCGACT
GCGCCTAGCGTGGCACGCCGGGCCGGAGGTCCGCGCCGGGGAGCGCTGGCGCCTGGCGGTCAGCCTCAAGCGTCCGCGCG
GCCTGGTCAACCCGCAGGGTTTCGATTACGAGGCCTGGCTGCTGGCCCAGCGGATCGGCGCCACCGGGACGGTGAAAGCG
GGAGAGCGACTCGGAACGCCGGAAAACGCCGACGGTTGGCGCGATTCCCTGCGCCAGCGCCTGCTGCAGGTCGATGCCCA
TGGCCGTGAGGGCGCGCTCGCCGCGCTGGTGATGGGCGACGCGTCCGGGCTGAGCGTGGCGGACTGGAAGCTCCTGCAGG
ATACCGGCACCGTGCATCTGATGGTGATCTCCGGCCAGCATGTCGGCCTGCTTGCCGGCCTGGTCTACGGGCTGGTGGTC
CTGCTGGCGAGATTTGGCCTGTGGCCGGGTTTTCTGCCCTGGTTGCCCTGTGCCTGCGGCCTGGCCTTCGCCACCGCGCT
CGGTTATGGCTGGCTGGCCGGCTTCGGGGTACCAGTACAGCGGGCCTGCGCCATGCTCGCCGTGGTGCTGTTCTGGCGCC
TGCGTTTCCGCCACCTGGGTCTCTGGCTACCCATCCTGCTGGCGCTGGACGGCGTACTGCTGCTCGAGCCCCTGGCCAGC
CTGCAGCCGGGGTTCTGGCTGTCGTTCGGTGCGGTGGTGATCCTCGTCCTGGCCTTCGGCGGCCGGCTGGGTGCCTGGTC
GTGGCGGCAGACCCTGTGGCGAGCGCAGTGGACCAGTGCGCTGGGACTGCTACCGTTGTTGCTGGCCTTGGGCCTGCCGA
TCAGTCTCAGCGGTCCGTTGGCCAATCTGGTCGCGGTACCCTGGGTCGGTTTCGCGGTGGTCCCGCTGGCTCTGCTCGGA
ACCCTGCTGCTGCCCTTGCCGGCAATGGGCGAGGGCCTTCTCTGGCTGGCCGGCGCCTTGCTGGAGACGCTGTTTCGGCT
GCTCGGCGAGATCGCCGGCGCCGTACCGGCCTGGCTGCCCCACGCGGTGCCGGTCTGGGGCTGGCTGCTGGCGCTGCTCG
GGACCCTGCTGATCCTGCTGCCGGCGGGAGTGCCGCTGCGTGTCCCGGGACTGGCGCTGCTGCTGCCCCTGGCATTTCCG
CCGCAGGAGCGAATCCCGCAGGCACGGGCCGATGTCTGGCTGCTGGATGTCGGGCAGGGCCTTGCCGTGCTTGTGCGTAC
CCGCGGGCACGACCTGCTCTATGATGCTGGGCCGCGTTTCGGCGATTTCGATCTGGGCGAGCGCGTGGTCCTGCCTTCGC
TGCGCAATCTCGGCGTGGGCCGCCTGGATCGCCTGCTGCTCAGCCATGCCGATGGCGACCACGCCGGTGGCGCCCTGGCC
GTGCGGCGCGCTCTGCCGGTGGGCGAGGTCGTCGCCGGCGAGGCGCAGGCGCAATCGGCGGCGCTCGCCGCGCAGCCTTG
CGCCCGTCGCGCCTGGCAGTGGGATGGTGTGCGTTTCGCCACCTGGCACTGGACGGCCGTGCAAGAGGGCAATCGGGCTT
CCTGCGTGCTGCTGGTCGAGGCCGCCGGCGAGCGCCTGCTGCTGACCGGCGATATCGATGCCGCAGCCGAGCGGGCACTG
CTCGACAGCCACCCGGAGTGGCGCGCCGACTGGCTGCTGGCGCCTCACCACGGCAGCCGCAGTTCGTCTTCGCCGGCTCT
GCTCAAGGCCCTGGCGCCGCGCGCGGTGCTGATCTCGCGCGGCTGGAACAACGGCTTCGGCCATCCCCATGCGCAGGTCG
TGGAGCGTTACCGGAAGCTGCCGGCCGTGATTCACGATACTGCGCGCCAGGGGGCCCTGCGGTTTCGCCTGGGCGACTGG
GGCCGGGCGCGCGGGCTGCGCGAAGAGCCCCGCTTCTGGCGGGAAAAATGA
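
The nucleotide and protein records describe the same gene, so translating the coding sequence should reproduce the protein. Below is a minimal Python sketch of that check. It assumes the standard genetic code for codon assignments and does not handle alternative start codons, which is sufficient here because the sequence begins with ATG. `translate` is a hypothetical helper, shown on the first and last codons of the record rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # remove line wrapping and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTTGGCGCTCACC"))  # MLALT (first five codons of the record)
print(translate("GAAAAATGA"))        # EK (last two codons before the TGA stop)
```

Both fragments agree with the start (MLALT...) and end (...REK) of the protein sequence given above.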


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Pseudomonas stutzeri DSM 10701 65.912 96.06 0.633
  comA Ralstonia pseudosolanacearum GMI1000 34.814 100 0.394