Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Machinery gene
Locus tag   AVIN_RS06735 Genome accession   NC_012560
Coordinates   1454808..1457018 (+) Length   736 a.a.
NCBI ID   WP_156483617.1    Uniprot ID   -
Organism   Azotobacter vinelandii DJ     
Function   ssDNA transport through the inner membrane (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1449808..1462018
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  AVIN_RS06710 (Avin_14680) - 1449962..1450684 (-) 723 WP_012700103.1 glycerophosphodiester phosphodiesterase -
  AVIN_RS06715 (Avin_14690) - 1450932..1452182 (+) 1251 WP_012700104.1 lipoprotein-releasing ABC transporter permease subunit -
  AVIN_RS06720 (Avin_14700) lolD 1452175..1452873 (+) 699 WP_012700105.1 lipoprotein-releasing ABC transporter ATP-binding protein LolD -
  AVIN_RS06725 (Avin_14710) - 1452876..1454120 (+) 1245 WP_012700106.1 lipoprotein-releasing ABC transporter permease subunit -
  AVIN_RS06730 (Avin_14720) - 1454145..1454672 (-) 528 WP_012700107.1 DUF2062 domain-containing protein -
  AVIN_RS06735 (Avin_14730) comA 1454808..1457018 (+) 2211 WP_156483617.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  AVIN_RS06740 (Avin_14740) - 1457090..1457722 (+) 633 WP_041807047.1 MotA/TolQ/ExbB proton channel family protein -
  AVIN_RS06745 (Avin_14750) - 1457719..1458147 (+) 429 WP_012700110.1 biopolymer transporter ExbD -
  AVIN_RS06750 (Avin_14760) lpxK 1458147..1459148 (+) 1002 WP_012700111.1 tetraacyldisaccharide 4'-kinase -
  AVIN_RS06755 (Avin_14770) - 1459209..1459394 (+) 186 WP_012700112.1 Trm112 family protein -
  AVIN_RS06760 (Avin_14780) kdsB 1459391..1460155 (+) 765 WP_012700113.1 3-deoxy-manno-octulosonate cytidylyltransferase -
  AVIN_RS06765 (Avin_14790) - 1460155..1460619 (+) 465 WP_012700114.1 low molecular weight protein-tyrosine-phosphatase -
  AVIN_RS06770 (Avin_14800) murB 1460616..1461635 (+) 1020 WP_012700115.1 UDP-N-acetylmuramate dehydrogenase -

Sequence


Protein


Download         Length: 736 a.a.        Molecular weight: 79919.77 Da        Isoelectric Point: 10.2397

>NTDB_id=33769 AVIN_RS06735 WP_156483617.1 1454808..1457018(+) (comA) [Azotobacter vinelandii DJ]
MLALTAGLLVLRFLPALPPWWLWSPMALGAAILLAARRYSPALFLFGLGWACLSAHWALEERLPVELDGRTLWLEGLVVG
LPARIDGTLHFQLEEASSRRAELPGRLRLAWHAGPEVRAGERWRLAVSLKRPRGLVNPQGFDYEAWLLAQRIGATGTVKA
GERLGTPENADGWRDSLRQRLLQVDAHGREGALAALVMGDASGLSVADWKLLQDTGTVHLMVISGQHVGLLAGLVYGLVV
LLARFGLWPGFLPWLPCACGLAFATALGYGWLAGFGVPVQRACAMLAVVLFWRLRFRHLGLWLPILLALDGVLLLEPLAS
LQPGFWLSFGAVVILVLAFGGRLGAWSWRQTLWRAQWTSALGLLPLLLALGLPISLSGPLANLVAVPWVGFAVVPLALLG
TLLLPLPAMGEGLLWLAGALLETLFRLLGEIAGAVPAWLPHAVPVWGWLLALLGTLLILLPAGVPLRVPGLALLLPLAFP
PQERIPQARADVWLLDVGQGLAVLVRTRGHDLLYDAGPRFGDFDLGERVVLPSLRNLGVGRLDRLLLSHADGDHAGGALA
VRRALPVGEVVAGEAQAQSAALAAQPCARRAWQWDGVRFATWHWTAVQEGNRASCVLLVEAAGERLLLTGDIDAAAERAL
LDSHPEWRADWLLAPHHGSRSSSSPALLKALAPRAVLISRGWNNGFGHPHAQVVERYRKLPAVIHDTARQGALRFRLGDW
GRARGLREEPRFWREK

Nucleotide


Download         Length: 2211 bp        

>NTDB_id=33769 AVIN_RS06735 WP_156483617.1 1454808..1457018(+) (comA) [Azotobacter vinelandii DJ]
ATGTTGGCGCTCACCGCAGGTCTGCTCGTCCTGCGTTTCCTTCCGGCCCTACCGCCCTGGTGGCTGTGGTCGCCGATGGC
GCTGGGCGCTGCGATCCTTCTGGCGGCGCGTCGCTATTCGCCGGCGCTCTTTCTCTTCGGTCTCGGCTGGGCCTGCCTGT
CGGCGCACTGGGCGCTGGAGGAACGGTTGCCTGTCGAACTCGATGGTCGCACCCTGTGGCTGGAGGGGCTGGTGGTCGGT
CTGCCGGCGCGCATCGACGGCACGCTGCATTTCCAACTGGAGGAGGCCTCTTCCCGGCGCGCCGAACTGCCCGGGCGACT
GCGCCTAGCGTGGCACGCCGGGCCGGAGGTCCGCGCCGGGGAGCGCTGGCGCCTGGCGGTCAGCCTCAAGCGTCCGCGCG
GCCTGGTCAACCCGCAGGGTTTCGATTACGAGGCCTGGCTGCTGGCCCAGCGGATCGGCGCCACCGGGACGGTGAAAGCG
GGAGAGCGACTCGGAACGCCGGAAAACGCCGACGGTTGGCGCGATTCCCTGCGCCAGCGCCTGCTGCAGGTCGATGCCCA
TGGCCGTGAGGGCGCGCTCGCCGCGCTGGTGATGGGCGACGCGTCCGGGCTGAGCGTGGCGGACTGGAAGCTCCTGCAGG
ATACCGGCACCGTGCATCTGATGGTGATCTCCGGCCAGCATGTCGGCCTGCTTGCCGGCCTGGTCTACGGGCTGGTGGTC
CTGCTGGCGAGATTTGGCCTGTGGCCGGGTTTTCTGCCCTGGTTGCCCTGTGCCTGCGGCCTGGCCTTCGCCACCGCGCT
CGGTTATGGCTGGCTGGCCGGCTTCGGGGTACCAGTACAGCGGGCCTGCGCCATGCTCGCCGTGGTGCTGTTCTGGCGCC
TGCGTTTCCGCCACCTGGGTCTCTGGCTACCCATCCTGCTGGCGCTGGACGGCGTACTGCTGCTCGAGCCCCTGGCCAGC
CTGCAGCCGGGGTTCTGGCTGTCGTTCGGTGCGGTGGTGATCCTCGTCCTGGCCTTCGGCGGCCGGCTGGGTGCCTGGTC
GTGGCGGCAGACCCTGTGGCGAGCGCAGTGGACCAGTGCGCTGGGACTGCTACCGTTGTTGCTGGCCTTGGGCCTGCCGA
TCAGTCTCAGCGGTCCGTTGGCCAATCTGGTCGCGGTACCCTGGGTCGGTTTCGCGGTGGTCCCGCTGGCTCTGCTCGGA
ACCCTGCTGCTGCCCTTGCCGGCAATGGGCGAGGGCCTTCTCTGGCTGGCCGGCGCCTTGCTGGAGACGCTGTTTCGGCT
GCTCGGCGAGATCGCCGGCGCCGTACCGGCCTGGCTGCCCCACGCGGTGCCGGTCTGGGGCTGGCTGCTGGCGCTGCTCG
GGACCCTGCTGATCCTGCTGCCGGCGGGAGTGCCGCTGCGTGTCCCGGGACTGGCGCTGCTGCTGCCCCTGGCATTTCCG
CCGCAGGAGCGAATCCCGCAGGCACGGGCCGATGTCTGGCTGCTGGATGTCGGGCAGGGCCTTGCCGTGCTTGTGCGTAC
CCGCGGGCACGACCTGCTCTATGATGCTGGGCCGCGTTTCGGCGATTTCGATCTGGGCGAGCGCGTGGTCCTGCCTTCGC
TGCGCAATCTCGGCGTGGGCCGCCTGGATCGCCTGCTGCTCAGCCATGCCGATGGCGACCACGCCGGTGGCGCCCTGGCC
GTGCGGCGCGCTCTGCCGGTGGGCGAGGTCGTCGCCGGCGAGGCGCAGGCGCAATCGGCGGCGCTCGCCGCGCAGCCTTG
CGCCCGTCGCGCCTGGCAGTGGGATGGTGTGCGTTTCGCCACCTGGCACTGGACGGCCGTGCAAGAGGGCAATCGGGCTT
CCTGCGTGCTGCTGGTCGAGGCCGCCGGCGAGCGCCTGCTGCTGACCGGCGATATCGATGCCGCAGCCGAGCGGGCACTG
CTCGACAGCCACCCGGAGTGGCGCGCCGACTGGCTGCTGGCGCCTCACCACGGCAGCCGCAGTTCGTCTTCGCCGGCTCT
GCTCAAGGCCCTGGCGCCGCGCGCGGTGCTGATCTCGCGCGGCTGGAACAACGGCTTCGGCCATCCCCATGCGCAGGTCG
TGGAGCGTTACCGGAAGCTGCCGGCCGTGATTCACGATACTGCGCGCCAGGGGGCCCTGCGGTTTCGCCTGGGCGACTGG
GGCCGGGCGCGCGGGCTGCGCGAAGAGCCCCGCTTCTGGCGGGAAAAATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Pseudomonas stutzeri DSM 10701

65.912

96.06

0.633

  comA Ralstonia pseudosolanacearum GMI1000

34.814

100

0.394


Multiple sequence alignment