Detailed information    

insolico Bioinformatically predicted

Overview


Name   comFA   Type   Machinery gene
Locus tag   ACFMPA_RS18260 Genome accession   NZ_CP172605
Coordinates   3535059..3536450 (-) Length   463 a.a.
NCBI ID   WP_029318788.1    Uniprot ID   -
Organism   Bacillus sp. SG20032     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3530059..3541450
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACFMPA_RS18225 (ACFMPA_18225) flgL 3530231..3531127 (-) 897 WP_015714797.1 flagellar hook-associated protein FlgL -
  ACFMPA_RS18230 (ACFMPA_18230) flgK 3531138..3532661 (-) 1524 WP_029318789.1 flagellar hook-associated protein FlgK -
  ACFMPA_RS18235 (ACFMPA_18235) flgN 3532680..3533162 (-) 483 WP_032722609.1 flagellar protein FlgN -
  ACFMPA_RS18240 (ACFMPA_18240) flgM 3533178..3533444 (-) 267 WP_014478128.1 flagellar biosynthesis anti-sigma factor FlgM -
  ACFMPA_RS18245 (ACFMPA_18245) - 3533525..3533944 (-) 420 WP_003227995.1 TIGR03826 family flagellar region protein -
  ACFMPA_RS18250 (ACFMPA_18250) comFC 3534017..3534739 (-) 723 WP_014481051.1 comF operon protein ComFC Machinery gene
  ACFMPA_RS18255 (ACFMPA_18255) comFB 3534703..3534999 (-) 297 WP_015483764.1 late competence protein ComFB -
  ACFMPA_RS18260 (ACFMPA_18260) comFA 3535059..3536450 (-) 1392 WP_029318788.1 ATP-dependent helicase ComFA Machinery gene
  ACFMPA_RS18265 (ACFMPA_18265) - 3536556..3537401 (-) 846 WP_003244125.1 DegV family protein -
  ACFMPA_RS18270 (ACFMPA_18270) degU 3537499..3538188 (-) 690 WP_003219701.1 two-component system response regulator DegU Regulator
  ACFMPA_RS18275 (ACFMPA_18275) degS 3538271..3539428 (-) 1158 WP_003227983.1 two-component sensor histidine kinase DegS Regulator
  ACFMPA_RS18280 (ACFMPA_18280) - 3539645..3540298 (+) 654 WP_003227979.1 YigZ family protein -

Sequence


Protein


Download         Length: 463 a.a.        Molecular weight: 52587.64 Da        Isoelectric Point: 9.9687

>NTDB_id=1068252 ACFMPA_RS18260 WP_029318788.1 3535059..3536450(-) (comFA) [Bacillus sp. SG20032]
MNVPVEKNGSFSKELQQTLRSRHLLRTELSFSDEMIEWHIKNGYITAENSISINKRRYRCNRCGQTDQRYFSFYHSSGKN
KLYCRSCVMMGRVSEEVPLYSWKEENESNWQSIKLTWDGKLSSGQQKAANVLIEAISKKEELLIWAVCGAGKTEMLFPGI
ESALNQGLRVCIATPRTDVVLELAPRLKAAFQGADISALYGGSDDKGRLSPLMISTTHQLLRYKDAIDVMIIDEVDAFPY
SADQTLQFAVQKARKKNSTLVYLSATPPKELKRKALNGQLHSVRIPARHHRKPLPEPRFVWCGNWKKKLNRNKIPPAVKR
WIEFHIKEGRPVFLFVPSVSILEKAAACFRGVHCRTASVHAEDKHRKEKVQQFRDGQLDLLITTTILERGVTVPKVQTGV
LGAESPIFTESALVQIAGRTGRHKEYADGDVIYFHFGKTKSMLDARKHIKEMNELAAKVECTD

Nucleotide


Download         Length: 1392 bp        

>NTDB_id=1068252 ACFMPA_RS18260 WP_029318788.1 3535059..3536450(-) (comFA) [Bacillus sp. SG20032]
GTGAATGTGCCAGTTGAAAAAAACGGTTCCTTTTCAAAAGAATTGCAGCAGACGCTTCGAAGCCGTCATTTGCTCAGGAC
TGAGCTCTCATTTTCCGATGAGATGATTGAATGGCATATCAAGAATGGATATATCACTGCTGAAAATTCTATATCCATAA
ATAAACGGAGATATAGATGTAATAGGTGCGGACAAACTGATCAGCGGTATTTTTCTTTTTATCACTCATCTGGAAAGAAT
AAGCTGTATTGCCGTTCCTGTGTCATGATGGGCAGAGTGAGTGAGGAGGTTCCTTTATATTCATGGAAAGAGGAAAATGA
ATCAAACTGGCAGTCAATTAAACTGACATGGGATGGCAAGCTTTCAAGCGGACAGCAAAAAGCCGCCAATGTTTTAATTG
AAGCAATATCAAAAAAAGAAGAGCTCCTCATCTGGGCGGTTTGCGGCGCTGGCAAAACAGAAATGCTGTTTCCTGGTATA
GAATCAGCGTTAAATCAAGGACTGCGTGTATGTATTGCAACACCTCGCACCGATGTTGTATTAGAGCTTGCTCCAAGACT
CAAGGCTGCCTTTCAGGGTGCTGACATTTCAGCGCTTTACGGAGGAAGCGATGACAAAGGGCGGCTATCTCCGCTTATGA
TTTCCACTACGCATCAGCTTTTGCGATATAAAGATGCAATCGATGTTATGATCATTGATGAAGTTGACGCTTTTCCATAT
TCTGCTGATCAAACCCTTCAATTCGCTGTTCAAAAAGCAAGAAAGAAAAACAGCACCCTCGTTTATTTAAGTGCAACACC
TCCTAAAGAATTAAAAAGAAAAGCACTGAACGGACAGTTACATTCAGTTCGCATCCCCGCAAGACACCACCGGAAACCTT
TACCCGAACCGCGCTTTGTATGGTGTGGAAACTGGAAGAAGAAATTAAACCGAAATAAAATTCCGCCAGCGGTGAAAAGA
TGGATAGAGTTTCATATAAAAGAAGGGAGGCCTGTTTTTTTATTCGTTCCTTCCGTTTCTATTCTGGAAAAGGCTGCTGC
GTGCTTTAGAGGTGTTCATTGCCGAACCGCATCTGTGCACGCGGAAGACAAGCATAGAAAGGAGAAAGTGCAGCAATTCA
GAGATGGTCAGCTCGATTTATTAATCACAACAACAATACTGGAAAGAGGCGTCACAGTCCCCAAGGTGCAAACGGGTGTA
CTAGGAGCGGAATCACCTATCTTTACGGAAAGCGCACTTGTTCAAATTGCAGGAAGAACCGGCCGGCATAAAGAATATGC
GGACGGCGATGTCATTTACTTTCACTTCGGCAAAACAAAGAGTATGCTCGATGCAAGAAAGCATATAAAAGAAATGAATG
AATTGGCAGCAAAAGTTGAATGTACAGACTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comFA Bacillus subtilis subsp. subtilis str. 168

98.92

100

0.989


Multiple sequence alignment