Detailed information    

in silico (bioinformatically predicted)

Overview


Name: comGA
Type: Machinery gene
Locus tag: GTN31_RS06875
Genome accession: NZ_CP047361
Coordinates: 1335405..1336376 (-)
Length: 323 a.a.
NCBI ID: WP_164942281.1
UniProt ID: -
Organism: Macrococcoides canis strain SD607
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
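
The coordinates and the protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function name `cds_lengths` is hypothetical and not part of any database tooling.

```python
def cds_lengths(start, end):
    """Return (nucleotide length, protein length) for a coding sequence.

    Assumes 1-based inclusive coordinates and exactly one terminal stop codon.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, so it does not contribute a residue.
    aa_len = nt_len // 3 - 1
    return nt_len, aa_len


nt_len, aa_len = cds_lengths(1335405, 1336376)
print(nt_len, aa_len)  # 972 323
```

Under these assumptions the result agrees with the 972 bp and 323 a.a. lengths reported for comGA.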

Genomic Context


Location: 1330405..1341376
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GTN31_RS06840 (GTN31_06830) gcvT 1331100..1332185 (-) 1086 WP_164943023.1 glycine cleavage system aminomethyltransferase GcvT -
  GTN31_RS06845 (GTN31_06835) - 1332323..1332829 (-) 507 WP_164942276.1 shikimate kinase -
  GTN31_RS06850 (GTN31_06840) - 1332990..1333442 (-) 453 WP_164942277.1 competence type IV pilus minor pilin ComGF -
  GTN31_RS06855 (GTN31_06845) - 1333387..1333629 (-) 243 WP_164942278.1 hypothetical protein -
  GTN31_RS06860 (GTN31_06850) comGD 1333616..1334041 (-) 426 WP_280514960.1 competence type IV pilus minor pilin ComGD -
  GTN31_RS06865 (GTN31_06855) comGC 1334034..1334366 (-) 333 WP_174804787.1 competence type IV pilus major pilin ComGC -
  GTN31_RS06870 (GTN31_06860) comGB 1334381..1335430 (-) 1050 WP_164942280.1 competence type IV pilus assembly protein ComGB -
  GTN31_RS06875 (GTN31_06865) comGA 1335405..1336376 (-) 972 WP_164942281.1 competence type IV pilus ATPase ComGA Machinery gene
  GTN31_RS06880 (GTN31_06870) - 1336589..1336837 (+) 249 WP_086042733.1 DUF2626 domain-containing protein -
  GTN31_RS06885 (GTN31_06875) - 1336865..1337485 (-) 621 WP_164942282.1 MBL fold metallo-hydrolase -
  GTN31_RS06890 (GTN31_06880) - 1337575..1337814 (+) 240 WP_174804788.1 DUF2759 domain-containing protein -
  GTN31_RS06895 (GTN31_06885) - 1337841..1338152 (-) 312 WP_086042736.1 MTH1187 family thiamine-binding protein -
  GTN31_RS06900 (GTN31_06890) - 1338152..1339120 (-) 969 WP_164942283.1 ROK family glucokinase -
  GTN31_RS06905 (GTN31_06895) - 1339117..1339308 (-) 192 WP_086042738.1 YqgQ family protein -
  GTN31_RS06910 (GTN31_06900) - 1339313..1340761 (-) 1449 WP_164942284.1 rhomboid family protein -
  GTN31_RS06915 (GTN31_06905) - 1340737..1341291 (-) 555 WP_164942285.1 5-formyltetrahydrofolate cyclo-ligase -
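
The Location range of the genomic context matches the comGA coordinates extended by 5,000 bp on each side. That flank size is inferred from the numbers in this record and is not stated by the source. A minimal sketch, with hypothetical helper names, that reproduces the window and checks whether a neighbouring gene lies entirely inside it:

```python
def context_window(start, end, flank=5000):
    """Extend a gene's coordinates by `flank` bp on both sides.

    The default flank of 5000 bp is an assumption inferred from this record.
    """
    return max(1, start - flank), end + flank


def is_within(gene_start, gene_end, window):
    """True if the gene lies entirely inside the (start, end) window."""
    return window[0] <= gene_start and gene_end <= window[1]


window = context_window(1335405, 1336376)
print(window)                                # (1330405, 1341376)
print(is_within(1331100, 1332185, window))   # gcvT -> True
```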

Sequence


Protein


Length: 323 a.a.; Molecular weight: 37275.84 Da; Isoelectric point: 6.5751

>NTDB_id=412959 GTN31_RS06875 WP_164942281.1 1335405..1336376(-) (comGA) [Macrococcoides canis strain SD607]
MEQLFNEIIEEAIIQSASDIHFIPNDKSISIKFRIHGDIEDYREIDDMLFRKLLSYIKFTAHLDVSEKNKAQSGIIQFNL
KDLRYNIRASTLPRSLGDEACVLRIIRQNFIDEYQTNDKILYNQIKKSSGIIIISGPTGSGKSTLMYQLVHYARNTLKRQ
IISIEDPVEQHIEGIIQVNVNEKAEITYHTAIKAILRCDPDIIMLGEIRDSVVAHQVINAGLSGHLVLTTLHANDCIGAL
FRLKEMGINAVDLYQSINLIINQRLVRKKDGMGRLLAYEFLTKKDIEKYLQGEHINYRTLTDQFKELYETNQISHYEFEK
FDL

Nucleotide


Length: 972 bp

>NTDB_id=412959 GTN31_RS06875 WP_164942281.1 1335405..1336376(-) (comGA) [Macrococcoides canis strain SD607]
ATGGAACAGTTATTCAATGAAATAATAGAGGAAGCAATTATACAAAGTGCATCAGATATTCACTTCATACCAAACGATAA
AAGTATATCTATCAAATTTAGGATACACGGTGATATTGAAGACTATAGAGAAATCGACGACATGTTATTCAGGAAATTAC
TTTCTTATATTAAATTTACGGCACATCTTGATGTATCAGAAAAGAATAAGGCACAGAGTGGAATCATCCAATTTAATCTG
AAAGACTTAAGATATAATATTCGCGCATCTACTTTACCTCGATCATTAGGCGATGAAGCATGTGTACTGAGAATCATCAG
ACAAAATTTTATAGATGAATATCAGACGAATGACAAGATATTGTACAATCAAATAAAAAAATCAAGCGGTATTATCATCA
TAAGTGGACCAACTGGAAGCGGGAAAAGCACATTGATGTATCAGCTTGTACATTATGCGAGGAACACATTGAAGAGACAA
ATAATATCAATAGAAGATCCTGTTGAACAGCATATTGAAGGAATTATACAAGTTAATGTTAACGAAAAAGCAGAGATAAC
GTATCATACTGCGATAAAGGCAATCCTTAGATGTGACCCGGATATCATTATGCTAGGTGAAATCAGAGATTCAGTCGTAG
CACATCAAGTGATCAATGCAGGACTAAGCGGTCATCTCGTTTTAACAACATTACATGCGAATGATTGCATCGGTGCATTA
TTTCGGTTAAAAGAAATGGGTATTAATGCTGTTGATCTATATCAAAGTATCAATCTGATTATTAATCAAAGGCTTGTAAG
AAAAAAAGATGGGATGGGGCGTCTATTGGCCTATGAATTTTTAACAAAAAAGGATATTGAAAAATATTTACAAGGTGAAC
ATATTAACTATAGAACGTTGACCGATCAATTTAAGGAGCTCTATGAAACAAATCAGATTTCACACTATGAATTCGAAAAA
TTTGATCTATAA
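
Although comGA is annotated on the minus strand, the nucleotide sequence above starts with ATG and ends with TAA, so it appears to be given in coding orientation. The sketch below translates the first 30 and last 12 nucleotides with the standard amino acid assignments (which the bacterial translation table shares) and compares them with the ends of the protein sequence. The helper `translate` and the compact codon table layout are illustrative choices, not the database's own method.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a coding-orientation DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


first_30 = "ATGGAACAGTTATTCAATGAAATAATAGAG"  # start of the nucleotide record
last_12 = "TTTGATCTATAA"                     # end of the nucleotide record

print(translate(first_30))  # MEQLFNEIIE
print(translate(last_12))   # FDL*
```

Both fragments agree with the protein record, which begins with MEQLFNEIIE and ends with FDL.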


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism                                      Identities (%)   Coverage (%)   Ha-value
comGA     Staphylococcus aureus MW2                     49.39            100            0.502
comGA     Staphylococcus aureus N315                    49.39            100            0.502
comGA     Bacillus subtilis subsp. subtilis str. 168    37.73            100            0.381
pilB      Glaesserella parasuis strain SC1401           34.302           100            0.365
pilB      Vibrio parahaemolyticus RIMD 2210633          35.03            100            0.362


Multiple sequence alignment