Services - EMBOSS compseq (SOAP and Soaplab) [Provided by: European Bioinformatics Institute (EMBL-EBI)]

This looks like a Soaplab service. Click here for more info and guidance on how to use this service )

About Soaplab

Soaplab services are command line applications, wrapped as SOAP services, and served from a Soaplab server. All Soaplab services have the same generic set of SOAP operations (depending on the Soaplab version) as they all share a standardised interface.

Certain tools, like the Taverna workflow workbench, provide automatic support for the Soaplab way of executing these services. In some cases you will need to use the Soaplab Server Base URL rather than the WSDL location in these tools.
More information on Soaplab clients is available here.

Further documentation on Soaplab services is available:

Provider:
European Bioinformatics Institute (EMBL-EBI)

Location:
United Kingdom

Submitter/Source:
SoapLab Support (over 11 years ago)

Base URL:
http://www.ebi.ac.uk:80/soaplab/typed/services/nucleic_composition.compseq

WSDL Location:
http://www.ebi.ac.uk/soaplab/typed/services/nucleic_composition.compseq?wsdl(download last cached WSDL file)

Documentation URL(s): SoapLab Support (over 10 years ago) http://emboss.sourceforge.net/apps/release/6.0/emboss/apps/compseq.html Login to add a documentation URL Description(s): from provider’s description doc (over 10 years ago) Calculate the composition of unique words in sequences

SoapLab Support(over 10 years ago)

Calculate the composition of unique words in sequences

Login to add a description ELIXIR Description(s): No info yet Login to add an elixir description Details (from Soaplab server): from Soaplab server(over 10 years ago)

ds_lsr_analysis :
- analysis :
  - name : compseq
  - installation : Soaplab2 default installation
  - type : Nucleic Composition, Protein Composition
  - output :
  - description : Calculate the composition of unique words in sequences
  - version : 6.1.0
  - analysis_extension :
  - input :

Show all

ds_lsr_analysis :
- analysis :
  - name : compseq
  - installation : Soaplab2 default installation
  - type : Nucleic Composition, Protein Composition
  - output :
  - description : Calculate the composition of unique words in sequences
  - version : 6.1.0
  - analysis_extension :
    - option :
      - name : emboss
      - type : normal
      - value : true
      - name : installation
      - type : normal
      - value : Soaplab2 default installation
      - name : version
      - type : normal
      - value : 6.1.0
    - event :
      - action :
      - id : _E_1
    - app_info :
      - category : nucleic_composition
      - help_url : http://emboss.sourceforge.net/apps/release/6.1/emboss/apps/compseq.html
    - parameter :
      - data :
        
        list :
        
        list_item :
        
        level : 0
        
        value : gcg
        
        level : 0
        
        value : gcg8
        
        level : 0
        
        value : embl
        
        level : 0
        
        value : swiss
        
        level : 0
        
        value : fasta
        
        level : 0
        
        value : ncbi
        
        level : 0
        
        value : genbank
        
        level : 0
        
        value : nbrf
        
        level : 0
        
        value : pir
        
        level : 0
        
        value : codata
        
        level : 0
        
        value : strider
        
        level : 0
        
        value : clustal
        
        level : 0
        
        value : phylip
        
        level : 0
        
        value : acedb
        
        level : 0
        
        value : msf
        
        level : 0
        
        value : jackknifer
        
        level : 0
        
        value : jackknifernon
        
        level : 0
        
        value : nexus
        
        level : 0
        
        value : nexusnon
        
        level : 0
        
        value : treecon
        
        level : 0
        
        value : mega
        
        level : 0
        
        value : meganon
        
        level : 0
        
        value : ig
        
        level : 0
        
        value : staden
        
        level : 0
        
        value : text
        
        level : 0
        
        value : raw
        
        type : full
      - base :
      - range :
        
        format : %d
        
        max :
        
        min : 1
        
        repeatable :
      - base :
        
        name : word
        
        help : This is the size of word (n-mer) to count. Thus if you want to count codon frequencies for a nucleotide sequence, you should enter 3 here.
        
        default : 2
        
        option :
        
        name : calculated_hardmax
        
        type : normal
        
        value : ${acdprotein}? 4 : 6
        
        name : scalemin
        
        type : style
        
        value : 1
        
        mandatory : false
        
        qualifier : word
        
        prompt : Word size to consider (e.g. 2=dimer)
        
        type : long
        
        ordering : 6
      - data :
        
        result :
        
        extension : composition
        
        iotype : output
      - base :
      - data :
        
        ioformat : unspecified
        
        iotype : input
        
        repeatable :
      - base :
      - range :
        
        format : %d
        
        max :
        
        min : 0
        
        repeatable :
      - base :
        
        name : frame
        
        help : The normal behaviour of ‘compseq’ is to count the frequencies of all words that occur by moving a window of length ‘word’ up by one each time. This option allows you to move the window up by the length of the word each time, skipping over the intervening words. You can count only those words that occur in a single frame of the word by setting this value to a number other than zero. If you set it to 1 it will only count the words in frame 1, 2 will only count the words in frame 2 and so on.
        
        default : 0
        
        option :
        
        name : calculated_hardmax
        
        type : normal
        
        value : ${word}
        
        mandatory : false
        
        qualifier : frame
        
        prompt : Frame of word to look at (0=all frames)
        
        type : long
        
        ordering : 8
      - base :
        
        name : ignorebz
        
        help : The amino acid code B represents Asparagine or Aspartic acid and the code Z represents Glutamine or Glutamic acid. These are not commonly used codes and you may wish not to count words containing them, just noting them in the count of ‘Other’ words.
        
        default : false
        
        mandatory : false
        
        qualifier : ignorebz
        
        prompt : Ignore the amino acids B and Z and just count them as ‘Other’
        
        type : boolean
        
        ordering : 9
      - base :
        
        name : reverse
        
        help : Set this to be true if you also wish to also count words in the reverse complement of a nucleic sequence.
        
        default : false
        
        mandatory : false
        
        qualifier : reverse
        
        prompt : Count words in the forward and reverse sense
        
        type : boolean
        
        ordering : 10
      - base :
        
        name : calcfreq
        
        help : If this is set true then the expected frequencies of words are calculated from the observed frequency of single bases or residues in the sequences. If you are reporting a word size of 1 (single bases or residues) then there is no point in using this option because the calculated expected frequency will be equal to the observed frequency. Calculating the expected frequencies like this will give an approximation of the expected frequencies that you might get by using an input file of frequencies produced by a previous run of this program. If an input file of expected word frequencies has been specified then the values from that file will be used instead of this calculation of expected frequency from the sequence, even if ‘calcfreq’ is set to be true.
        
        default : false
        
        mandatory : false
        
        qualifier : calcfreq
        
        prompt : Calculate expected frequency from sequence
        
        type : boolean
        
        ordering : 11
      - base :
        
        name : zerocount
        
        help : You can make the output results file much smaller if you do not display the words with a zero count.
        
        default : false
        
        mandatory : false
        
        qualifier : zerocount
        
        prompt : Display the words that have a frequency of zero
        
        type : boolean
        
        ordering : 15
  - input :
    - name : word
    - default : 2
    - mandatory : false
    - type : long
    - name : frame
    - default : 0
    - mandatory : false
    - type : long
    - name : ignorebz
    - default : false
    - mandatory : false
    - type : boolean
    - name : reverse
    - default : false
    - mandatory : false
    - type : boolean
    - name : calcfreq
    - default : false
    - mandatory : false
    - type : boolean
    - name : zerocount
    - default : false
    - mandatory : false
    - type : boolean

License(s): No info yet Login to add license info Cost: No info yet Login to add cost info Usage conditions: No info yet Login to add usage conditions info Contact info: No info yet Login to add contact info How to cite this service: No info yet Login to add how to cite info Publications about this service: for this service (this can be in a common citation format like Bibtex, MLA or APA, a DOI, a URL, etc.) No info yet Login to add publications info Citations of this service: No info yet Login to add citations info Example workflows using this service: See all workflows on myExperiment that use this service Login to add workflows info