OPH_REDUCE2

Description

Type

Data Process.

Behaviour

It performs a reduction operation based on hierarchy on a datacube.

Parameters

  • cube: name of the input datacube. The name must be in PID format.
  • schedule: scheduling algorithm. The only possible value is 0, for a static linear block distribution of resources.
  • dim: name of dimension on which the operation will be applied. By default the operator considers the implicit dimension with the highest level.
  • concept_level: concept level inside the hierarchy used for the operation.
  • midnight: if 00, then the edge point of two consecutive aggregate time sets will be aggregated into the right set; if 24 (default), then the edge point will be aggregated into the left set.
  • operation: reduction operation. Possible values are:
    • “count”: to evaluate the actual values (not missing)
    • “max”: to evaluate the maximum value
    • “min”: to evaluate the minimum value
    • “avg”: to evaluate the mean value
    • “sum”: to evaluate the sum
    • “std”: to evaluate the standard deviation
    • “var”: to evaluate the variance
    • “cmoment”: to evaluate the central moment
    • “acmoment”: to evaluate the absolute central moment
    • “rmoment” to evaluate the raw moment
    • “armoment” to evaluate the absolute raw moment
    • “quantile”: to evaluate the quantile
    • “arg_max” to evaluate the index of the maximum value
    • “arg_min” to evaluate the index of the minimum value
  • order: order used in evaluation of the moments or value of the quantile in range [0, 1].
  • missingvalue: value to be considered as missing value; by default it is NAN (for float and double).
  • grid: optional argument used to identify the grid of dimensions to be used (if the grid already exists) or the one to be created (if the grid has a new name). If it isn’t specified, no grid will be used.
  • container: name of the container to be used to store the output cube; by default, it is the input container.
  • check_grid: optional flag to be enabled in case the values of grid have to be checked (valid only if the grid already exists).
  • description: additional description to be associated with the output cube.

System parameters

  • exec_mode: operator execution mode. Possible values are async (default) for asynchronous mode, sync for synchronous mode with json-compliant output.
  • ncores: number of parallel processes to be used (min. 1).
  • nthreads: number of parallel threads per process to be used (min. 1).
  • sessionid: session identifier used server-side to manage sessions and jobs. Usually, users don’t need to use/modify it, except when it is necessary to create a new session or switch to another one.
  • objkey_filter: filter on the output of the operator written to file (default=all => no filter, none => no output, reduce2 => shows operator’s output PID as text).

Examples

Do a data reduction to compute the maximum value:

[OPH_TERM] >>  oph_reduce2 operation=max;dim=time;concept_level=A;cube=URL/1/1;grid=new_grid;

Arguments

Argument name Type Mandatory Values Default Min/Max-value
sessionid “string” “no”   “null”  
ncores “int” “no”   “1” “1” /
nthreads “int” “no”   “1” “1” /
exec_mode “string” “no” “async|sync” “async”  
cube “string” “yes”      
schedule “int” “no” “0” “0”  
dim “string” “no”   “-“  
concept_level “char” “no”   “A”  
midnight “char” “no” “00|24” “24”  
operation “string” “yes” “count|max|min|avg|sum|std|var|cmoment|acmoment|rmoment|armoment|quantile|arg_max|arg_min”    
order “real” “no”   “2” “0” /
missingvalue “real” “no”   “NAN”  
grid “string” “no”   “-“  
container “string” “no”   “-“  
check_grid “string” “no” “yes|no” “yes”  
description “string” “no”   “-“  
objkey_filter “string” “no” “all|none|reduce2” “all”