You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 15 Next »

This page describes Odysseus Script, a language that allows to run (sequences of) queries in different query languages like Continuous Query Language (CQL) or Procedural Query Language (PQL) and to configure the system.

 

Structure

The structure of an Odysseus Script may contain different things: commands, comments, variables, constants, macros or control flows.

Commands

Commands are normally those statements that are send to Odysseus, e.g. to install a query or to configure a setting. Each command in Odysseus Script begins with a hash/number sign (#) followed by its name and, finally, by some parameters (if the command needs some parameters).

#COMMAND parameter1 parameter2

Normally, one command is executed for its own and has no impact to other commands. However, the #QUERY command needs current settings like the parser, which is set by the command #PARSER or the transformation configuration, which is set by the command #TRANSCFG. Thus, you should normally always begin with the follwing stub (assuming that you use PQL and the standard transformation configuration):

#PARSER PQL
#TRANSCFG Standard
...

Comments

Comments mark lines that should be ignored by the parser. They are defined per line by using three slashes.

///this is ignored by the parser

Variables

Variables can be used to reuse certain values or can be used for changeable parameters that are moved to the top of a file so that it becomes more clearly. A variable can be set by using #DEFINE and be unset using #UNDEF. The access to a varibale is possible with ${....}. The existence of a variable (if it is defined or not) can be used and checked with the #IFDEF control flow. The following example shows two variables: an integer called "currentid" that has the value "1234" and a variable named "path", which has the value "F:/odysseus/example/".

#DEFINE currentid 1234
#DEFINE path F:/odysseus/example/
#RUNQUERY
SELECT * FROM example WHERE id = ${currentid}
#RUNQUERY
CREATE STREAM source (id Double, data STRING)
    WRAPPER 'GenericPush'
    PROTOCOL 'CSV'
    TRANSPORT 'File'
    DATAHANDLER 'Tuple'
    OPTIONS ( 'filename' '${path}input.csv')

The first variable "currentid" is used in the first query, so that it is equal to "SELECT * FROM example WHERE id = 1234". The second variable is used in the second query as a prefix for the filename. Notice, that variables are simply replaced by its values (there is no string-concatenation like you may see in the path-example).

Control Flows

Control flows are statements that are used to define which of the commands are executed, which not or how often they are executed. There are simple control flows like a for-loop (#LOOP) or a if-then-else (#IFDEF)

Constants

Constants are varibales that exists without defining them explicitly. For example, a default variable is NOW so that ${NOW} can be used to get the current time in millis. This is might be useful if the time of the script execution is needed (e.g. for filenames).

Procedures and Macros

Procedures and macros gives the user a possibility to reuse a certain snippet of the code. They can be distinguished between parameterizable procedures and simply reusable macros. Another advantage: Procedures (#PROCEDURE) are stored in the data dictionary so that their availablity is (according to the user's rights) system wide.

Control Flows

There are some control flows that allows to define how certain commands are executed.

#LOOP

This control flow allows a simple for-loop, which may be used to execute the same queries or commands two or more times.

Usage

Example

The examples shows a loop that repeats 10 times (i=0 until i<10) and executes the "#RUNQUERY SELECT ..." accordingly ten times. Furthermore, the counter "i" is used within the query, so that each ${i} is replaced by the current value of i from the loop.

#LOOP i 0 UPTO 10
    #RUNQUERY
    SELECT ${i} AS b, * FROM bid 
#ENDLOOP

You may also use ${i-1} or ${i+1} (only this two!). For example, this

#LOOP i 2 UPTO 4
    #RUNQUERY
    SELECT ${i-1} AS a, ${i+1} AS b, * FROM bid WHERE b>${i}
#ENDLOOP

is equal to

#RUNQUERY
SELECT 1 AS b, * FROM bid3 WHERE b>2
#RUNQUERY
SELECT 2 AS b, * FROM bid4 WHERE b>3


You may also use an additional offset variable. This offset variable adds a defined value to the current value of the actual loop variable. Following example uses an offset of x = 5:

#RUNQUERY
#LOOP i 2 UPTO 5 WITH x 5
    SELECT ${x} AS b, * FROM bid 
#ENDLOOP

This is equal to:

#RUNQUERY
SELECT 7 AS b, * FROM bid 
SELECT 8 AS b, * FROM bid 
SELECT 9 AS b, * FROM bid 

#IFDEF

With #IFDEF it is possible to check whether a variable exists and was set by #DEFINE or not. This is useful, for example, to run certain queries corresponding to the current setting.

Usage

Example

The example defines a variable called latencyOn and uses the #IFDEF command to use either StandardLatency for the transformation configuration, if latencyOn is set or Standard if it is not set. Obviously, this example uses allways the <then-command>-part (since latencyOn is set), so you may switch to another transformation config by simply commenting the #DEFINE command out so that the <else-commands> are used.

#DEFINE latencyOn
....
#IFDEF latencyOn
	#TRANSCFG StandardLatency	
#ELSE
	#TRANSCFG Standard
#ENDIF

 

Stored Procedures and Macros

Stored procedures and macros allows to reuse written Odysseu Script.

#PROCEDURE

The #PROCEDURE allows to create stored procedures that are saved into the data dictionary and can be reused by any other scripts, see #EXECUTE how to run them.

Usage

First, the name, which must be after the #PROCEDURE in the same line, is defined. After that, there might be an ordered list of variables. The order is important! These variables will be used as the parameters. Finally, there can be any Odysseus-Script code including queries etc. between the BEGIN and END. The only exception are "global" variables. Between BEGIN and END, only variables are allowed that are defined before (after the procedure name). You can execute the procedure by using the #EXECUTE command.

There cannot be installed more than one procedure with the same name, so you may delete a procedure by calling #DROPPROCEDURE before.

 

Example

The example creates a procedure with name "setSomething" and has two parameters (varX and attribute). Remeber, order is important! The procedure calls a #RUNQUERY-Command including both parameters. See  #EXECUTE command how this example is used.

#PROCEDURE setSomething
varX
attribute
BEGIN
    #RUNQUERY
    SELECT 1 AS a, 2 AS ${attribute}, * FROM bid WHERE b>${varX}
END

#EXECUTE

The #EXECUTE command can be used for running installed procedures, which were created by using the #PROCEDURE command.

Usage

The usage is similar to function calls in programming languages like Java. After #EXECUTE the name of procedure with a comma-separated list of its parameters. The parameters must be according to the definition of the procedure. If there were two parameters defined by #PROCEDURE, here are also two parameters needed. Notice, the order is important!

You can create the procedure by using the #PROCEDURE command or delete a procedure by calling #DROPPROCEDURE

 

Example

This example concludes the one from #PROCEDURE. Therefore we have two parameters (varX and attribute). The following executes the "setSomething" procedure and sets varX=1 and attribute=b.

#EXECUTE setSomething(1, b)

The variables are replaced and the according Odysseus Script of the procedure is executed at this point. Therefore, the following is inserted insted of the #EXECUTE command:

    #RUNQUERY
    SELECT 1 AS a, 2 AS b, * FROM bid WHERE b>1

Since this snippet is simply inserted, all things of the surrounding Odysseus Script is taken. In our example, this is for example, the choosen #PARSER and #TRANSCFG that are necessary for #RUNQUERY here. Therefore, it is not guaranteed that a procedure is executable of its own.

 

#DROPPROCEDURE

The command can be used to remove stored procedures, which were created by using the #PROCEDURE (see for more information about procedures) command.

Usage

You can create the procedure by using the #PROCEDURE command or execute a procedure by calling #EXECUTE

 

Example

This example concludes the one from #PROCEDURE. Therefore, we want to remove the "setSomething" procedure:

#DROPPROCEDURE setSomething

Commands

#ADDQUERY

This command executes a query in a certain langauge and is equal to #QUERY

#BUFFERPLACEMENT

This command is used to control how buffers are (automatically) placed within the query plan if a query is transformed (e.g. by #QUERY).

Parameters

Examples

No buffers:

#BUFFERPLACEMENT None

Adds a buffer before each operator:

#BUFFERPLACEMENT Standard Buffer Placement

Adds a buffer after each source:

#BUFFERPLACEMENT Source Buffer Placement

Adds a buffer for each query:

#BUFFERPLACEMENT Query Buffer Placement

 

#DEFINE

This command is used to define variables to reuse certain values. See also at Variables how to use a defined variable or at #IFDEF to see how to use defined variables within if-statements.

Parameters

Example

The first variable is called "one" and has no value. The second variable is called "two" and has the value "1234". See at Variables or at #IFDEF for examples how to use a variable.

#DEFINE one
#DEFINE two 1234

#DOQUERYSHARING

This command switches the query sharing (which tries to optimize a query be reusing parts of already installed query plans) on or off.

Parameters

Example

/// query sharing off
#DOQUERYSHARING false
/// query sharing on
#DOQUERYSHARING true

 

#DOREWRITE

This command switches the rewriting (tries to optimize a query plan by switching, deleting, splitting or merging operators without changing the query's semantics) on or off.

Parameters

Example

/// query rewrite off
#DOREWRITE false
/// query rewrite on
#DOREWRITE true

 

#DROPALLQUERIES

This command drops all installed queries. It does not remove andy sources or sinks, but you can use #DROPALLSINKS or #DROPALLSOURCES for this.

Parameters

Example

#DROPALLQUERIES

#DROPALLSINKS

This command drops all installed sinks. It does not remove andy queries or sources, but you can use #DROPALLQUERIES or #DROPALLSOURCES

Parameters

Example

#DROPALLSINKS

#DROPALLSOURCES

This command drops all installed sources. It does not remove andy queries or sinks, but you can use #DROPALLQUERIES or #DROPALLSINKS

Parameters

Example

#DROPALLSOURCES

 

#LOGIN

Changes the login that is used by other commands like #QUERY

Parameters

Example

This example changes the user to "System" with password "manager"

#LOGIN System manager

 

#LOGOUT

Logs the current used user out

Parameters

Example

#LOGOUT

 

#ODYSSEUS_PARAM

Can be used to set internal Odysseus configuration params. This should be only

Parameters

Example

#ODYSSEUS_PARAM scheduler_TimeSlicePerStrategy 10

 

#PARSER

This command sets the current parser for following commands, e.g. by #QUERY or #ADDQUERY. The according parser is used until another parser is set.

Parameters

The parser: Which parsers are available strongly depends on the current system setting and installed features. Normally in the default product, there is "PQL" for Procedural Query Language (PQL) and "CQL" for Continuous Query Language (CQL).

Example

#PARSER PQL

#QName

Set the name of the following queries.

#QNAME Query1

#QUERY

This command executes a query in a certain language. This might be, for example Procedural Query Language (PQL) or Continuous Query Language (CQL). There are three different commands to execute such a query: #QUERY, #ADDQUERY and #RUNQUERY. While #QUERY and #ADDQUERY (they are one and the same) only passes the definied query to Odysseus, the #RUNQUERY additionally starts the query. This means, a query that was added with #QUERY or #ADDQUERY is inactive and not started until it is explicetely started. The #RUNQUERY in contrast immediatly starts a query after it is added, e.g. by using #STARTQUERIES .

Parameters

The query command is dependent on the current parser (which is set by #PARSER) and the current transformation configuration (which is set by #TRANSCFG). Therefore, it is necessary to run these two commands before. Furthermore, you can switch to other parsers / transformation within one script by using #PARSER or #TRANSCFG again. Thus, if you want to run a query in CQL that last #PARSER command before should set the parser to "CQL".

If #QName is defined before, the query will get this name.

Example

The example shows four queries after the parser is set to CQL and the transformation configuration is set to Standard. The first one uses #QUERY and it is executed as a CQL-Query, but not started. The second query is equal to the first one (it still uses CQL and is not started). The third query also uses CQL and the Standard transformation configuration, but is (in contrast to the first and second) started (it is directly running). Then, the parser is switched to PQL, so that the fourth query is parsed by the PQL-Parser and not  by the CQL-Parser anymore.

#PARSER CQL
#TRANSCFG Standard

#QUERY
SELECT * FROM bid

#ADDQUERY
SELECT * FROM bid

#RUNQUERY
SELECT * FROM bid

#PARSER PQL

#QUERY
result =  PROJECT({ATTRIBUTES=['id','name']}, person)

#RELOADFROMLOG

The reload log is a file that logs all queries that were sucessfully installed into the system. This command can be used to run these logged queries from the log, e.g. to recreat an old ystem state.

Parameters

Example

#RELOADFROMLOG

 

#RUNQUERY

This command installs a query and starts it immediately. See #QUERY for parameters, examples and details.

#SCHEDULER

Sets the used scheduler and its scheduling strategy.

Parameters

Example

Uses the "Single Thread Scheduler" with a "Round Robin" scheduling strategy

#SCHEDULER "Single Thread Scheduler RR" "Round Robin"

Uses the "Single Thread Scheduler" with a "Aurora Min Cost" scheduling strategy

#SCHEDULER "Single Thread Scheduler RR" "Aurora Min Cost"

Uses the "Single Thread Scheduler" with a "Aurora Min Latency" scheduling strategy

#SCHEDULER "Single Thread Scheduler RR" "Aurora Min Latency"

Uses the "Single Thread Scheduler" with a "Chain" scheduling strategy

#SCHEDULER "Single Thread Scheduler RR" "Chain"

Uses the "Single Thread Scheduler" with a "Biggest Queue" scheduling strategy

#SCHEDULER "Single Thread Scheduler RR" "Biggest Queue"

 Uses the "Simple Dynamic Priority  Scheduler" with a "Round Robin" scheduling strategy

#SCHEDULER "Simple Dynamic Priority  Scheduler" "Round Robin"

#SLEEP

This command can be used to wait a certain time before executing the next command

Parameters

Example

Waiting 2 seconds (2000 milliseconds) until the next command is invoked.

#SLEEP 2000

 

#STARTQUERIES

This command starts all installed queries that are not running at the moment.

Parameters

Example

#STARTQUERIES

 

#STARTSCHEDULER

This command starts the scheduling.Notice that the scheduling strongly influences the processing and should be carefully used. The scheduler is running by default. You can stop it by using #STOPSCHEDULER

Parameters

Example

#STARTSCHEDULER

 

#STOPSCHEDULER

This command stops the scheduling.Notice that the scheduling strongly influences the processing and should be carefully used. The scheduler is running by default. You can start it by using #STARTSCHEDULER

Parameters

Example

#STOPSCHEDULER

 

#TRANSCFG

This command sets the transformation configuration for following commands. The transformation configuration defines how a query is transformed into an executable plan. The transformation configuration that was set is used until another configuration is explicitly set.

Parameters

Example

#TRANSCFG Standard

#UNDEF

This command sets the transformation configuration for following commands. The transformation configuration defines how a query is transformed into an executable plan. The transformation configuration that was set is used until another configuration is explicitly set.

Parameters

Example

#TRANSCFG Standard

 

 

  • No labels