Remark: Work in progess
Based on the DEBS 2015 Tutorial "Tutorial 1: Patterns for Realtime Streaming Analytics" by Srinath Perera, Sriskandarajah Suhothayan we will show how Odysseus can be used for the following streaming analytics pattern.
In the following we will show different processing scenarios. All scenarios are build with Procedural Query Language (PQL) and for many cases we use the Nexmark scenario (Getting Started with Nexmark). You should also have installed the nexmark source (Simple Query Processing).
This is one of the most and simple operation. Filtering in Odysseus can be done with the Select operator.
See Examples for Select here.
This can be done in different ways. If you just want to remove Attributes, you can use the Project operator.
If you want to rename attributes, use the Rename operator.
If you want to add attributes, use the Map operator. With this operator it is also possible to remove and rename attributes. For just removing or renaming you should use one of the above operators, because they provide a much lower footprint.
Examples: Selection, Projection and Map
Splitting and combining of attributes in a single stream can be done with the Map operator. This operator allows many different mathematical expressions over all attributes inside the input stream.
out = MAP({ expressions = [ ['bid','renamed_bid'] /// Renaming of attribute /// Remark: The output only contains attributes/expressions that are given here, so removing an attribute is the by not using it here ]} ,nexmark:person) /// Simple split name in forename and lastname by looking for the first blanc out2 = MAP({ expressions = [ ['Substring(name,0,indexOf(name," "))','forename'], ['Substring(name,indexOf(name," ")+1)','lastname'], ] },nexmark:person ) /// Splitting of attribute into list /// Split string into substrings by " " presplitted = MAP({ expressions = [ ['split(name," ")','splittedName'] ] },nexmark:person ) /// access first and last element of list out3 = MAP({ EXPRESSIONS = [ ['splittedName[0]','forename'], ['elementAt(splittedName,size(splittedName)-1)','lastname'] ] }, presplitted ) /// access first and last element of list with special function out4 = MAP({ EXPRESSIONS = [ ['first(splittedName)','forename'], ['last(splittedName)','lastname'] ] }, presplitted ) |
Tranformation can also be done with the Map operator.
This pattern detects a condition and generates alerts based on a condition. (e.g., Alarm on high temperature). These alerts can be based on a simple value or more complex conditions such as rate of increase etc.