Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • ATTRIBUTES: The attributes of the incoming tuples that should be recognized for clustering by the distance/similarity function. Notice, not all kinds of attribute types work here
  • CLUSTERERlearner: The clustering algorithm that should be used
    • Currently implemented: kMeans, Weka (which in turn has further algorithms)
  • ALGORITHM: A set of options to describe the algorithm

...

Code Block
themeEclipse
languagejavascript
titleOperator
linenumberstrue
clustered = CLUSTERING({
                attributes=['agesource', 'income'],
                learner='weka',                
                algorithm =                'SimpleKMeans',
               options =  [
                  'model'='SimplekMeans'
                  , 'arguments'=		['arguments','-N 3'
 -I 500 -S 10  -O']]            ]                                 
  
              }, inputoperatorinput)

For weka, there are currently the following algorithms that can be used as the "model". Further details and possible arguments can be  found in the Weka Docs

...