The Document protocol handler parses complete text documents.

Options

  • delay: Delay of reading in milliseconds (default 0).
  • nanodelay: Delay of reading in nanoseconds (default 0).
  • oneDocPerCall: If set to true (default) there will be an complete open/process/close call on the underlying transporthandler for each new element. If set to false, the source (transporthandler) will only be called once.

Example

PQL

Document Protocol Handler
input = ACCESS({source='document', wrapper='GenericPush',
transport='File',protocol='Document',
  dataHandler='Tuple',options=[['delay','100']],
  schema=[
    ['content','String']]
})

CQL

Document Protocol Handler
CREATE STREAM document (content String)   
    WRAPPER 'GenericPush'
    PROTOCOL 'Document'
    TRANSPORT 'File'
    DATAHANDLER 'Tuple'
    OPTIONS ( 'delay' '100')
  • No labels