tokenize (Stream Processor)
Stream processor tokenizes the given JSON into to multiple JSON string elements and sends them as separate events.
Syntax
json:tokenize(<STRING|OBJECT> json, <STRING> path)
json:tokenize(<STRING|OBJECT> json, <STRING> path, <BOOL> fail.on.missing.attribute)
Query Parameters
| Name | Description | Default Value | Possible Data Types | Optional | Dynamic |
|---|---|---|---|---|---|
| json | The input JSON that needs to be tokenized. | STRING OBJECT | No | Yes | |
| path | The path of the set of elements that will be tokenized. | STRING | No | Yes | |
| fail.on.missing.attribute | If there are no element on the given path, when set to true the system will drop the event, and when set to false the system will pass null value to the jsonElement output attribute. | true | BOOL | Yes | No |
Extra Return Attributes
| Name | Description | Possible Types |
|---|---|---|
| jsonElement | The JSON element retrieved based on the given path will be returned as a JSON string. If the path selects a JSON array then the system returns each element in the array as a JSON string via a separate events. | STRING |
Example 1
CREATE STREAM InputStream (json STRING, path STRING);
@info(name = 'query1')
INSERT INTO OutputStream
SELECT path, jsonElement
FROM InputStream#json:tokenize(json, path);
In this example, the query named 'query1' processes records from the InputStream and extracts JSON elements based on the provided path using the json:tokenize(json, path) function. The resulting (path, jsonElement) pairs are inserted into the OutputStream.
For example, if the input json is {name:'John', address:{country:'CA', city: 'Toronto'}, phoneNo: [9876756567, 8778787768]}, and the path is passed as $.address, then the query generates an event with the escaped JSON as a string ('$.address', '{\'country\':\'CA\',\'city\':\'Toronto\'}'). For the same input JSON, if the path is passed as $.name, then it will produce an event without escaping ('$.name', 'John').
For the same input JSON, if the path is passed as $.phoneNo, then the query generates an event each for both the elements in the selected JSON array as a string ('$.phoneNo', '9876756567') and ('$.phoneNo', '8778787768').
Example 2
CREATE STREAM InputStream (json STRING, path STRING);
@info(name = 'query2')
INSERT INTO OutputStream
SELECT path, jsonElement
FROM InputStream#json:tokenize(json, path, true);
In this example, the query named 'query2' processes records from the InputStream and extracts JSON elements based on the provided path using the json:tokenize(json, path, true) function. The true parameter means that the function will fail on missing attributes. The resulting (path, jsonElement) pairs are inserted into the OutputStream.
For example, if the input json is {name:'John', age:25}, and the path is passed as $.salary, then the system will produce ('$.salary', null), as the fail.on.missing.attribute is true and there are no matching elements for $.salary.