Describe Data
This task gives some basic type information relative for the incoming Dex data stream. The datastream is left untouched.
It is a nice component for getting some quick high level information about a dataset.
OPTION | DESCRIPTION |
---|---|
N/A | N/A |
INPUT
Data Stream Requirements
The input to this task must be a valid Dex data stream.
OUTPUT
The current Dex data stream is left intact and passed as output to the next task. However, information about the dataset is gathered and sent to the screen. This information gathered is represented in the table below.
COLUMN NAME | DESCRIPTION |
---|---|
Column | The number of the column, starting at 0. |
Column Name | The name of the column. |
Distinct | The number of distinct values within this column. |
Trimmed Distinct | The number of distinct values within this column after whitespace is trimmed from the beginning and ends. |
String | The number of rows within this column which qualify as a string. |
Double | The number of rows within this column which qualify to be a double. |
Integer | The number of rows within this column which qualify to be an integer. |
EXAMPLES
This tool is useful for ensuring that data is of a certain types, as some charts are pickier than others. It is also useful for ensuring that messy data with untrimmed whitespace is not present.