Qore Mapper Module Reference
1.4.1
|
This module provides classes that help with structured data mapping, meaning the transformation of data in one or more input formats to a different output format.
Classes provided by this module:
The following is an example map hash with comments:
If this map is applied to the following data in the following way:
The result will be:
list: (2 elements) [0]=hash: (5 members) id : 1 name : "John Smith" explicit_count : 1 implicit_count : 1 order_date : 2014-01-02 10:37:45.103948 Thu +01:00 (CET) [1]=hash: (5 members) id : 2 name : "Steve Austin" explicit_count : 2 implicit_count : 2 order_date : 2014-01-04 19:21:08.882634 Sat +01:00 (CET))
The mapper hash is made up of target (ie output) field names (note that dotted output field names result in a nested hash output unless the allow_output_dot option is set) as the key values assigned to field specifications as follows:
"name"
key)"code"
key); the closure or call reference must accept the following arguments:
"code"
: a closure or call reference to process the field data; cannot be used with the "constant"
or "index"
keys"constant"
: the value of this key will be returned as a constant value; this key cannot be used with the "name"
, "struct"
, "code"
, "index"
or "default"
keys"index"
: gives current index/count of the row. The initial int value is the start offset. So value 0 means that mapped values will be: 0, 1, ..., N; 1 means: 1, 2, ..., N; etc."date_format"
: gives the format for converting an input string to a date; see Date Formatting Codes for the format of this string; note that this also implies "type"
= "date"
"default"
: gives a default value for the field in case no input or translated value is provided"mand"
: assign to boolean True if the field is mandatory and an exception should be thrown if no input data is supplied"maxlen"
: an integer giving the maximum output string field length in bytes"name"
: the value of this key gives the name of the input field; only use this if the input record name is different than the output field name; note that if this value contains "."
characters and the allow_dot option is not set (see Mapper Options), then the value will be treated like "struct"
(the "struct"
key value will be created automatically); cannot be used with the "constant"
ior "index"
keys"number_format"
: gives the format for converting an input string to a number; see Qore::parse_number() for the format of this string; note that this also implies "type"
= "number"
"output_key_path"
: gives the output path for hash output values; each element in the list is a string key name"runtime"
: a reference to Mapper Runtime Options current status. The value is key in the current runtime structure."struct"
: the value of this key gives the location of the input field in an input hash in dot notation, ex: "element.name"
would look for the field's value in the "name"
key of the "element"
hash in the input record; cannot be used with the "constant"
or "index"
keys; this option is only necessary in place of the "name" option if the allow_dot option is set, otherwise use "name"
instead"trunc"
: assign to boolean True if the field should be truncated if over the maximum field length; this key can only be set to True if the "maxlen"
key is also given"type"
: this gives the output field type, can be:"date"
: date/time field"int"
: fields accepts only integer values (any non-integer values on input will cause an exception to be thrown when mapping; note: also "integer"
is accepted as an alias for "int"
)"number"
: field accepts only numeric values (any non-numeric values on input will cause an exception to be thrown when mapping); numeric values are left in their original types, any other type is converted to a arbitrary-precision numeric value"string"
: field accepts string values; in this case any other value will be converted to a string in the output"hash"
: field accepts hash values"any"
: field accepts any valueMapper objects accept the following options in the option hash:
"allow_dot"
: if True (as evaluated by parse_boolean()) then field names with "."
characters do not imply a structured internal element lookup; in this case input field names may have "."
characters in them, use the "struct"
key to use structured internal element loopups (see Mapper Specification Format "struct"
docs for more info)"allow_output_dot"
: if True (as evaluated by parse_boolean()) then output field names with "."
characters do not imply a structured/hash output element; in this case output field names may have "."
characters in them"date_format"
: gives the global format for converting a string to a date; see Date Formatting Codes for the format of this string; this is applied to all fields of type "date"
unless the field has a "date_format"
value that overrides this global setting"encoding"
: the output character encoding; if not present then "UTF-8"
is assumed"info_log"
: an optional info logging callback; must accept a string format specifier and sprintf()-style arguments"input"
: an optional hash describing the input records where each key is a possible input field name (where dot notation indicates a multi-level hash) and each value is a hash describing the field with the following optional keys:"desc"
: this gives the description of the input field"input_log"
: an optional input data logging callback; must accept a hash giving the input data hash"input_timezone"
: an optional string or integer (giving seconds east of UTC) giving the time zone for parsing input data (ex: "Europe/Prague"
), if not set defaults to the current TimeZone (see Qore::TimeZone::get())"name"
: the name of the mapper for use in logging and error strings"number_format"
: gives the global format for converting a string to a number; see Qore::parse_number() for the format of this string; this is applied to all fields of type "number"
unless the field has a "number_format"
value that overrides this global setting"output"
: an optional hash describing the output data structure; each hash key is a output field name (where dot notation indicates a multi-level hash) and each value is an optional hash describing the output field taking an optional "desc"
key and a subset of mapper field hash keys as follows:"desc"
: a description of the output field"mand"
: True if the field is mandatory and an exception should be thrown if no input data is supplied"maxlen"
: an integer giving the maximum length of a string field in bytes"type"
: this gives the output field type, can be:"date"
: date/time field"int"
: fields accepts only integer values (any non-integer values on input will cause an exception to be thrown when mapping; note: also "integer"
is accepted as an alias for "int"
)"number"
: field accepts only numeric values (any non-numeric values on input will cause an exception to be thrown when mapping); numeric values are left in their original types, any other type is converted to a arbitrary-precision numeric value"string"
: field accepts string values; in this case any other value will be converted to a string in the output"hash"
: field accepts hash values"any"
: field accepts any value"output_log"
: an optional output data logging callback; must accept a hash giving the output data hash"runtime"
: an initial runtime structure for Mapper Runtime Options"timezone"
: an optional string or integer (giving seconds east of UTC) giving the time zone definition for output data (ex: "Europe/Prague"
), if not set defaults to the current TimeZone (see Qore::TimeZone::get())"trunc_all"
: if True (as evaluated by parse_boolean()) then any field without a "trunc"
key (see Mapper Specification Format "trunc"
description) will automatically be truncated if a "maxlen"
attribute is set for the field"input"
option is given, then only those defined fields can be referenced as input fields in the mapper hash; all possible input fields should be defined here if this option is used"output"
option is given, then only those defined fields can be referenced as output fields, additionally the types given in the output definition cannot be overridden in the mapper hash; all possible output fields should be defined here if this option is usedRuntime options for Mapper objects allow the programmer to use constant values provided at runtime in the Mapper output.
For example, runtime options can be useful in the following cases:
The runtime options are basically the same as setting constants in the mapper before providing runtime data to the mapper. As such, the runtime options can be changed only before the first input hash is processed by a Mapper.
Note that the Mapper::setRuntime() and Mapper::replaceRuntime() methods are deprecated - please use Mapper construction options to set runtime values instead. The methods are deprecated since runtime options duplicate existing functionality and are confusing and error-prone to use.
STRING-TOO-LONG
exception (issue 2405)TableMapper
module (issue 1736)"constant"
field tag giving a constant value for the output of a field"allow_output_dot"
option to suppress this behavior"default"
field tag giving a default value if no input value is specified"date_format"
mapper option"number_format"
field option and a global option of the same name"timezone"
and "input_timezone"
options, documented those options"number"
field type: now leaves numeric values in their original type, converts all other types to a number"crec"
option"input"
option with input record validation"output"
option with output record validation"info_log"
option and removed the "trunc"
option"runtime"
mapper option"index"
field tag for current row index