2.1 General Structure
The input data, with \(n = 95{,}412\) rows and \(p = 479\) columns, is structured as follows: \(\mathbf{D} = \{(\mathbf{x}_i, \mathbf{y}_i)\}_{i=1}^{n}\), with \(\mathbf{x}_i \in \mathbb{R}^{p-3}\) and \(\mathbf{y}_i \in \mathbb{R}^2\).
There are \(m = p-3 = 476\) explanatory features, two targets and one unique identifier for each example.
The \(m\) features are grouped into four blocks of information:
- Member database with personal particulars, interests and organization-internal information on examples: 81 features
- Characteristics of the example’s neighborhood from the 1990 US census: 286 features
- Promotion history: 54 features
  - Summary of promotions sent to an example in the 12 months prior to the current promotion
  - Sending dates and RFA status of promotions 13-36 months prior to the current promotion
- Giving history: 57 features
  - Summary statistics
  - Responses to promotions 13-36 months prior to the current promotion
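
The decomposition of each row into one identifier, a feature vector \(\mathbf{x}_i\) and a target vector \(\mathbf{y}_i\) can be illustrated with a minimal sketch. The column names (`id`, `f1`, `target_a`, ...) and the toy values are hypothetical placeholders, not the names or contents of the actual data set; the helper `split_columns` is likewise an assumption made for illustration only.

```python
from typing import Dict, List, Sequence, Tuple

Row = Dict[str, float]


def split_columns(
    rows: Sequence[Row], id_col: str, target_cols: Sequence[str]
) -> Tuple[List[float], List[List[float]], List[List[float]], List[str]]:
    """Split rows into identifiers, feature vectors x_i and target vectors y_i."""
    # Every column that is neither the identifier nor a target is a feature.
    feature_cols = [c for c in rows[0] if c != id_col and c not in target_cols]
    ids = [r[id_col] for r in rows]
    X = [[r[c] for c in feature_cols] for r in rows]
    Y = [[r[c] for c in target_cols] for r in rows]
    return ids, X, Y, feature_cols


# Toy table with p = 6 columns (hypothetical names and values),
# so m = p - 3 = 3 explanatory features remain.
rows = [
    {"id": 1, "f1": 0.5, "f2": 2.0, "f3": 7.0, "target_a": 0, "target_b": 0.0},
    {"id": 2, "f1": 1.5, "f2": 3.0, "f3": 1.0, "target_a": 1, "target_b": 12.5},
]

ids, X, Y, feature_cols = split_columns(rows, "id", ["target_a", "target_b"])
p = len(rows[0])
m = len(feature_cols)
print(f"n={len(rows)}, p={p}, m={m}")
```

Applied to the full data, the same split would leave \(m = p - 3\) feature columns, one identifier column and two target columns.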