- marketing team will be interest to see if they can
increase their efficiency - the management team want to know if using advanced analytics may
increase salesin general.
Attributes:
country: Country namearticle: 6 digit article number, as unique identifier of an articlesales: total number of units sold in respective retail weekregular_price: recommended retail price of the articlecurrent_price: current selling price (weighted average over the week)ratio: price ratio as current_price/regular_price, such that price discount is 1-ratioretailweek: start date of the retailweekpromo1: indicator for media advertisement, taking 1 in weeks of activation and 0 otherwisepromo2: indicator for store events, taking 1 in weeks with events and 0 otherwisecustomer_id: customer unique identifier, one id per customerarticle: 6 digit article number, as unique identifier of an articleproductgroup: product group the article belongs tocategory: product category the article belongs tocost: total costs of the article (assumed to be fixed over time)style: description of article designsizes: size range in which article is availablegender: gender of target consumer of the articlergb_*_main_color: intensity of the red (r), green (g), and blue (b) primaries of the article‘s main color, taking values [0,250]rgb_*_sec_color: intensity of the red (r), green (g), and blue (b) primaries of the article‘s secondary color, taking values [0,250]label: advertisement result after offering/sending/presenting the offer to the customer. 0 means the customer did not buy and 1 means the costumer did buy.
article477 distinct values Highcardinalityretailweek123 distinct Weeks Highcardinalityregular_priceis highly overall correlated with current_price Highcorrelationcurrent_priceis highly overall correlated with regular_price Highcorrelationproductgroupis highly overall correlated with cost Highcorrelationcategoryis highly overall correlated with cost Highcorrelationstyleis highly overall correlated with cost Highcorrelationsizesis highly overall correlated with cost Highcorrelationgenderis highly overall correlated with cost Highcorrelationclosest_colors_mainis highly overall correlated with cost Highcorrelationclosest_colors_secis highly overall correlated with cost Highcorrelationpromo1is highly imbalanced (66.5%)Imbalancepromo2is highly imbalanced (95.5%)Imbalancesizesis highly imbalanced (53.1%)Imbalancearticle.1 is uniformly distributedUniform