Predicting Individual Responses Using Multinomial Logit Analysis - PowerPoint PPT Presentation

1 / 31
About This Presentation
Title:

Predicting Individual Responses Using Multinomial Logit Analysis

Description:

Predicting Individual Responses Using Multinomial Logit Analysis Modeling an individual s response to marketing effort The BookBinders Book Club case – PowerPoint PPT presentation

Number of Views:393
Avg rating:3.0/5.0
Slides: 32
Provided by: Arvin46
Category:

less

Transcript and Presenter's Notes

Title: Predicting Individual Responses Using Multinomial Logit Analysis


1
Predicting Individual ResponsesUsing Multinomial
Logit Analysis
  • Modeling an individuals response to marketing
    effort
  • The BookBinders Book Club case

2
The Logit Model
  • The objective of the model is to predict the
    probabilities that an individual will choose each
    of several choice alternatives (e.g., buy versus
    not buy Select from among three brands A, B, and
    C). The model has the following properties
  • The probabilities lie between 0 and 1, and sum
    to 1.
  • The model is consistent with the proposition
    that customers pick the choice alternative that
    offer them the highest utility on a purchase
    occasion, but the utility has a random
    component that varies from one purchase
    occasion to the next.
  • The model has the proportional draw property --
    each choice alternative draws from other choice
    alternatives in proportion to their utility.

3
Technical Specification of the Multinomial Logit
Model
  • Individual is probability of choosing brand
    1(Pi1) is given by
  • where Aij is the attractiveness of alternative
    j to customer i å wk bijk
  • k
  • bijk is the value (observed or measured) of
    variable k (e.g., price) for alternative j when
    customer i made a purchase.
  • Wk is the importance weight associated with
    variable k (estimated by the model)
  • Similar equations can be specified for the
    probabilities that customer i will choose other
    alternatives.

4
Technical Specification ofthe Multinomial Logit
Model
  • On each purchase occasion, the (unobserved)
    utility that customer i gets from alternative j
    is given by
  • where ?ij is an error term. Notice that utility
    is the sum of an observable term (Aij) and an
    unobservable term (?ij ).

5
Example Choosing Among Three Brands
6
Example Computations
  • (a) (b) (c) (d) (e)
  • Share ShareBrand Aij wk bijk
    estimate estimate Draw without with
    (c)(d) new brand new brand
  • A 4.70 109.9 0.512 0.407 0.105
  • B 3.30 27.1 0.126 0.100 0.026
  • C 4.35 77.5 0.362 0.287 0.075
  • D 4.02 55.7 0.206

7
An Important Logit Model Implication
High
Marginal Impact of a Marketing Action (
)
Low
0.0
0.5
1.0
Probability of Choosing Alternative 1 ( )
8
Quote for the Day
  • You will lose money sending a terrific piece of
    mail to a lousy list, but make money sending a
    lousy piece of mail to a terrific list!
  • -- Direct mail lore

9
MNL Model of Response to Direct Mail
  • Probability of function of (past response
    behavior,
  • responding to marketing effort,
  • direct mail characteristics of
  • solicitation customers)

10
BookBinders Book Club Case
  • Predict response to a mailing for the Art
    History of Florence based on the following
    variables
  • Gender
  • Amount Purchased
  • Months since first purchase
  • Months since last purchase
  • Frequency of purchase
  • Past purchases of art books
  • Past purchases of childrens books
  • Past purchases of cook books
  • Past purchases of DIY books
  • Past purchases of youth books

11
Scoring Using Current Industry Practice
  • Dominant Scoring Rule used in the industry is
    the RFM (Recency, Frequency, and Monetary) model
  • Recency
  • Last purchased in the past 3 months 25 points
  • Last purchased in the past 3 - 6 months 20
  • Last purchased in the past 6 - 9 months 10
  • Last purchased in the past 12 - 18 months 5
  • Last purchased in the past 18 months 0
  • Come up with similar scoring rules for
    Frequency and Monetary.
  • For each customer, add up his/her score on each
    of the components (recency, frequency, and
    monetary) to compute an overall score.

12
Scoring Based on Regression
  • Regression Model
  • Pij wo ?wkbijk ?ij
  • where Pij is the probability that individual i
    will choose alternative j, wk are the regression
    coefficients and bijk are the independent
    variables described earlier. Note that Pij
    computed this way need not necessarily lie
    between 0 and 1.

13
Scoring Model using Artificial Neural Networks
  • What is a neural network?
  • Determinants of network properties
  • Description of feed-forward network with back
    propagation
  • Potential value of neural networks

14
Artificial Neural Networks
  • An artificial neural network is a general
    response model that relates inputs (e.g.,
    advertising) to outputs (e.g., product
    awareness). The modeler need not specify the
    functional form of this relationship.
  • A neural net attempts to mimic how the human
    brain processes input information and consists of
    a richly interlinked set of simple processing
    mechanisms (nodes).

15
Characteristics of Biological Neural Networks
  • Massively parallel
  • Distributed representation and computation
  • Learning ability
  • Generalization ability
  • Adaptivity
  • Inherent contextual information
  • Fault tolerance
  • Low energy consumption

16
An Example Artificial Neural Network
Neurons
Inputs In humanssensory data. In
4Thoughtadvertising, selling effort, price, etc.
Outputs In humansmuscular reflexes. In
4Thoughtsales model.
Synapses
17
Determinants of the Behavior of Artificial Neural
Network
  • Network properties (depends on whether network is
    feedforward or feedback number of nodes, number
    of layers in the network, and order of
    connections between nodes).
  • Node properties (threshold, activation range,
    transfer function).
  • System dynamics (initial weights, learning rule).

18
Processing Mechanism of Individual Neurons
  • Each neuron converts input signals into an
    overall signal value by weighting and summing the
    incoming signals.
  • Z å Wi Xi
  • i
  • It transforms the overall signal value into an
    output signal (Y) using a transfer function.

19
Transfer Function Formulations
  • Hard limiter (Y 1 if Z T else 0)
  • Sigmoidal (0 Y 1)
  • 1
  • Y g(Z)
  • 1 e(ZT)
  • Tanh (1 Y 1)
  • Y g(Z) tanh (Z T)

20
Role of Hidden Unit in a Two-Dimensional Input
Space
Exclusive or Problem
Classes with meshed regions
General region shapes
Description of decision regions
Structure
Half plane bounded by hyperplane
Single layer
Arbitrary (complexity limited by number of hidden
units)
Two layer
Arbitrary (complexity limited by number of hidden
units)
Three layer
21
System Dynamics(Learning Mechanism)
  • Supervised learning using back propagation of
    errors. Goal of this process is to reduce the
    total error at output nodes
  • EP å (tPk OPk)2
  • k
  • where
  • EP error to be minimized
  • tPk target value associated with the kth
    input values to the output nodes
  • OPk Output of neural net as calculated from
    the current set of weights.

22
Error Propagation
  • The error is calculated at each node for each
    input set k
  • The error at the output node is equal to
  • diL g (ZiL)tiL YiL
  • where
  • TiL Target value on the i-th output node
    (layer L of network)
  • diL Error to be back propagated from node i
    in layer L
  • g gradient of transfer function.

23
Error Propagation
  • Error is propagated back as follows
  • dil g (Zil) å wijl1 djl1
  • j
  • for l (L1), . . . 1. (Lth layer is output)
  • The weights are then adjusted using an
    optimality rule (in conjunction with a learning
    rate) to minimize overall error EP.

24
So, Whats the Big Deal?
  • With a sigmoidal transfer function and back
    propagation, the neural network can learn to
    represent any sampled function to any required
    degree of accuracy with a sufficient number of
    nodes and hidden layers.
  • This allows us to capture underlying
    relationships without knowing the form of the
    relationship.

25
Some Successful Applications
  • Recognizing handwritten characters (e.g., zip
    codes)
  • Recognizing speech (e.g., Dragons Naturally
    Speaking software)
  • Estimating response to direct mail operations

26
Predictions of Probability of Purchase
  • RFM Model Use computed score as a measure of
    probability of purchase.
  • Regression
  • MNL
  • RFM and Regression models can be implemented in
    Excel.
  • Also, all three scoring procedures for
    probability of
  • purchase can be implemented in Excel.

27
Predictions of Probability of Purchase
  • Neural Net Use the 4Thought software to compute
    choice probability. Note, as in regression,
    these predictions need not necessarily lie
    between 0 and 1. Follow the tutorial closely in
    doing this exercise.

28
Scoring Customers for their Potential
Profitability
  • A B C D Average Cus
    tomer Purchase Purchase ScoreCustomer Probabi
    lity Volume Margin A B C
  • 1 30 31.00 0.70 6.51 2 2 143.00 0.60
    1.72 3 10 54.00 0.67 3.62 4 5 88.00
    0.62 2.73 5 60 20.00 0.58 6.96 6 22 6
    0.00 0.47 6.20 7 11 77.00 0.38 3.22 8 1
    3 39.00 0.66 3.35 9 1 184.00 0.56 1.03
    10 4 72.00 0.65 1.87
  • Average Expected Score per customer 3.72

29
Develop Tables such as the Following (Example
Shown for Mailing to the Top 60
30
Summary of Coefficients
31
Economics of Mailings
Note If we mailed to everyone on the list, we
can expect a response rate of 8.9.
Write a Comment
User Comments (0)
About PowerShow.com