Title: Metodi Quantitativi per Economia, Finanza e Management Lezione n
1Metodi Quantitativi per Economia, Finanza e
ManagementLezione n9
2Analisi fattoriale
- I problemi di una analisi di questo tipo sono
- a)-quante componenti considerare
- rapporto tra numero di componenti e variabili
- percentuale di varianza spiegata
- le comunalità
- lo scree plot
- interpretabilità delle componenti e loro
rilevanza nella esecuzione dellanalisi
successive - b)-come interpretarle
- correlazioni tra componenti principali e
variabili originarie - rotazione delle componenti
3Analisi Fattoriale
- Qualità degli ingredienti
- Genuinità
- Leggerezza
- Sapore/Gusto
- Caratteristiche Nutrizionali
- Attenzione a Bisogni Specifici
- Lievitazione Naturale
- Produzione Artigianale
- Forma/Stampo
- Richiamo alla Tradizione
- Grandezza della Confezione (Peso Netto)
- Funzionalità della Confezione
- Estetica della Confezione
- Scadenza
- Nome del Biscotto
- Pubblicità e Comunicazione
- Promozione e Offerte Speciali
- Consigli per lUtilizzo
- Prezzo
- Sono stati individuati 20 attributi
caratterizzanti il prodotto-biscotto - È stato chiesto allintervistato di esprimere un
giudizio in merito allimportanza che ogni
attributo esercita nellatto di acquisto -
4Analisi fattoriale
51. The ratio between the number of components and
the variables One out of Three 20 original
variables 6-7 Factors
62. The percentage of the explained variance
Between 60-75
7Factor Analysis
3. The scree plot The point at which the scree
begins
84. Eigenvalue Eigenvaluesgt1
9Factor Analysis
10Analisi Fattoriale
115. Communalities The quote of explained
variability for each input variable must be
satisfactory In the example the overall
explained variability (which represents the mean
value) is 0.61057
12Factor Analysis
- 6. Interpretation Component Matrix (factor
loadings) - The most relevant output of a factorial analysis
is the so called component matrix, which shows
the correlations between the original input
variables and the obtained components (factor
loadings) - Each variable is associated specifically to the
factors (components) with which there is the
highest correlation - The interpretation of the each factor has to be
guided considering the variables with the highest
correlations related to single factor
136. Interpretation Correlation between Input
Vars Factors The new Factors must have a
meaning based on the correlation structure
146. Interpretation The correlation structure
between Input Vars Factors In this case
the correlation structure is well defined and the
interpretation phase is easier
15Factor Analysis
- Issues of the Factor Analysis are the following
- a) How many Factors (or components) need to be
considered - 6. The degree of the interpretation of the
components and how they affect the next analyses - b) How to interpret
- The correlation between the principal components
and the original variables - The rotation of the principal components
16Factor Analysis
- 6. Interpretation The rotation of factors
- There are numerous outputs of factorial analysis
which can be produced through the same input data - These numerous outputs dont provide
interpretation that are remarkably different from
one another, as matter of fact they differ only
slightly and there are areas of ambiguity
17Factor Analysis
x4
x3
Interpretation of the factors
18Factor Analysis
- 6. Interpretation The rotation of factors
- The Varimax method of rotation, suggested by
Kaiser, has the purpose of minimizing the number
of variables with high saturations (correlations)
for each factor - The Quartimax method attempts to minimize the
number of factors tightly correlated to each
variable - The Equimax method is a cross between the
Varimax and the Quartimax - The percentage of the overall variance of the
rotated factors doesnt change, whereas the
percentage of the variance explained by each
factors shifts
19Analisi Fattoriale
Before the rotation step
20Analisi Fattoriale
After the rotation step
215. Communalities The communalities dont change
after the Rotation Step
226. Interpretation The correlation structure
between Input Vars Factors improves after
the rotation step
236. Interpretation The correlation structure
between Input Vars Factors The variable
with the lowest communality is not well explained
by this solution
24Factor Analysis
- Once an adequate solution is found, it is
possible to use the obtained factors as new macro
variables to consider for further analyses on the
phenomenon under investigation, thus replacing
the original variables - Again taking into consideration the example, we
may add six new variables into the data file, as
follows - Health,
- Convenience Practicality,
- Image,
- Handicraft,
- Communication,
- Taste.
- They are standardized variables zero mean and
variance equal to one. - They will be the input for further analyses of
Dependence or/and Interdependence.
25Factor Analysis
268062 Quantitative Methods for Marketing -Final
Project Submission date 5th of June 2009
Sushi Fever Team Bancheva, Kamelia Bettinali,
Francesco Bonchev, Dimitar Pasca,
Ruxandra Petranovic, Marija
Coffee Consumption in Italy
276. Factor Analysis
- We ran a Factor Analysis on two numerical
questions from the survey that we felt might have
correlated variables Q15 (What are you general
coffee preferences?) and Q16 (If you drink your
coffee outside (in a bar/coffee place) which are
the main factors that, in general, influence your
decision on where you drink your coffee?). - We used the Principal Components Method that was
supposed to solve the multicollinearity problem
among our variables and provide us with
summarized number of variables/factors which are
not correlated (standardized by definition, with
mean 0, standard deviation 1) to better explain
and understand the specific situation of coffee
consumption. - This represents a preliminary phase for cluster
analysis and regression analysis.
286.1. Initial Variables used for analysis
On the right, there are our initial 21 variables
(taken from Q15 and Q16) that we selected for
running the factor analysis. Judging by the SPSS
Correlation Matrix (that is not present in the
slide because of its big size please see the
output for the check), we have many variables
which are significantly correlated.
Need for FACTOR REDUCTION! Start real Factor
Analysis!
296.2. Choosing the right number of factors
- 1/3 criteria 21/3 7 factors
- Variance explained (60-75) 7, 8, 9, 10
factors - Scree Plot 6, 8, 10 factors
- Eigenvalues 6, 7, 8 factors
The optimal values seem to be 7 or 8 factors.
306.2. Choosing the right number of factors
continued -
The present Scree Plot represents the number 3
criteria of number of factors selection from the
previous slide.
316.3. Factor Analysis with 8 Factors
- After analyzing the Communalities table, we
identified one variable that is not properly
explained by our 8 selected factors (0.387 is not
satisfying)! This variable is Price which we
consider an important variable in our analysis!
Decreasing the number of factors to 7, will not
improve the explanatory power of the variables
for the price!
We decided to exclude the Price variable from
this factor analysis and consider it as a
separate factor (given its very high importance
from our qualitative point view) in the future
analysis cluster regression analysis.
326.4. Factor Analysis with 20 FactorsAfter
elimination of the Price variable
- 1/3 criteria 20/3 6 factors
- Variance explained (60-75) 7, 8, 9 factors
- Scree Plot 6, 7, 9 factors
- Eigenvalues 6, 7, 8 factors
The optimal choice seems to be 7 factors.
336.4. Factor Analysis with 20 FactorsAfter
elimination of the Price variable-continued-
The present Scree Plot represents the number 3
criteria of number of factors selection from the
previous slide.
346.5. Factor Analysis with 7 Factors
- After analyzing the Communalities table, we that
so far the 7 factors properly explain the initial
variables. All communalities are over 0.400,
which is a good result.
We are ready to take a look at the Rotated
Component Matrix to see if the factors make
sense/can be explained!
356.6. Factors - explained
- The method used for rotation was Varimax.
- After closely analyzing the Rotated Component
Matrix, we tried to give meaning to our 7
factors. - The names of the respective factors are the
following - Socialization factor
- Internet/ Trendiness factor
- Close meeting place factor
- Intellectual/ non-smoking factor
- Familiarity factor
- Variety/To Go factor
- Traditionality Addiction factor
366.6. Factors explained - continued -
1. Socialization Factor Socialize, sit down, being with friends, cozy atmosphere
2. Internet/Trendiness Factor Wi-Fi availability, internet, trendy place
3. Close meeting place Factor Close to home/work/school, ability to meet people, quality of coffee not important
4. Intellectual/Non-smoking Factor Non-smokers, usually snack, love to read
5. Familiarity Factor Go to the same bar, do not like trying new places, concerned about quality of coffee
6. Variety/To-go Factor Variety and coffee to go, non traditional Italian coffee, preference for taking coffee alone
7. Traditionality/Addiction Factor Italian coffee preference, addicts
37Quantitative Methodsclass 22
The consumption of Digital Music and its impact
on the Music Industry
- Simona Bara
- Claudio Boccassini
- Danilo Broseghini
- Himashu Chikara
- Isabella Rossi
- Federico Salvaggio
- Alessandro Tiso
38Factor Analysis
- Home
- Car
- Outside in general
- Office/University
- Shops
- Restaurants
- Bars/discoteque
- Record player
- Cassette player
- CD player
- Digital player
- Car stereo
- House stereo
- Radio
- Mobile phone
- USE record player
- USE cassette player
- USE CD Player
- USE digital player
- We have taken into consideration questions n
4,9,10 and therefore we have 24 variables - We asked interviewees to give a score from 1 to 9
(1 I dont like it 9 I love it) - or to use percentages
- Quest.n.4 score
- Quest.n.9 score
- Quest.n.10
39Factor Analysis
Number of factors 9 Extraction Principal
Component Analysis Max number of interaction
25 Rotation Varimax
First hypothesis
40Factor Analysis
Ratio between component number and variable number ADEQUATE For a set of 17 variables, the ideal number of components is 4-5. In this case for a set of 24 variables, we have considered 9 components
global explained variance OK About 68 - the optimal range is 60 - 70
Communalities ADEQUATE The values vary among 0,456 and 0,917
We found a problem looking at the rotated
component matrix CORRELATION AMONG COMPONENTS
AND ORIGINAL VARIABLES NON OPTIMAL
problematic 9th component
41Factor Analysis
Number of factors 8 Extraction Principal
Component Analysis Max number of interaction
25 Rotation Varimax
Second hypothesis
42Factor Analysis
Ratio between component number and variable number ADEQUATE For a set of 17 variables, the ideal number of components is 4-5. In this case for a set of 24 variables, we have considered 8 components
global explained variance OK About 63 - the optimal range is 60 - 70
Communalities ACCEPTABLE The values vary among 0,431 and 0,870
43Factor Analysis
Scree plot ADEQUATE Quite linear slope
From the 9th component , there is little increase
in significance explained.
44Factor Analysis Interpretation
- Problems with the 9th component its over.
- We choosed Varimax option to minimize the number
of variables that have elevated saturations for
each factor
WE CHOOSE THE SECOND HYPOTHESIS
45(No Transcript)
46Factor Analysis Interpretation
Office/University Shops Restaurants Bars/Discoteque OUTSIDE LISTENING
Record player Use record player Cassette player Use cassette player STEREO
Digital player Use digital player DIGITAL PLAYER
Radio Use radio RADIO
Car Car stereo CD player Use CD player CAR LISTENING
Home House stereo Use house stereo HOUSE LISTENING
Outside in general Use PC PC
Mobile phone Use mobile phone MOBILE PHONE