Causal Data Mining - PowerPoint PPT Presentation

About This Presentation
Title:

Causal Data Mining

Description:

Causal Data Mining. Richard Scheines. Dept. of Philosophy, Machine Learning, ... 1. Predictive Data Mining. Finding predictive relationships in data ... – PowerPoint PPT presentation

Number of Views:54
Avg rating:3.0/5.0
Slides: 15
Provided by: joseph158
Category:
Tags: causal | data | mining

less

Transcript and Presenter's Notes

Title: Causal Data Mining


1
Causal Data Mining
Richard Scheines Dept. of Philosophy, Machine
Learning, Human-Computer Interaction
Carnegie Mellon
2
1. Predictive Data Mining
  • Finding predictive relationships in data
  • What feature of student behavior predicts
    learning
  • Who will default on credit cards
  • Who will get an A in your course
  • Which HS students will do well at CMU
  • Do students cluster by learning style

3
Causal Data Mining
  • Finding causal relationships in data
  • What feature of student behavior causes learning
  • What will happen when we make everyone take a
    reading quiz before each class
  • What will happen when we program our tutor to
    intervene to give hints after an error

4
Predictive Data Mining
X1 X2 X3 . . Xk Y
1 1.7 28 M . . 2.4 1
2 2.0 11 F . . 1.1 0
3 1.9 17 F . . 1.1 1
. . . . . . . .
. . . . . . . .
N 2.8 12 M . . 1.8 0
Data Mining Search
Predictive Model Y f(X1, X2, Xk)
5
Predictive Data Mining
  • Model Classes
  • Simple Regression
  • Locally Weighted Regression
  • Logistic Regression
  • Neural Nets
  • Vector Support Machines
  • Decision Trees
  • Bayes Net
  • Naïve Bayes Classifier
  • Independent Components
  • Clustering
  • Etc.

Data Mining Search
Predictive Model Y f(X1, X2, Xk)
6
Predictive Data Mining
Data Mining Search
Predictive Model under Constraints Y f(X1, X2,
Xk), e.g., f ? Additive functions
7
Predictive Data Mining
Data Mining Search
Predictive Model under Constraints Y f(X1, X2,
Xk), Or Probability Model under
Constraints P(Y X1, X2, , Xk), where P ?
Gaussian, with mean 0
8
Predictive Data Mining
Decision Tree Search
9
Predictive Data Mining ?Causal Data Mining
Conditioning is not the same as intervening
  • P(Y X1, X2, , Xk)
  • ?
  • P(Y X1set, X2, , Xk)

Teeth Slides
10
Causal DiscoveryStatistical Data ? Causal
Structure
11
Causal Discovery Software TETRAD IV
www.phil.cmu.edu/projects/tetrad
12
Full Semester Online Course in Causal
Statistical Reasoning
13
Full Semester Online Course in Causal
Statistical Reasoning
  • Course is tooled to record certain events
  • Logins, page requests, print requests, quiz
    attempts, quiz scores, voluntary exercises
    attempted, etc.
  • Each event was associated with attributes
  • Time
  • student-id
  • Session-id

14
Printing and Voluntary Comprehension Checks 2002
--gt 2003
Write a Comment
User Comments (0)
About PowerShow.com