Title: AntiLearning
1Anti-Learning
- Adam Kowalczyk
- Statistical Machine Learning
- NICTA, Canberra
- (Adam.Kowalczyk_at_nicta.com.au)
National ICT Australia Limited is funded and
supported by
1
2Overview
- Anti-learning
- Elevated XOR
- Natural data
- Predicting Chemo-Radio-Therapy (CRT) response for
Oesophageal Cancer - Classifying Aryl Hydrocarbon Receptor genes
- Synthetic data
- High dimensional mimicry
- Conclusions
- Appendix A Theory of Anti-learning
- Perfect anti-learning
- Class-symmetric kernels
3Definition of anti-learning
Systematically
Random guessing accuracy
Off-training accuracy
Training accuracy
4Anti-learning in Low Dimensions
-1
1
1
-1
5Anti-Learning
Learning
6Evaluation Measure
- Area under Receiver Operating Characteristic
(AROC)
1
f
0.5
True Positive
AROC( f )
0
0
0.5
1
False Positive
7Learning and anti-learning mode of supervised
classification
8Anti-learning in Cancer Genomics
9From Oesophageal Cancer to machine learning
challenge
10Learning and anti-learning mode of supervised
classification
Test
Training
1
AROC
Learning
TP
1
AROC
0
0
1
TP
FN
1
0
0
1
FN
TP
Anti-learning
AROC
0
0
1
FN
11Anti-learning in Classification of Genes in Yeast
12KDD02 task identification of Aryl Hydrocarbon
Receptor genes (AHR data)
13Anti-learning in AHR-data set from KDD Cup 2002
Average of 100 trials random splits training
test 66 34
14KDD Cup 2002 Yeast Gene Regulation Prediction
Taskhttp//www.biostat.wisc.edu/craven/kddcup/ta
sk2.ppt
15Anti-learning in High Dimensional Approximation
(Mimicry)
16Paradox of High Dimensional Mimicry
- If detection is based of large number of
features, - the imposters are samples from a distribution
with the marginals perfectly matching
distribution of individual
features for a finite genuine sample, then - imposters are be perfectly detectable by
ML-filters in the anti-learning mode
17Mimicry in High Dimensional Spaces
18Quality of mimicry
d 1000
nE / nX
Average of independent test for of 50 repeats
19Formal result
20Proof idea 1Geometry of the mimicry data
Key Lemma
21Proof idea 1 Geometry of the mimicry data
22Proof idea 2
23Proof idea 2
24Proof idea 2
25 Proof idea 3kernel matrix
26 Proof idea 4
27Theory of anti-learning
28Hadamard Matrix
29CS-kernels
30Perfect learning/anti-learning for CS-kernels
Kowalczyk Chapelle, ALT 05
31Perfect learning/anti-learning for CS-kernels
Kowalczyk Chapelle, ALT 05
32Perfect learning/anti-learning for CS-kernels
33Perfect learning/anti-learning for CS-kernels
34Perfect anti-learning theorem
Kowalczyk Smola, Conditions for Anti-Learning
35Anti-learning in classification of Hadamard
dataset
Kowalczyk Smola, Conditions for Anti-Learning
36AHR data set from KDD Cup02
Kowalczyk Smola, Conditions for Anti-Learning
Kowalczyk, Smola, submitted
37From Anti-learning to learning Class Symmetric
CS kernel case
Kowalczyk Chapelle, ALT 05
38Perfect anti-learning i.i.d. a learning curve
n 100, nRand 1000
random
AROC mean std
2
1
4
5
0
3
nsamples i.i.d. samples from the perfect
anti-learning-set S
39Conclusions
- Statistics and machine learning are indispensable
components of forthcoming revolution in medical
diagnostics based on genomic profiling - High dimensionality of the data poses new
challenges pushing statistical techniques into
uncharted waters - Challenges of biological data can stimulate novel
directions of machine learning research
40Acknowledgements
- Telstra
- Bhavani Raskutti
- Peter MacCallum Cancer Centre
- David Bowtell
- Coung Duong
- Wayne Phillips
- MPI
- Cheng Soon Ong
- Olivier Chapelle
- NICTA
- Alex Smola