VOStat HEAD 2004 - PowerPoint PPT Presentation

About This Presentation
Title:

VOStat HEAD 2004

Description:

5% have 'statistics' in their abstract. 20% treat variable objects or multivariate ... Palomar-QUEST synoptic sky survey. 9 mix-and-match colors from 8 filters ... – PowerPoint PPT presentation

Number of Views:45
Avg rating:3.0/5.0
Slides: 20
Provided by: tri5110
Category:

less

Transcript and Presenter's Notes

Title: VOStat HEAD 2004


1
VOStat Arming Astronomers with Advanced
Statistics
  • Caltech A. Mahabal, M. Graham, S.G.Djorgovski,
  • R. Williams
  • Penn State J. Babu (PI), E. Feigelson
  • CMU R. Nichol, D. Van DenBerk, L.Wasserman

2
Use of statistics
  • 15000 astronomical studies per year
  • 5 have statistics in their abstract
  • 20 treat variable objects or multivariate
    datasets

3
Traditional methods
  • Fourier transform (Fourier 1807)
  • Least sq. and chisq (Legendre 1805, Pearson 1901)
  • Kolmogorov-Smirnov test (Kolomogrov 1933)
  • Principal Component Analysis (Hotelling 1936)

4
VOStat
  • Web based service
  • Simple and sophisticated statistical routines
  • Large datasets
  • Public domain (R)/ specially written
  • General purpose and Virtual Observatory

5
VOStat
  • ASCII / VOTABLE as input (can be used as an
    intermediate block for a VO based pipeline)
  • CGI routines as prototypes (few 1000 lines)
  • Webservices (Java GUI) - hundreds of thousands of
    lines (limited by Rs capabilities) -
    distributed, multi-OS, multi-language

6
Examples of available functions
  • Descriptive statistics (e.g. boxplot)
  • Two- and k-sample tests (e.g. Wilcoxon rank-sum
    test)
  • Density estimation (e.g. Kernel smoothing)
  • Correlation and regression (e.g. PCA)
  • Censored data (e.g. Survival)
  • Multivariate classification (e.g. H clustering)
  • External functions (e.g. K-density)

7
User-friendly GUI
  • Columns are autoselected (and can be deselected)
  • Parameter choices for functions are conveniently
    placed
  • Can be used from your own webpages on tables
    residing elsewhere

8
Toy Demos
  • Rediscovering HR diagram
  • Rediscovering FP of Globular Clusters
  • Looking for outliers in color-color space

9
Rediscovering HR diagram
  • Hyades stars (Hipparcus main catalog)
  • Mean/median/boxplot
  • Density estimation (Histogram)
  • Kernel smoothing
  • Correlation matrix
  • X-Y plot
  • Multivariate clustering

10
  • X-Y plot between Vmag and B-V reveals the famous
    structure in the dataset the color-magnitude of
    bright stars showing the main sequence, giant
    branch (with red clump stars), and a few Hyades
    white dwarfs.

11
FP of Globular clusters
  • Matrix of pairwise correlation coefficients
  • Pairwise plots
  • Principal Component Analysis

12
  • Core parameters as a group tend to be highly
    correlated, unlike the half-light parameters.
    This is indicative of the dynamical evolution
    driven by the core collapse.

13
Exploring outliers
  • Palomar-QUEST synoptic sky survey
  • 9 mix-and-match colors from 8 filters
  • Aim finding outliers in color-color space for
    spectroscopic follow-up
  • 1000 random objects

14
Boxplot
  • Reveals relationships between colors
  • (mean, median, overlap, outliers)

15
Clustering
  • K-means provides various cluster centers along
    with withinss and a list of possible outliers

16
(No Transcript)
17
K-density
  • Probability - density association for outliers

18
Visual confirmation(found from 1000 random
objects)
19
Summary
  • Web-based
  • VO compatible
  • Public domain and specialized routines
Write a Comment
User Comments (0)
About PowerShow.com