The AstroStat Slog » Uncertainty http://hea-www.harvard.edu/AstroStat/slog Weaving together Astronomy+Statistics+Computer Science+Engineering+Intrumentation, far beyond the growing borders Fri, 09 Sep 2011 17:05:33 +0000 en-US hourly 1 http://wordpress.org/?v=3.4 Yes, please http://hea-www.harvard.edu/AstroStat/slog/2010/yes-please/ http://hea-www.harvard.edu/AstroStat/slog/2010/yes-please/#comments Tue, 21 Dec 2010 18:36:49 +0000 vlk http://hea-www.harvard.edu/AstroStat/slog/?p=4270 Andrew Gelman says,

Instead of “confidence interval,” let’s say “uncertainty interval”

]]>
http://hea-www.harvard.edu/AstroStat/slog/2010/yes-please/feed/ 1
[ArXiv] 1st week, Nov. 2007 http://hea-www.harvard.edu/AstroStat/slog/2007/arxiv-1st-week-nov-2007/ http://hea-www.harvard.edu/AstroStat/slog/2007/arxiv-1st-week-nov-2007/#comments Fri, 02 Nov 2007 21:59:08 +0000 hlee http://hea-www.harvard.edu/AstroStat/slog/2007/arxiv-1st-week-nov-2007/ To be exact, the title of this posting should contain 5th week, Oct, which seems to be the week of EGRET. In addition to astro-ph papers, although they are not directly related to astrostatistics, I include a few statistics papers which may be profitable for astronomical data analysis.

  • [astro-ph:0710.4966]
    Uncertainties of the antiproton flux from Dark Matter annihilation in comparison to the EGRET excess of diffuse gamma rays by Iris Gebauer
  • [astro-ph:0710.5106]
    The dark connection between the Canis Major dwarf, the Monoceros ring, the gas flaring, the rotation curve and the EGRET excess of diffuse Galactic Gamma Rays by W. de Boer et.al.
  • [astro-ph:0710.5119]
    Determination of the Dark Matter profile from the EGRET excess of diffuse Galactic gamma radiation by Markus Weber
  • [astro-ph:0710.5171]
    Systematic Bias in Cosmic Shear: Beyond the Fisher Matrix by A.Amara and A. Refregier
  • [astro-ph:0710.5560]
    Principal Component Analysis of the Time- and Position-Dependent Point Spread Function of the Advanced Camera for Surveys by M.J. Jee et.al.
  • [astro-ph:0710.5637]
    A method of open cluster membership determination by G. Javakhishvili et.al.
  • [stat.CO:0710.5670]
    An Elegant Method for Generating Multivariate Poisson Data by I. Yahav and G.Shmueli
  • [astro-ph:0710.5788]
    Variations in Stellar Clustering with Environment: Dispersed Star Formation and the Origin of Faint Fuzzies by B. G. Elmegreen
  • [math.ST:0710.5749]
    On the Laplace transform of some quadratic forms and the exact distribution of the sample variance from a gamma or uniform parent distribution by T.Royen
  • [math.ST:0710.5797]
    The Distribution of Maxima of Approximately Gaussian Random Fields by Y. Nardi, D.Siegmund and B.Yakir
  • [astro-ph:0711.0177]
    Maximum Likelihood Method for Cross Correlations with Astrophysical Sources by R.Jansson and G. R. Farrar
  • [stat.ME:0711.0198]
    A Geometric Approach to Confidence Sets for Ratios: Fieller’s Theorem, Generalizations, and Bootstrap by U. von Luxburg and V. H. Franz
]]>
http://hea-www.harvard.edu/AstroStat/slog/2007/arxiv-1st-week-nov-2007/feed/ 1
[ArXiv] An unbiased estimator, May 29, 2007 http://hea-www.harvard.edu/AstroStat/slog/2007/arxiv-unbiased-estimator/ http://hea-www.harvard.edu/AstroStat/slog/2007/arxiv-unbiased-estimator/#comments Tue, 30 Oct 2007 07:37:07 +0000 hlee http://hea-www.harvard.edu/AstroStat/slog/2007/arxiv-unbiased-estimator/ From arxiv/astro-ph:0705.4199v1
In search of an unbiased temperature estimator for statistically poor X-ray spectra
A. Leccardi and S. Molendi

There was a delay of writing about this paper, which by accident was lying under the pile of papers irrelevant to astrostatistics. (It has been quite overwhelming to track papers with various statistical applications and papers with rooms left for statistical improvements from arxiv:astro-ph). Although there is a posting about this paper (see Vinay’s posting), I’d like to give a shot. I was very excited because I haven’t seen any astronomical papers discussing unbiased estimators solely.

By the same token that the authors discussed bias in the χ^2 method and the maximum likelihood estimator, we know that the χ^2 function is not always symmetric for applying Δχ^2 =1 for a 68% confidence interval. The nominal level interval from the Δχ^2 method does not always provide the nominal coverage when the given model to be fitted does not satisfy the (regularity) conditions for approximating χ^2 distribution. The χ^2 best fit does not always observe the (in probability or almost sure) convergence to the true parameter, i.e. biased so that the coverage level misleads the information of the true parameter. The illustration of the existence of bias in traditional estimators in high energy astronomy is followed by authors’ proposals of unbiased (temperature) estimators via (variable) transformation.

Transformation is one way of reducing bias (e.g. Box-Cox transformation or power transformation is a common practice in introductory statistics to make residuals more homogeneous). Transformation leads an asymmetric distribution to (asymptotically) symmetric. Different from the author’s comment (the parametric bootstrap reached no improvement in bias reduction), reducing bias from computing likelihoods (Cash statistics) can be achieved by statistical subsampling methods, like cross-validation, jackknife, and bootstrap upon careful designs of subsampling schemes (instead of parametric bootstrap, nonparametric bootstrap could yield a different conclusion). Penalized likelihood, instead of L_2 norm (the χ^2 measure is L_2), L_1 norm penalty helps to reduce bias as well.

One of the useful discussions about unbiased estimators is the comparison between the χ^2 best fit method and Cash statistics (Maximum Poisson Likelihood Estimator). Overall, Cash statistics excels the χ^2 best fit method. Neither of these two methods overcome bias from low counts, small exposure time, background level, and asymmetry pdf (probability density function) in T(temperature), their parameter of interest. Their last passage to obtain an unbiased estimator was taking a nonparametric approach to construct a mixture model from three pdf’s to estimate the uncertainty. They concluded the results from the mixing distributions were excellent. This mixing distributions takes an effect of reducing errors by averaging. Personally, their saying “the only method that returns the expected temperature under very different situations” seems to be overstated. Either designing more efficient mixing distributions (unequal weighting triplets than equal weights) or defining M-estimators upon understanding three EPIC instruments would produce better degrees of unbiasedness.

Note that the maximum likelihood estimator (MLE) is a consistent estimator (asymptotically unbiased) under milder regularity conditions in contrast to the χ^2 best fit estimator. Instead of stating that MLE can be biased, it would have been better to discuss the suitability of regularity conditions to source models built on Poisson photon counts for estimating temperatures and XSPEC estimation procedures.

Last, I’d like to quote their question as it is:

What are the effects of pure statistical uncertainties in determining interesting parameters of highly non linear models (e.g. the temperature of th ICM), when we analyze spectra accumulated from low surface brightness regions using current X-ray experiments?

Although the authors tried to answer this question, my personal opinion is that they were not able to fully account the answer but left a spacious room for estimating statistical uncertainty and bias rigorously in high energy astrophysics with more statistical care (e.g. instead of MLE or Cash statistics, we could develop more robust but unbiased M-estimator).

]]>
http://hea-www.harvard.edu/AstroStat/slog/2007/arxiv-unbiased-estimator/feed/ 0
[ArXiv] Geneva-Copenhagen Survey, July 13, 2007 http://hea-www.harvard.edu/AstroStat/slog/2007/arxiv-geneva-copenhagen-survey/ http://hea-www.harvard.edu/AstroStat/slog/2007/arxiv-geneva-copenhagen-survey/#comments Sun, 05 Aug 2007 05:25:45 +0000 hlee http://hea-www.harvard.edu/AstroStat/slog/2007/arxiv-geneva-copenhagen-survey-july-13-2007/ From arxiv/astro-ph:0707.1891v1
The Geneva-Copenhagen Survey of the Solar neighborhood II. New uvby calibrations and rediscussion of stellar ages, the G dwarf problem, age-metalicity diagram, and heating mechanisms of the disk by Holmberg, Nordstrom, and Andersen

Researchers, including scientists from CHASC, working on color magnitude diagrams to infer ages, metalicities, temperatures, and other physical quantities of stars and stellar clusters may find this paper useful.

Methodologies for temperature calibration (fairly accurate estimation from V-K and a new calibration from b-y), metallicity calibration, absolute magnitude/distance calibration, interstellar reddening, and stellar ages were presented with reviews on stellar models and their parameters, astrophysical calibration errors, metalicity distribution function, age-metallicity diagram, age-velocity relation, and thin disk vs thick disk. It seems like that the previous methodologies for F and G stars need to be revised.

Despite my incapability of full understanding the theory of star formation history and the uncertainties of calibrations (looks like that all go toward regression problems to me), this paper fully manifests the complexity of the stellar models and their calibration process. From a statistical perspective, the complexity of the stellar models and calibrations comes from many predictors and only a few response variables with uncertainties (even more they are heteroskedastic). Furthermore, the relationship between predictors and response variables is sparsely known, which makes fitting the model to a star or a stellar cluster or inferencing physical information from them difficult. The mapping is considered to be highly structured black box and its required careful investigations.

I’d rather end this very technical preprint by citing a sentence:

The question of interest is therefore how well these relations and their intrinsic scatter can be determined from the observations

[hlee: Instead of determining, modeling seems to reflect the flexibilities and uncertainties.]

]]>
http://hea-www.harvard.edu/AstroStat/slog/2007/arxiv-geneva-copenhagen-survey/feed/ 0