Modeling Unobserved Sources of Heterogeneity in Animal Abundance Using a Dirichlet Process Prior

In this paper, we let the data speak for itself about the existence of volatility feedback and the often debated risk–return relationship. We do this by modeling the contemporaneous relationship between market excess returns and log-realized variances with a nonparametric, infinitely-ordered, mixture representation of the observables’ joint distribution. Our nonparametric estimator allows for deviation from conditional Gaussianity through non-zero, higher ordered, moments, like asymmetric, fat-tailed behavior, along with smooth, nonlinear, risk–return relationships. We use the parsimonious and relatively uninformative Bayesian Dirichlet process prior to overcoming the problem of having too many unknowns and not enough observations. Applying our Bayesian nonparametric model to more than a century’s worth of monthly US stock market returns and realized variances, we find strong, robust evidence of volatility feedback. Once volatility feedback is accounted for, we find an unambiguous positive, nonlinear, relationship between expected excess returns and expected log-realized variance. In addition to the conditional mean, volatility feedback impacts the entire joint distribution.

Download Full-text

Experience rating with Poisson mixtures

Annals of Actuarial Science ◽

10.1017/s1748499515000019 ◽

2015 ◽

Vol 9 (2) ◽

pp. 304-321 ◽

Cited By ~ 4

Author(s):

Garfield O. Brown ◽

Winston S. Buckley

Keyword(s):

Life Insurance ◽

Dirichlet Process ◽

Posterior Probability ◽

Mixture Distribution ◽

Reversible Jump ◽

Dirichlet Process Prior ◽

Poisson Mixtures ◽

Group Life ◽

Insurance Portfolio ◽

Poisson Mixture Model

AbstractWe propose a Poisson mixture model for count data to determine the number of groups in a Group Life insurance portfolio consisting of claim numbers or deaths. We take a non-parametric Bayesian approach to modelling this mixture distribution using a Dirichlet process prior and use reversible jump Markov chain Monte Carlo to estimate the number of components in the mixture. Unlike Haastrup, we show that the assumption of identical heterogeneity for all groups may not hold as 88% of the posterior probability is assigned to models with two or three components, and 11% to models with four or five components, whereas models with one component are never visited. Our major contribution is showing how to account for both model uncertainty and parameter estimation within a single framework.

Download Full-text

On several estimates to the precision parameter of Dirichlet process prior

Communications in Statistics - Simulation and Computation ◽

10.1080/03610918.2015.1078473 ◽

2016 ◽

Vol 46 (4) ◽

pp. 3187-3200

Author(s):

Xueqin Zhou ◽

Lei Yang ◽

Xianyi Wu

Keyword(s):

Dirichlet Process ◽

Dirichlet Process Prior

Download Full-text

A matrix-variate dirichlet process to model earthquake hypocentre temporal patterns

Statistical Modelling ◽

10.1177/1471082x20939767 ◽

2020 ◽

pp. 1471082X2093976

Author(s):

Meredith A. Ray ◽

Dale Bowman ◽

Ryan Csontos ◽

Roy B. Van Arsdale ◽

Hongmei Zhang

Keyword(s):

Dirichlet Process ◽

Temporal Pattern ◽

Temporal Trends ◽

Temporal Patterns ◽

The United States ◽

Seismic Zone ◽

Clustering Methods ◽

Dirichlet Process Prior ◽

And Performance ◽

Over Time

Earthquakes are one of the deadliest natural disasters. Our study focuses on detecting temporal patterns of earthquakes occurring along intraplate faults in the New Madrid seismic zone (NMSZ) within the middle of the United States from 1996–2016. Based on the magnitude and location of each earthquake, we developed a Bayesian clustering method to group hypocentres such that each group shared the same temporal pattern of occurrence. We constructed a matrix-variate Dirichlet process prior to describe temporal trends in the space and to detect regions showing similar temporal patterns. Simulations were conducted to assess accuracy and performance of the proposed method and to compare to other commonly used clustering methods such as Kmean, Kmedian and partition-around-medoids. We applied the method to NMSZ data to identify clusters of temporal patterns, which represent areas of stress that are potentially migrating over time. This information can then be used to assist in the prediction of future earthquakes.

Download Full-text

Bayesian estimation of semiparametric nonlinear dynamic factor analysis models using the Dirichlet process prior

British Journal of Mathematical and Statistical Psychology ◽

10.1348/000711010x497262 ◽

2011 ◽

Vol 64 (1) ◽

pp. 69-106 ◽

Cited By ~ 28

Author(s):

Sy-Miin Chow ◽

Niansheng Tang ◽

Ying Yuan ◽

Xinyuan Song ◽

Hongtu Zhu

Keyword(s):

Factor Analysis ◽

Bayesian Estimation ◽

Nonlinear Dynamic ◽

Dirichlet Process ◽

Dynamic Factor Analysis ◽

Dynamic Factor ◽

Dirichlet Process Prior ◽

Analysis Models

Download Full-text

Spiked Dirichlet process prior for Bayesian multiple hypothesis testing in random effects models

Bayesian Analysis ◽

10.1214/09-ba426 ◽

2009 ◽

Vol 4 (4) ◽

pp. 707-732 ◽

Cited By ~ 18

Author(s):

Sinae Kim ◽

David B. Dahl ◽

Marina Vannucci

Keyword(s):

Hypothesis Testing ◽

Random Effects ◽

Dirichlet Process ◽

Multiple Hypothesis Testing ◽

Dirichlet Process Prior ◽

Random Effects Models ◽

Multiple Hypothesis

Download Full-text

A Bayesian Nonparametric Model for Inferring Subclonal Populations from Structured DNA Sequencing Data

10.1101/2020.11.10.330183 ◽

2020 ◽

Author(s):

Shai He ◽

Aaron Schein ◽

Vishal Sarsani ◽

Patrick Flaherty

Keyword(s):

Dna Sequencing ◽

Single Cell ◽

Dirichlet Process ◽

Lymphoblastic Leukemia ◽

Nonparametric Model ◽

Dirichlet Process Mixture ◽

Sequencing Data ◽

Hierarchical Dirichlet Process ◽

Dirichlet Process Prior

There are distinguishing features or “hallmarks” of cancer that are found across tumors, individuals, and types of cancer, and these hallmarks can be driven by specific genetic mutations. Yet, within a single tumor there is often extensive genetic heterogeneity as evidenced by single-cell and bulk DNA sequencing data. The goal of this work is to jointly infer the underlying genotypes of tumor subpopulations and the distribution of those subpopulations in individual tumors by integrating single-cell and bulk sequencing data. Understanding the genetic composition of the tumor at the time of treatment is important in the personalized design of targeted therapeutic combinations and monitoring for possible recurrence after treatment.We propose a hierarchical Dirichlet process mixture model that incorporates the correlation structure induced by a structured sampling arrangement and we show that this model improves the quality of inference. We develop a representation of the hierarchical Dirichlet process prior as a Gamma-Poisson hierarchy and we use this representation to derive a fast Gibbs sampling inference algorithm using the augment-and-marginalize method. Experiments with simulation data show that our model outperforms standard numerical and statistical methods for decomposing admixed count data. Analyses of real acute lymphoblastic leukemia cancer sequencing dataset shows that our model improves upon state-of-the-art bioinformatic methods. An interpretation of the results of our model on this real dataset reveals co-mutated loci across samples.

Download Full-text

Bayesian co-estimation of selfing rate and locus-specific mutation rates for a partially selfing population

10.1101/020537 ◽

2015 ◽

Author(s):

Benjamin D Redelings ◽

Seiji Kumagai ◽

Liuyang Wang ◽

Andrey Tatarenkov ◽

Ann K. Sakai ◽

...

Keyword(s):

Mating System ◽

Dirichlet Process ◽

Mutation Rates ◽

Common Mutation ◽

Dirichlet Process Prior ◽

Coalescence Model ◽

Specific Mutation ◽

Sampling Formula ◽

Self Fertilization ◽

Infinite Alleles Model

We present a Bayesian method for characterizing the mating system of populations reproducing through a mixture of self-fertilization and random outcrossing. Our method uses patterns of genetic variation across the genome as a basis for inference about pure hermaphroditism, androdioecy, and gynodioecy. We extend the standard coalescence model to accommodate these mating systems, accounting explicitly for multilocus identity disequilibrium, inbreeding depression, and variation in fertility among mating types. We incorporate the Ewens Sampling Formula (ESF) under the infinite-alleles model of mutation to obtain a novel expression for the likelihood of mating system parameters. Our Markov chain Monte Carlo (MCMC) algorithm assigns locus-specific mutation rates, drawn from a common mutation rate distribution that is itself estimated from the data using a Dirichlet Process Prior model. Among the parameters jointly inferred are the population-wide rate of self-fertilization, locus-specific mutation rates, and the number of generations since the most recent outcrossing event for each sampled individual.

Download Full-text

Optimal selection based on relative ranks of a sequence by ties

Advances in Applied Probability ◽

10.1017/s0001867800022382 ◽

1984 ◽

Vol 16 (01) ◽

pp. 131-146

Author(s):

Gregory Campbell

Keyword(s):

Distribution Function ◽

Dirichlet Process ◽

Random Distribution ◽

Discrete Distribution ◽

Optimal Selection ◽

Dirichlet Process Prior ◽

Limiting Behavior ◽

Relative Rank ◽

Atomic Parameter ◽

Selection Of

The optimal selection of a maximum of a sequence with the possibility of ties is considered. The object is to examine each observation in the sequence of known length n and, based only on the relative rank among predecessors, either to stop and select it as a maximum or to continue without recall. Rules which maximize the probability of correctly selecting a maximum from a sequence with ties are investigated. These include rules which randomly break ties, rules which discard tied observations, and minimax rules based on the atoms of a discrete distribution function. If the sequence is random from F, a random distribution function from a Dirichlet process prior with non-atomic parameter, optimal rules are developed. The limiting behavior of these rules is studied and compared with other rules. The selection of the parameter of the Dirichlet process regulates the ties.

Download Full-text