
news
recent and upcoming talks
 TBD. Yale University (Statistics), December 2, 2013, New Haven, CT.
 Design and analysis of experiments with interfering units. Simons Institute for the Theory of Computing, November 18-21, 2013, UC Berkeley, CA.
 Inference from nonignorable sampling designs. DIMACS, November 7-8, 2013, Piscataway, NJ.
 Inference and algorithms for high-throughput biology. Italian Embassy, October 29, 2013, Washington, DC.
 Estimating causal effects with interfering units. SAMSI, October 21-23, 2013, RTP, NC.
preprints (available upon request)
 The geometry of 2x2 contingency tables.
(java app)
(source code)
 Estimating a structured covariance matrix from multi-lab measurements in high-throughput biology.
(IBM best student paper award, NESS 2013)
 Poisson convolution on a tree of categories for modeling topical content.
(pdf)
(a shorter version appeared at ICML 2012)
 Generalized species sampling priors with latent beta reinforcements.
(pdf)
selected publications
(see my CV or Google Scholar for more publications and bibliographic details)
theory and methods for network data analysis
 Stochastic blockmodels with growing number of classes.
Biometrika, 2012.
(pdf)
 Confidence sets for network structure.
Statistical Analysis and Data Mining, 2011.
(pdf)
(a shorter version appeared at NIPS 2011)
 Graphlets decomposition of a weighted network.
Journal of Machine Learning Research, W&CP, 2011.
(pdf)
(MSR best student paper award, NESS 2012)
 Network sampling and classification: An investigation of network model representations.
Decision Support Systems, 2011.
(pdf)
 A survey of statistical network models.
Foundations and Trends in Machine Learning, 2010.
(pdf)
 Mixed-membership stochastic blockmodels.
Journal of Machine Learning Research, 2008.
(pdf)
(r code)
(fast code)
(John Van Ryzin award, 2006)
geometry and inference in ill-posed inverse problems
 Estimating latent processes on a network from indirect measurements.
Journal of the American Statistical Association, 2013.
(pdf)
(supp)
(r code)
(IBM best student paper award, NESS 2011)
 Polytope samplers for inference in ill-posed inverse problems.
Journal of Machine Learning Research, W&CP, 2011.
(pdf)
 Tree preserving embedding.
Proceedings of the National Academy of Sciences, 2011.
(pdf)
(r code)
(a shorter version appeared at ICML 2011)
modeling and inference in high-throughput biology
 Multi-way blockmodels for analyzing coordinated high-dimensional responses.
Annals of Applied Statistics, in press.
(pdf)
(supp)
 Analysis and design of RNA sequencing experiments for identifying mRNA isoform regulation.
Nature Methods, 2010.
(pdf)
(supp)
(code)
 Ranking relations using analogies in biological and information networks.
Annals of Applied Statistics, 2010.
(pdf)
(code)
 Predicting cellular growth from gene expression signatures.
PLoS Computational Biology, 2009.
(pdf)
(code & data)
(a shorter version appeared at NIPS 2008)
 Getting started in probabilistic graphical models.
PLoS Computational Biology, 2007.
(pdf)
applications in molecular biology
 Quantifying condition-dependent intracellular protein levels enables high-precision fitness estimates.
PLoS One, 2013.
(pdf)
 A conserved cell growth cycle can account for the environmental stress responses of divergent eukaryotes.
Molecular Biology of the Cell, 2012.
(pdf)
 Systems-level dynamic analyses of fate change in murine embryonic stem cells.
Nature, 2009.
(pdf)
(supp)
(F1000)
(news & views, Nat BT)
(editor's choice, Sci Sig)
 Coordination of growth rate, cell cycle, stress response and metabolic activity in yeast.
Molecular Biology of the Cell, 2008.
(pdf)
(code & data)
applications in computational social science
 Discussion of Hennig and Liao 'How to find an appropriate clustering for mixed-type variables with application to socioeconomic stratification'.
Journal of the Royal Statistical Society, Series C, 2013.
(pdf)
(article)
 Reconceptualizing the classification of PNAS articles.
Proceedings of the National Academy of Sciences, 2010.
(pdf)
(editorial feature)
 Whose ideas? Whose words? Authorship of the Ronald Reagan radio addresses.
Political Science & Politics, 2007.
(pdf)
(op-ed by Skinner & Rice)
 Who wrote Ronald Reagan's radio addresses?
Bayesian Analysis, 2006.
(pdf)
(tr with detailed predictions)
(notes on Negative Binomial)
theses
 Bayesian mixed-membership models of complex and evolving networks.
Doctoral dissertation, 2007.
(Savage award honorable mention, 2007)
 The theory of weak convergence of probability measures and its applications in statistics.
Undergraduate thesis, 1999.
(Gold medal for best graduates, 1999)
