Workshop in Biostatistics

M112 Alway Building, Medical Center

(next to the Dean's courtyard)

DATE: February 9, 2017
TIME: 1:30 - 2:50 pm
TITLE: It's the Data, Stupid
SPEAKER: Mark Cullen
Professor of Medicine (General Internal Medicine), of Biomedical Data Science,
of Health Research and Policy (Epidemiology)
and Senior Fellow at the Stanford Institute for Economic Policy Research


Much of our attention as data scientists focuses on data quality, the selection of available covariates for analysis, finding the best-fitting models for the data, and developing strategies for causal inference where possible. In this talk I focus on the potential for developing datasets themselves. Using the example of the Alcoa dataset, on which I have worked for two decades, I demonstrate that what may appear at first blush to be a limited dataset could, with some foresight, be parlayed into a trove of linked data, opening up myriad research opportunities that may have been entirely obscure at the outset.
