The Stanford Data Science Initiative (SDSI) is a university-wide organization focused on core data technologies with strong ties to application areas across campus.  SDSI comprises methods research, infrastructure, and education.

Recently there has been a paradigm shift in the way data is used. Today researchers mine data for patterns and trends that lead to new hypotheses. This shift is driven by the huge volumes of data available from web query logs, social media posts and blogs, satellites, sensors, and medical devices. See our new Data Science Research at Stanford 2017-18 for more information.

Data-centered research faces many challenges.  Current data management and analysis techniques do not scale to the huge volumes of data that we expect in the future.  New analysis techniques that use machine learning and data mining require careful tuning and expert direction.  In order to be effective, data analysis must be combined with knowledge from domain experts.  Future breakthroughs will often require intimate and combined knowledge of algorithms, data management, the domain data, and the intended applications.

SDSI consists of data science research, shared data and computing infrastructure, shared tools and techniques, industrial links, and education.  SDSI has strong ties to groups across Stanford University such as medicine, computational social science, biology, energy, and theory.

Contact Steve Eglash, Executive Director, for further information.


Working Group

The Working Group is responsible for establishing the Stanford Data Science Initiative.  The members are coordinating with data science researchers across the university, meeting with prospective corporate members, and defining the research agenda and structure of this initiative.

Hector Garcia-Molina, Director, and Professor, Electrical Engineering and Computer Science

Steve Eglash, Executive Director

Russ Altman, Professor, Bioengineering, Genetics, and Medicine

Euan Ashley, Associate Professor, Medicine and Genetics

Carlos Bustamante, Professor, Genetics

Margot Gerritsen, Associate Professor, Energy Resources Engineering, and Director, Institute for Computational and Mathematical Engineering

Ashish Goel, Professor, Management Science and Engineering

Trevor Hastie, Professor, Statistics and Health Research and Policy

Jure Leskovec, Assistant Professor, Computer Science

Lester Mackey, Assistant Professor, Statistics

Dan McFarland, Associate Professor, Education and Sociology, and Director, Center for Computational Social Science

Balasubramanian Narasimhan, Senior Research Scientist, Statistics and Health Research and Policy

Kunle Olukotun, Professor, Electrical Engineering and Computer Science, and Director, Pervasive Parallelism Laboratory

Vijay Pande, Professor, Chemistry

Balaji Prabhakar, Professor, Electrical Engineering and Computer Science, and Director, Stanford Center for Societal Networks

Chris Re, Assistant Professor, Computer Science


SDSI is proud to be a supporter of Women in Data Science (WiDS).
Learn more in coverage from Forbes: "100,000 People Will Attend Global Women in Data Science Conference."