The Stanford Data Science Initiative (SDSI) is a university-wide organization focused on core data technologies with strong ties to application areas across campus. SDSI comprises methods research, infrastructure, and education.
Recently there has been a paradigm shift in the way data is used. Today researchers are mining data for patterns and trends that lead to new hypotheses. This shift is caused by the huge volumes of data available from web query logs, social media posts and blogs, satellites, sensors, and medical devices. See our new Data Science Research at Stanford 2017-18 for more information.
Data-centered research faces many challenges. Current data management and analysis techniques do not scale to the huge volumes of data that we expect in the future. New analysis techniques that use machine learning and data mining require careful tuning and expert direction. In order to be effective, data analysis must be combined with knowledge from domain experts. Future breakthroughs will often require intimate and combined knowledge of algorithms, data management, the domain data, and the intended applications.
SDSI consists of data science research, shared data and computing infrastructure, shared tools and techniques, industrial links, and education. SDSI has strong ties to groups across Stanford University such as medicine, computational social science, biology, energy, and theory.
The Working Group is responsible for establishing the Stanford Data Science Initiative. The members are coordinating with data science researchers across the university, meeting with prospective corporate members, and defining the research agenda and structure of this initiative.
SDSI is proud to be a supporter of Women in Data Science (WiDS).
Learn more in coverage from Forbes: 100,000 People Will Attend Global Women in Data Science Conference.