Session 1 - Introduction to Big Data, methodology, and ecosystems

What is Big Data?

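The meaning of the term has broadened over time, in both industry and research, as storage, computing, and data pipelines changed. The definitions below, listed in chronological order, show that shift.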

"Data storage is growing at a higher rate than ever before, and coupled with rapidly increasing demand for instant access, will cause great stress on both the physical and the human infrastructure of computing."

(Mashey, 1999)

"Big data refers to datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze."

(Manyika et al., 2011)

"Big Data is a cultural, technological, and scholarly phenomenon that rests on the interplay of technology, analysis, and mythology."

(boyd & Crawford, 2012)

"Big Data consists of extensive datasets that require a scalable architecture for efficient storage, manipulation, and analysis because of data volume, variety, velocity, and/or variability."

(NIST, 2015)