[vimeo 109851729 w=720 h=404]
watch time: 12 minutes
Besides being a ‘legal’ mechanism for universities to get software out into the world, open source provided a multiplier effect for Apache Spark, which came out of the University of California, Berkeley AMPLab that Michael Franklin directs. That multiplier effect allowed 175 people writing code (instead of the 30 people in the lab who were working part-time on it) — and it allowed their work to further impact the world of big data and data analytics.
In this video, Franklin shares the story of Spark; the Hadoop (which provides storage and scaleable processing with MapReduce) ecosystem; and the “badass” — as in the Berkeley Data Analytics Stack — whose goal it is to build a new analytics infrastructure spanning the Algorithms (machine learning), Machines (cluster computing), and People (crowdsourcing computation and analysis) approaches that make up the ‘AMP’ Lab.
Michael Franklin