Tag Archives: Machine Learning

The Big Data ISTC: A Retrospection by Michael Stonebraker, Samuel Madden and Timothy Mattson

The Big Data ISTC is a research project sponsored by Intel that ran for five years (August 2012- August 2017).  This blog post highlights some of the accomplishments and lessons learned during this period. Big data is usually categorized into … Continue reading

Posted in Analytics, Benchmarks, Big Data Applications, Big Data Architecture, Data Management, Databases and Analytics, DBMS, ISTC for Big Data Blog, Polystores, Query Engines, Storage, Streaming Big Data, Tools for Big Data, Visualizing Big Data | Tagged , , , , , , , , , , , , , , , , , , , , , , | Leave a comment

VisualCloud: A DBMS for Virtual Reality

By Brandon Haynes, Artem Minyaylov, Magdalena Balazinska, Luis Ceze and Alvin Cheung of the University of Washington Our ability to collect videos en masse can revolutionize how we interact with the world by enabling powerful virtual reality (VR) video applications … Continue reading

Posted in Big Data Applications, Data Management, DBMS, ISTC for Big Data Blog, Streaming Big Data | Tagged , , , , , , , , , , | Leave a comment

Improving Clinical Decision-Making with Big Data

An Interview with Peter Szolovits, MIT CSAIL Doctors, nurses and other healthcare professionals have always had to “read” and respond quickly to often-imperfect data under stressful circumstances. What has changed over time is the volume and types of data that … Continue reading

Posted in Big Data Applications, ISTC for Big Data Blog | Tagged , , , , , , | Leave a comment

ModelDB: A System for Managing Machine Learning Models

By Manasi Vartak, Harihar Subramanyam, Wei-En Lee, Srinidhi Viswanathan, Saadiyah Husnoo, Sam Madden and Matei Zaharia, MIT CSAIL Building real-world machine learning (ML) algorithms is an iterative process. A data scientist will build many 10s to 100s of models before arriving … Continue reading

Posted in Big Data Applications, Big Data Architecture, ISTC for Big Data Blog, Tools for Big Data | Tagged , , | 4 Comments

Guaranteeing Query Runtimes for Analytics-as-a-Service

By Jennifer Ortiz and Magdalena Balazinska, University of Washington A variety of data analytics systems are available as cloud services today, including Amazon Elastic MapReduce (EMR), Redshift and Azure’s HDInsight. With these services, users have access to compute clusters that come … Continue reading

Posted in Analytics, Data Management, ISTC for Big Data Blog, Tools for Big Data | Tagged , , , , , | Leave a comment

Spreadsheets and Big Data

Five Questions with Database Expert Mike Cafarella Database expert Michael Cafarella, professor at the University of Michigan, keynoted February 1 at the annual New England Database Summit, hosted at MIT CSAIL in Cambridge, Massachusetts. Professor Cafarella’s presentation mesmerized the audience with the reality that the lowly spreadsheet is, … Continue reading

Posted in Analytics, Big Data Architecture, Databases and Analytics, DBMS, ISTC for Big Data Blog | Tagged , , , , | Leave a comment

Crowdsourcing Big Data

By Barzan Mozafari, Ph.D., MIT CSAIL Crowdsourcing has become a popular means of performing tasks that are difficult for computers, including entity resolution, audio transcription, image annotation, sentiment analysis, and document summarization and editing. Although humans are often more accurate … Continue reading

Posted in Big Data Architecture, Databases and Analytics, ISTC for Big Data Blog, Math and Algorithms, Tools for Big Data | Tagged , , , | Leave a comment