The Big Data ISTC: A Retrospection by Michael Stonebraker, Samuel Madden and Timothy Mattson

The Big Data ISTC is a research project sponsored by Intel that ran for five years (August 2012 to August 2017). This blog post highlights some of the accomplishments and lessons learned during this period. Big data is usually categorized into … Continue reading

Posted in Analytics, Benchmarks, Big Data Applications, Big Data Architecture, Data Management, Databases and Analytics, DBMS, ISTC for Big Data Blog, Polystores, Query Engines, Storage, Streaming Big Data, Tools for Big Data, Visualizing Big Data

ISTC Researchers to Present Papers at VLDB 2017 Conference

ISTC for Big Data principal investigators, researchers, and their students will present a number of papers at the 2017 International Conference on Very Large Databases (VLDB 2017) in Munich, Germany, August 28-September 1, 2017. (To read any of the papers, … Continue reading

Posted in Analytics, Big Data Architecture, Data Management, Databases and Analytics, DBMS, ISTC for Big Data Blog, Visualizing Big Data

Intel and the ISTC for Big Data (2012-2017): A Powerful Collaboration

By Jeff Parkhurst, Ph.D. and Timothy G. Mattson, Ph.D., Intel

The year 2012 was arguably the year that Big Data went mainstream. Data was being hailed as a new class of economic asset, similar to currency or gold, from the … Continue reading

Posted in Benchmarks, Big Data Architecture, Data Management, Databases and Analytics, ISTC for Big Data Blog, Polystores, Streaming Big Data, Tools for Big Data, Visualizing Big Data

ISTC Releases Open Source Code for BigDAWG Polystore System

By Dr. Tim Mattson, Intel and Dr. Vijay Gadepally and Kyle O’Brien, MIT Lincoln Laboratory

Today, the ISTC for Big Data released the first version of BigDAWG, our polystore system for simplifying integration and analytics of disparate data at scale. BigDAWG is … Continue reading

Posted in Analytics, Big Data Architecture, Data Management, Databases and Analytics, DBMS, ISTC for Big Data Blog, Polystores, Query Engines, Storage

ISTC Researchers Present Work at NEDB Day 2017

ISTC for Big Data principal investigators, researchers and their students presented work at North East Database Day 2017, held at MIT’s Stata Center in Cambridge, Mass., on January 27, 2017. Microsoft and Facebook sponsored the event. The 9th Annual North East … Continue reading

Posted in Big Data Architecture, Data Management, Databases and Analytics, DBMS, ISTC for Big Data Blog, Storage, Streaming Big Data, Visualizing Big Data

Interactive Search and Exploration over Large Multidimensional Data

By Alexander Kalinin, Ugur Cetintemel and Stan Zdonik of Brown University

In the Big Data era, professionals across scientific areas need efficient, interactive ad-hoc data analysis. Ideally, they need generic and reusable systems tools for interactive search, exploration and mining … Continue reading

Posted in Analytics, Big Data Applications, Data Management, Databases and Analytics, DBMS, ISTC for Big Data Blog

Analytic Monitoring for the Internet of Things

By Peter Bailis, Stanford Infolab, and Sam Madden, MIT CSAIL

An increasing proportion of data today is generated by automated processes, sensors and devices, collectively called the Internet of Things (IoT). Inexpensive hardware, widespread access to communication networks, and decreased … Continue reading

Posted in Analytics, Big Data Applications, Big Data Architecture, Databases and Analytics, ISTC for Big Data Blog, Streaming Big Data

Genomics Data, Analytics and the Future of Climate Change

By Vijay Gadepally, MIT CSAIL, in collaboration with the Chisholm Laboratory at MIT

Meet Prochlorococcus marinus, a marine cyanobacterium that is intricately linked to the global carbon cycle, is widely present in seawater, and possibly holds secrets to future climate change. These … Continue reading

Posted in Big Data Applications, Big Data Architecture, Data Management, Databases and Analytics, DBMS, Graph Computation, ISTC for Big Data Blog, Polystores, Streaming Big Data, Tools for Big Data, Visualizing Big Data

Decibel: Dataset Branching for Collaborative Data Management

By Michael Maddox, MIT CSAIL, and Aaron J. Elmore, University of Chicago*

In Big Data’s wake has come demand for tools to curate, manage and analyze shared datasets collaboratively. For instance, consider researchers in a social media company concurrently working … Continue reading

Posted in Benchmarks, Data Management, Databases and Analytics, ISTC for Big Data Blog

Just-in-time Data Transformation and Migration in Polystores

By Aaron Elmore and Adam Dziedzic, University of Chicago

Organizations face the challenge of managing and deriving value from an ever-growing amount of data. Beyond its size, this data is often varied in both structure (e.g., relational data, linked data, numerical data and streaming data) … Continue reading

Posted in Analytics, Big Data Architecture, Data Management, Databases and Analytics, DBMS, Graph Computation, ISTC for Big Data Blog, Polystores, Query Engines, Streaming Big Data