Intel and the ISTC for Big Data (2012-2017): A Powerful Collaboration

By Jeff Parkhurst, Ph.D. and Timothy G. Mattson, Ph.D., Intel  The year 2012 was arguably the year that Big Data went mainstream. Data was being hailed as a new class of economic asset, similar to currency or gold, from the … Continue reading

Posted in Benchmarks, Big Data Architecture, Data Management, Databases and Analytics, ISTC for Big Data Blog, Polystores, Streaming Big Data, Tools for Big Data, Visualizing Big Data | Tagged , , , , , , , , | Leave a comment

Polystore Databases to be Examined at IEEE, CIDR Conferences

Polystores, a more-modern approach to sharing heterogeneous data that addresses Big Data’s volume, variety and velocity demands, will be the topic of discussion at two upcoming conferences: The first IEEE Workshop on Methods to Manage Heterogeneous Big Data and Polystore Databases, … Continue reading

Posted in Big Data Applications, Big Data Architecture, Data Management, ISTC for Big Data Blog, Polystores, Tools for Big Data | Tagged , , , , , | Leave a comment

Genomics Data, Analytics and the Future of Climate Change

By Vijay Gadepally, MIT CSAIL, in collaboration with the Chisholm Laboratory at MIT Meet Prochlorococcus marinus, a marine cyanobacterium that’s intricately linked to the global carbon cycle, widely present in seawater, and possibly holds secrets to future climate change. These … Continue reading

Posted in Big Data Applications, Big Data Architecture, Data Management, Databases and Analytics, DBMS, Graph Computation, ISTC for Big Data Blog, Polystores, Streaming Big Data, Tools for Big Data, Visualizing Big Data | Tagged , , , , , , , , | Leave a comment

ModelDB: A System for Managing Machine Learning Models

By Manasi Vartak, Harihar Subramanyam, Wei-En Lee, Srinidhi Viswanathan, Saadiyah Husnoo, Sam Madden and Matei Zaharia, MIT CSAIL Building real-world machine learning (ML) algorithms is an iterative process. A data scientist will build many 10s to 100s of models before arriving … Continue reading

Posted in Big Data Applications, Big Data Architecture, ISTC for Big Data Blog, Tools for Big Data | Tagged , , | 4 Comments

PolyPEG: A Proposal for Polystore Optimization

By Dylan Hutchison, Bill Howe, Dan Suciu, and Zachary Tatlock, University of Washington There has been a “cambrian explosion” of systems and languages for large-scale data analytics:  Postgres and H-Store accept SQL queries; Datomic and Myria accept Datalog; SciDB accepts … Continue reading

Posted in Big Data Applications, Big Data Architecture, ISTC for Big Data Blog, Polystores, Tools for Big Data | Tagged , , , | Leave a comment

ForeCache: Raising the Bar in Big Data Visual Exploration

By Leilani Battle, MIT CSAIL In many discussions with scientists across a variety of specialties, we have found that interactive visualizations are important tools for helping people make sense of massive amounts of data. In particular, interactive visualizations are critical … Continue reading

Posted in Analytics, Big Data Applications, ISTC for Big Data Blog, Tools for Big Data, Visualizing Big Data | Tagged , , , , , , | Leave a comment

Guaranteeing Query Runtimes for Analytics-as-a-Service

By Jennifer Ortiz and Magdalena Balazinska, University of Washington A variety of data analytics systems are available as cloud services today, including Amazon Elastic MapReduce (EMR), Redshift and Azure’s HDInsight. With these services, users have access to compute clusters that come … Continue reading

Posted in Analytics, Data Management, ISTC for Big Data Blog, Tools for Big Data | Tagged , , , , , | Leave a comment

ISTC for Big Data Researchers Present at NEDB Day 2016

ISTC for Big Data principal investigators and researchers presented a broad base of research at North East Database Day 2016, which was sponsored by Microsoft and held at MIT’s Stata Center in Cambridge, Mass., January 28, 2016. The 8th Annual North … Continue reading

Posted in Analytics, Big Data Applications, Big Data Architecture, DBMS, ISTC for Big Data Blog, Polystores, Query Engines, Streaming Big Data, Tools for Big Data, Visualizing Big Data | Tagged , , , , , , , , , | Leave a comment

Query Modeling and Optimization in the BigDAWG Polystore System

By Jennie Duggan, Northwestern University At VLDB 2015, the ISTC for Big Data team presented a demo of the BigDAWG polystore system. This blog post highlights some of the research challenges we are exploring as we build this novel system. … Continue reading

Posted in Analytics, Big Data Architecture, Databases and Analytics, Graph Computation, ISTC for Big Data Blog, Polystores, Query Engines, Tools for Big Data | Tagged , , , , , | Leave a comment

Winning at Big Data: What’s Math Got to Do with It? (A Lot)

Big Data describes a new era in the digital age in which the volume, velocity and variety of data created across a wide range of fields – from Internet search and social media to finance and healthcare to defense and … Continue reading

Posted in Databases and Analytics, Graph Computation, ISTC for Big Data Blog, Math and Algorithms, Tools for Big Data | Tagged , , , , | Leave a comment