The Big Data ISTC: A Retrospection by Michael Stonebraker, Samuel Madden and Timothy Mattson

The Big Data ISTC is a research project sponsored by Intel that ran for five years (August 2012- August 2017).  This blog post highlights some of the accomplishments and lessons learned during this period. Big data is usually categorized into … Continue reading

Posted in Analytics, Benchmarks, Big Data Applications, Big Data Architecture, Data Management, Databases and Analytics, DBMS, ISTC for Big Data Blog, Polystores, Query Engines, Storage, Streaming Big Data, Tools for Big Data, Visualizing Big Data | Tagged , , , , , , , , , , , , , , , , , , , , , , | Leave a comment

ISTC Releases Open Source Code for BigDAWG Polystore System

By Dr. Tim Mattson, Intel and Dr. Vijay Gadepally and Kyle O’Brien, MIT Lincoln Laboratory Today, the ISTC for Big Data released the first version of BigDAWG, our polystore system for simplifying integration and analytics of disparate data at scale. BigDAWG is … Continue reading

Posted in Analytics, Big Data Architecture, Data Management, Databases and Analytics, DBMS, ISTC for Big Data Blog, Polystores, Query Engines, Storage | Tagged , , , , , , , , | Leave a comment

ISTC Researchers Present Work at NEDB Day 2017

ISTC for Big Data principal investigators, researchers and their students presented work at North East Database Day 2017, held at MIT’s Stata Center in Cambridge, Mass., on January 27, 2017. Microsoft and Facebook sponsored the event. The 9th Annual North East … Continue reading

Posted in Big Data Architecture, Data Management, Databases and Analytics, DBMS, ISTC for Big Data Blog, Storage, Streaming Big Data, Visualizing Big Data | Tagged , , , , , , , , , , | Leave a comment

NVMRocks: RocksDB on Non-Volatile Memory Systems

By Jianhong Li (CMU), Andrew Pavlo (CMU), and Siying Dong (Facebook) Non-volatile memory (NVM) has been a game-changing memory technology. In contrast to traditional block-based durable storage devices, it provides low latency comparable to DRAM and byte-addressability. Although NVM is … Continue reading

Posted in Big Data Architecture, DBMS, ISTC for Big Data Blog, Storage | Tagged , , , , , | 2 Comments

Write-Behind Logging

By Joy Arulraj, Matthew Perron, and Andrew Pavlo, Carnegie Mellon University In a joint collaboration between Carnegie Mellon University and Intel Labs, we explore the changes required in the logging and recovery algorithms in non-volatile memory database management systems (DBMSs). The results of this work … Continue reading

Posted in Big Data Architecture, Data Management, DBMS, ISTC for Big Data Blog, Math and Algorithms, Storage | Tagged , , , , , , , | Leave a comment

Larger-than-Memory Data Management on Modern Storage Hardware for In-Memory OLTP Database Systems

By Lin Ma, Carnegie Mellon University; Joy Arulraj, Carnegie Mellon University; Sam Zhao, Brown University; Andrew Pavlo, Carnegie Mellon University; Subramanya R. Dulloor, Intel Labs; Michael J. Giardino, Georgia Institute of Technology; Jeff Parkhurst, Jason L. Gardner, Kshitij Doshi, Intel Labs; and Col. Stanley Zdonik, Brown … Continue reading

Posted in Big Data Architecture, Data Management, DBMS, ISTC for Big Data Blog, Storage | Tagged , , , , , | Leave a comment

2015: Momentum, Moments and Memories

Greetings of the season from the Intel Science and Technology Center for Big Data.  As 2015 comes to a close, we thought we would share some moments and memories that were captured here in the ISTC for Big Data blog … Continue reading

Posted in Analytics, Big Data Applications, Big Data Architecture, Data Management, Databases and Analytics, DBMS, High-Performance Computing, ISTC for Big Data Blog, Storage, Streaming Big Data, Visualizing Big Data | Tagged , , , , , , , , , , , , , , , , , , , | Leave a comment

Interface Sharing between Data Storage and Analytics

By Jack Dongarra, Piotr Luszczek and Thomas Herault of the University of Tennessee Innovative Computing Laboratory It is trite to say that traditional RDBMS optimize the data movement by bringing the query close to the data and not the other way around. … Continue reading

Posted in Analytics, Big Data Architecture, ISTC for Big Data Blog, Math and Algorithms, Storage | Tagged , , , , | Leave a comment

ISTC to Unveil New Big Data Federation Architecture at VLDB 2015

At the upcoming Very Large Data Bases 2015 Conference in Hawaii, the Intel Science and Technology Center for Big Data will unveil its first big “capstone” project: a federated reference architecture that enables query processing over multiple databases, where each … Continue reading

Posted in Big Data Architecture, Data Management, Databases and Analytics, DBMS, ISTC for Big Data Blog, Polystores, Storage, Visualizing Big Data | Tagged , , , , , , , | Leave a comment

Let’s Talk About Storage & Recovery Methods for Non-Volatile Memory Database Systems

By Joy Arulraj and Andrew Pavlo, Carnegie Mellon; and Subramanya Dulloor, Intel Labs In a joint collaboration between Carnegie Mellon University and Intel Labs, we explore the changes required in future database management systems to fully leverage the unique set of characteristics of non-volatile memory (NVM) technologies. … Continue reading

Posted in Big Data Architecture, Data Management, DBMS, ISTC for Big Data Blog, Storage, Streaming Big Data | Tagged , , , , , , | Leave a comment