The Big Data ISTC: A Retrospection by Michael Stonebraker, Samuel Madden and Timothy Mattson

The Big Data ISTC is a research project sponsored by Intel that ran for five years (August 2012 to August 2017). This blog post highlights some of the accomplishments and lessons learned during this period. Big data is usually categorized into …

Posted in Analytics, Benchmarks, Big Data Applications, Big Data Architecture, Data Management, Databases and Analytics, DBMS, ISTC for Big Data Blog, Polystores, Query Engines, Storage, Streaming Big Data, Tools for Big Data, Visualizing Big Data

In-Database Analytics for Large Array Data

By Jack Dongarra, Piotr Luszczek and Thomas Herault of the University of Tennessee, Knoxville

Performing analytics inside a database is becoming increasingly important. In the context of SciDB, the data model involves arrays that are either fully populated (dense) or contain empty entries (sparse). Very …

Posted in Big Data Architecture, DBMS, High-Performance Computing, ISTC for Big Data Blog, Math and Algorithms

ISTC Researchers to Present Papers at VLDB 2017 Conference

ISTC for Big Data principal investigators, researchers, and their students will present a number of papers at the 2017 International Conference on Very Large Data Bases (VLDB 2017) in Munich, Germany, August 28-September 1, 2017. (To read any of the papers, …

Posted in Analytics, Big Data Architecture, Data Management, Databases and Analytics, DBMS, ISTC for Big Data Blog, Visualizing Big Data

VisualCloud: A DBMS for Virtual Reality

By Brandon Haynes, Artem Minyaylov, Magdalena Balazinska, Luis Ceze and Alvin Cheung of the University of Washington

Our ability to collect videos en masse can revolutionize how we interact with the world by enabling powerful virtual reality (VR) video applications …

Posted in Big Data Applications, Data Management, DBMS, ISTC for Big Data Blog, Streaming Big Data

ISTC Releases Open Source Code for S-Store Transactional Streaming System

By John Meehan and Stan Zdonik, Brown University, and Nesime Tatbul, Intel Labs and MIT

Today, the ISTC for Big Data released the first version of our S-Store transactional stream processing system. S-Store is open-source software and available for download …

Posted in Big Data Applications, Big Data Architecture, DBMS, ISTC for Big Data Blog, Polystores, Streaming Big Data

Solving the “One Concurrency Control Does Not Fit All” Problem for OLTP Databases

By Dixin Tang and Aaron J. Elmore, University of Chicago

In this post, we present a new transactional database system that adaptively changes data organization and concurrency control protocols in response to workload changes. With the increasing memory sizes of …

Posted in Big Data Architecture, Data Management, DBMS, ISTC for Big Data Blog

ISTC Releases Open Source Code for BigDAWG Polystore System

By Dr. Tim Mattson, Intel, and Dr. Vijay Gadepally and Kyle O’Brien, MIT Lincoln Laboratory

Today, the ISTC for Big Data released the first version of BigDAWG, our polystore system for simplifying integration and analytics of disparate data at scale. BigDAWG is …

Posted in Analytics, Big Data Architecture, Data Management, Databases and Analytics, DBMS, ISTC for Big Data Blog, Polystores, Query Engines, Storage

ISTC Researchers Present Work at NEDB Day 2017

ISTC for Big Data principal investigators, researchers and their students presented work at North East Database Day 2017, held at MIT’s Stata Center in Cambridge, Mass., on January 27, 2017. Microsoft and Facebook sponsored the event. The 9th Annual North East …

Posted in Big Data Architecture, Data Management, Databases and Analytics, DBMS, ISTC for Big Data Blog, Storage, Streaming Big Data, Visualizing Big Data

NVMRocks: RocksDB on Non-Volatile Memory Systems

By Jianhong Li (CMU), Andrew Pavlo (CMU), and Siying Dong (Facebook)

Non-volatile memory (NVM) has been a game-changing memory technology. In contrast to traditional block-based durable storage devices, it provides low latency comparable to DRAM and byte-addressability. Although NVM is …

Posted in Big Data Architecture, DBMS, ISTC for Big Data Blog, Storage

Write-Behind Logging

By Joy Arulraj, Matthew Perron, and Andrew Pavlo, Carnegie Mellon University

In a collaboration between Carnegie Mellon University and Intel Labs, we explore the changes required in the logging and recovery algorithms in non-volatile memory database management systems (DBMSs). The results of this work …

Posted in Big Data Architecture, Data Management, DBMS, ISTC for Big Data Blog, Math and Algorithms, Storage