Tag Archives: Data Management

NVMRocks: RocksDB on Non-Volatile Memory Systems

By Jianhong Li (CMU), Andrew Pavlo (CMU), and Siying Dong (Facebook) Non-volatile memory (NVM) has been a game-changing memory technology. In contrast to traditional block-based durable storage devices, it provides low latency comparable to DRAM and byte-addressability. Although NVM is … Continue reading

Posted in Big Data Architecture, DBMS, ISTC for Big Data Blog, Storage | Tagged , , , , , | 2 Comments

ISTC for Big Data 2016 Research Highlights

In 2016, ISTC for Big Data principal investigators, researchers and their students continued to break down the barriers to data analytics at scale, with creative new approaches and infrastructure software. Developments are being integrated into BigDAWG, the next-generation polystore architecture … Continue reading

Posted in Big Data Architecture, Data Management, ISTC for Big Data Blog, Polystores | Tagged , , , , , , , , , , , | Leave a comment

Polystore Databases to be Examined at IEEE, CIDR Conferences

Polystores, a more-modern approach to sharing heterogeneous data that addresses Big Data’s volume, variety and velocity demands, will be the topic of discussion at two upcoming conferences: The first IEEE Workshop on Methods to Manage Heterogeneous Big Data and Polystore Databases, … Continue reading

Posted in Big Data Applications, Big Data Architecture, Data Management, ISTC for Big Data Blog, Polystores, Tools for Big Data | Tagged , , , , , | Leave a comment

PipeGen: A Data Pipe Generator for Hybrid Analytics

By Brandon Haynes, Alvin Cheung, and Magdalena Balazinska, University of Washington As the number of big data management systems continues to grow, users increasingly seek to leverage multiple systems in the context of a single data analysis task. A critical challenge … Continue reading

Posted in Big Data Applications, Big Data Architecture, Data Management, ISTC for Big Data Blog, Polystores | Tagged , , , , , | Leave a comment

Genomics Data, Analytics and the Future of Climate Change

By Vijay Gadepally, MIT CSAIL, in collaboration with the Chisholm Laboratory at MIT Meet Prochlorococcus marinus, a marine cyanobacterium that’s intricately linked to the global carbon cycle, widely present in seawater, and possibly holds secrets to future climate change. These … Continue reading

Posted in Big Data Applications, Big Data Architecture, Data Management, Databases and Analytics, DBMS, Graph Computation, ISTC for Big Data Blog, Polystores, Streaming Big Data, Tools for Big Data, Visualizing Big Data | Tagged , , , , , , , , | Leave a comment

Simplifying and Scaling Data Discovery

By Raul Castro Fernandez, MIT CSAIL People who need access to data for their jobs are spending more and more time searching for data of interest to the task at hand. This is particularly true for data-driven companies, where the … Continue reading

Posted in Data Management, ISTC for Big Data Blog, Query Engines | Tagged , , , | Leave a comment

Rethinking Streaming: Correct State Matters!

by Nesime Tatbul, Intel Labs and MIT CSAIL; Kristin Tufte, Portland State University; and Stan Zdonik, Brown University Stream processing has largely been thought of as real-time analytics. Data enters the system as streams and analytic functions (aggregates) are computed on the … Continue reading

Posted in Big Data Applications, DBMS, ISTC for Big Data Blog, Streaming Big Data | Tagged , , , , , , , , , | Leave a comment

Larger-than-Memory Data Management on Modern Storage Hardware for In-Memory OLTP Database Systems

By Lin Ma, Carnegie Mellon University; Joy Arulraj, Carnegie Mellon University; Sam Zhao, Brown University; Andrew Pavlo, Carnegie Mellon University; Subramanya R. Dulloor, Intel Labs; Michael J. Giardino, Georgia Institute of Technology; Jeff Parkhurst, Jason L. Gardner, Kshitij Doshi, Intel Labs; and Col. Stanley Zdonik, Brown … Continue reading

Posted in Big Data Architecture, Data Management, DBMS, ISTC for Big Data Blog, Storage | Tagged , , , , , | Leave a comment

Compiling Queries for High-Performance Computing

By Brandon Myers and Bill Howe, University of Washington High performance computing (HPC) is traditionally about compute, but its users have data management problems, too. In a recent paper, we demonstrated a promising technique for bringing ad hoc query processing into … Continue reading

Posted in Big Data Applications, Data Management, High-Performance Computing, ISTC for Big Data Blog | Tagged , , , , , | Leave a comment

TileDB and GenomicsDB Now Available in Open Source

by Stavros Papadopoulos, Intel Parallel Computing Lab The ISTC for Big Data has recently released TileDB, a novel efficient data management system for scientific data, such as graphs, DNA sequences, matrices, maps, and imaging. We have also released an adaptation of TileDB … Continue reading

Posted in Big Data Applications, Data Management, DBMS, High-Performance Computing, ISTC for Big Data Blog | Tagged , , , , , , , | Leave a comment