Tag Archives: Data Management

Rethinking Streaming: Correct State Matters!

by Nesime Tatbul, Intel Labs and MIT CSAIL; Kristin Tufte, Portland State University; and Stan Zdonik, Brown University Stream processing has largely been thought of as real-time analytics. Data enters the system as streams and analytic functions (aggregates) are computed on the … Continue reading

Posted in Big Data Applications, DBMS, ISTC for Big Data Blog, Streaming Big Data | Tagged , , , , , , , , , | Leave a comment

Larger-than-Memory Data Management on Modern Storage Hardware for In-Memory OLTP Database Systems

By Lin Ma, Carnegie Mellon University; Joy Arulraj, Carnegie Mellon University; Sam Zhao, Brown University; Andrew Pavlo, Carnegie Mellon University; Subramanya R. Dulloor, Intel Labs; Michael J. Giardino, Georgia Institute of Technology; Jeff Parkhurst, Jason L. Gardner, Kshitij Doshi, Intel Labs; and Col. Stanley Zdonik, Brown … Continue reading

Posted in Big Data Architecture, Data Management, DBMS, ISTC for Big Data Blog, Storage | Tagged , , , , , | Leave a comment

Compiling Queries for High-Performance Computing

By Brandon Myers and Bill Howe, University of Washington High performance computing (HPC) is traditionally about compute, but its users have data management problems, too. In a recent paper, we demonstrated a promising technique for bringing ad hoc query processing into … Continue reading

Posted in Big Data Applications, Data Management, High-Performance Computing, ISTC for Big Data Blog | Tagged , , , , , | Leave a comment

TileDB and GenomicsDB Now Available in Open Source

by Stavros Papadopoulos, Intel Parallel Computing Lab The ISTC for Big Data has recently released TileDB, a novel efficient data management system for scientific data, such as graphs, DNA sequences, matrices, maps, and imaging. We have also released an adaptation of TileDB … Continue reading

Posted in Big Data Applications, Data Management, DBMS, High-Performance Computing, ISTC for Big Data Blog | Tagged , , , , , , , | Leave a comment

ForeCache: Raising the Bar in Big Data Visual Exploration

By Leilani Battle, MIT CSAIL In many discussions with scientists across a variety of specialties, we have found that interactive visualizations are important tools for helping people make sense of massive amounts of data. In particular, interactive visualizations are critical … Continue reading

Posted in Analytics, Big Data Applications, ISTC for Big Data Blog, Tools for Big Data, Visualizing Big Data | Tagged , , , , , , | Leave a comment

Guaranteeing Query Runtimes for Analytics-as-a-Service

By Jennifer Ortiz and Magdalena Balazinska, University of Washington A variety of data analytics systems are available as cloud services today, including Amazon Elastic MapReduce (EMR), Redshift and Azure’s HDInsight. With these services, users have access to compute clusters that come … Continue reading

Posted in Analytics, Data Management, ISTC for Big Data Blog, Tools for Big Data | Tagged , , , , , | Leave a comment

Decibel: Dataset Branching for Collaborative Data Management

by Michael Maddox, MIT CSAIL, and Aaron J. Elmore, University of Chicago* In Big Data’s wake has come demand for tools to curate, manage and analyze shared datasets collaboratively. For instance, consider researchers in a social media company concurrently working … Continue reading

Posted in Benchmarks, Data Management, Databases and Analytics, ISTC for Big Data Blog | Tagged , , , , , , , | Leave a comment

Just-in-time Data Transformation and Migration in Polystores

by Aaron Elmore and Adam Dziedzic, University of Chicago Organizations face managing and deriving value from an ever-growing amount of data. Beyond its size, this data is often varied in both structure (e.g., relational data, linked data, numerical data and streaming data) … Continue reading

Posted in Analytics, Big Data Architecture, Data Management, Databases and Analytics, DBMS, Graph Computation, ISTC for Big Data Blog, Polystores, Query Engines, Streaming Big Data | Tagged , , , , , , , , , | Leave a comment

2015: Momentum, Moments and Memories

Greetings of the season from the Intel Science and Technology Center for Big Data.  As 2015 comes to a close, we thought we would share some moments and memories that were captured here in the ISTC for Big Data blog … Continue reading

Posted in Analytics, Big Data Applications, Big Data Architecture, Data Management, Databases and Analytics, DBMS, High-Performance Computing, ISTC for Big Data Blog, Storage, Streaming Big Data, Visualizing Big Data | Tagged , , , , , , , , , , , , , , , , , , , | Leave a comment

Pushing the Boundaries of Visual Interactive Analytics

  As the volume, variety and velocity of data grow, data analysts struggle with asking and answering big questions of the data – even with the availability of increasingly sophisticated data visualization tools. It takes far too long for analysts … Continue reading

Posted in Analytics, Big Data Applications, ISTC for Big Data Blog, Visualizing Big Data | Tagged , , , , | Leave a comment