Tag Archives: Data Sets

ISTC Releases Open Source Code for S-Store Transactional Streaming System

By John Meehan and Stan Zdonik, Brown University & Nesime Tatbul, Intel Labs and MIT Today, the ISTC for Big Data released the first version of our S-Store transactional stream processing system. S-Store is open-source software and available for download … Continue reading

Posted in Big Data Applications, Big Data Architecture, DBMS, ISTC for Big Data Blog, Polystores, Streaming Big Data | Tagged , , , , , , , , , , | Leave a comment

Improving Clinical Decision-Making with Big Data

An Interview with Peter Szolovits, MIT CSAIL Doctors, nurses and other healthcare professionals have always had to “read” and respond quickly to often-imperfect data under stressful circumstances. What has changed over time is the volume and types of data that … Continue reading

Posted in Big Data Applications, ISTC for Big Data Blog | Tagged , , , , , , | Leave a comment

ISTC Releases Open Source Code for BigDAWG Polystore System

By Dr. Tim Mattson, Intel and Dr. Vijay Gadepally and Kyle O’Brien, MIT Lincoln Laboratory Today, the ISTC for Big Data released the first version of BigDAWG, our polystore system for simplifying integration and analytics of disparate data at scale. BigDAWG is … Continue reading

Posted in Analytics, Big Data Architecture, Data Management, Databases and Analytics, DBMS, ISTC for Big Data Blog, Polystores, Query Engines, Storage | Tagged , , , , , , , , | Leave a comment

Analytic Monitoring for the Internet of Things

By Peter Bailis, Stanford Infolab, and Sam Madden, MIT CSAIL An increasing proportion of data today is generated by automated processes, sensors and devices—collectively called the Internet of Things (IoT).   Inexpensive hardware, widespread access to communication networks, and decreased … Continue reading

Posted in Analytics, Big Data Applications, Big Data Architecture, Databases and Analytics, ISTC for Big Data Blog, Streaming Big Data | Tagged , , , | Leave a comment

Urban Analytics for Smart Cities: Connecting Data to People

By Kristin Tufte, Portland State University “A Smart City is one where data and technology improve people’s lives.¹” Governments, NGOs and academic researchers are looking to data and analytics to create more livable cities. Ideas and innovation are flowering. The … Continue reading

Posted in Analytics, Big Data Applications, ISTC for Big Data Blog | Tagged , , , , , , | Leave a comment

Rethinking Streaming: Correct State Matters!

by Nesime Tatbul, Intel Labs and MIT CSAIL; Kristin Tufte, Portland State University; and Stan Zdonik, Brown University Stream processing has largely been thought of as real-time analytics. Data enters the system as streams and analytic functions (aggregates) are computed on the … Continue reading

Posted in Big Data Applications, DBMS, ISTC for Big Data Blog, Streaming Big Data | Tagged , , , , , , , , , | Leave a comment

Compiling Queries for High-Performance Computing

By Brandon Myers and Bill Howe, University of Washington High performance computing (HPC) is traditionally about compute, but its users have data management problems, too. In a recent paper, we demonstrated a promising technique for bringing ad hoc query processing into … Continue reading

Posted in Big Data Applications, Data Management, High-Performance Computing, ISTC for Big Data Blog | Tagged , , , , , | Leave a comment

TileDB and GenomicsDB Now Available in Open Source

by Stavros Papadopoulos, Intel Parallel Computing Lab The ISTC for Big Data has recently released TileDB, a novel efficient data management system for scientific data, such as graphs, DNA sequences, matrices, maps, and imaging. We have also released an adaptation of TileDB … Continue reading

Posted in Big Data Applications, Data Management, DBMS, High-Performance Computing, ISTC for Big Data Blog | Tagged , , , , , , , | Leave a comment

ForeCache: Raising the Bar in Big Data Visual Exploration

By Leilani Battle, MIT CSAIL In many discussions with scientists across a variety of specialties, we have found that interactive visualizations are important tools for helping people make sense of massive amounts of data. In particular, interactive visualizations are critical … Continue reading

Posted in Analytics, Big Data Applications, ISTC for Big Data Blog, Tools for Big Data, Visualizing Big Data | Tagged , , , , , , | Leave a comment

Decibel: Dataset Branching for Collaborative Data Management

by Michael Maddox, MIT CSAIL, and Aaron J. Elmore, University of Chicago* In Big Data’s wake has come demand for tools to curate, manage and analyze shared datasets collaboratively. For instance, consider researchers in a social media company concurrently working … Continue reading

Posted in Benchmarks, Data Management, Databases and Analytics, ISTC for Big Data Blog | Tagged , , , , , , , | Leave a comment