Tag Archives: Hadoop

Tupleware: An Inside Look

By Eugene Wu, PhD Candidate, MIT CSAIL At the recent ISTC for Big Data annual Research Retreat, Tim Kraska of Brown University presented an in-depth look at Tupleware, an in-memory, read-only distributed computation framework that supports multiple languages. Typical “cloud” … Continue reading

Posted in Analytics, Big Data Applications, Big Data Architecture, Computer Architecture, ISTC for Big Data Blog, Tools for Big Data | Tagged , , , , , , , | Leave a comment

Tupleware: Redefining Modern Analytics

By Andrew Crotty, Alex Galakatos, and Tim Kraska, Brown University The increasing prevalence of big data across all industries and sciences is causing a profound shift in the nature and scope of analytics. Increasingly complex computations, ranging from machine learning to predictive … Continue reading

Posted in Analytics, Big Data Applications, Big Data Architecture, Data Management, Databases and Analytics, DBMS, ISTC for Big Data Blog | Tagged , , , , | Leave a comment

Making Big Data Management Easier

Today at the O’Reilly Strata Conference in Santa Clara, Calif., ISTC Researcher Magdalena Balazinska of the University of Washington is presenting a talk entitled, “Can We Make Big Data Management Easier?” This post summarizes her talk. It highlights her team’s … Continue reading

Posted in Big Data Architecture, Data Management, DBMS, ISTC for Big Data Blog | Tagged , , | Leave a comment

VLDB 2013: ISTC Faculty Members to Present Keynote and Five Papers

ISTC for Big Data faculty members and their students will present five papers at the 39th International Conference on Very Large Data Bases, August 26 to 30, 2013, in Riva del Garda, Trento, Italy. In addition, ISTC for Big Data … Continue reading

Posted in Analytics, Big Data Architecture, ISTC for Big Data Blog, Query Engines, Tools for Big Data, Visualizing Big Data | Tagged , , , , , , | Leave a comment

BlinkDB: A Massively Parallel Query Engine for Big Data

By Barzan Mozafari, University of Michigan Ann Arbor* Today, if you have a few terabytes of data stored on your disk, even calculating a simple average can take up to a few days. Of course parallelism helps, but not too … Continue reading

Posted in Big Data Architecture, Databases and Analytics, DBMS, ISTC for Big Data Blog, Query Engines, Tools for Big Data | Tagged , , , | Leave a comment

Myria: Big Data Management as a Service

By Magdalena Balazinska, University of Washington Over the past year, the University of Washington Database Group has developed a new engine for managing Big Data. The system, called Myria, has been tested on 100-node Amazon EC2 deployments and on data from … Continue reading

Posted in Big Data Architecture, Data Management, DBMS, ISTC for Big Data Blog | Tagged , , , | Leave a comment

No Hadoop: The Future of the Hadoop/HDFS Stack

by Michael Stonebraker, MIT CSAIL There has been a collection of recent announcements about DBMS capabilities in the so-called Hadoop stack.  To be clear, this is a three-tier architecture with HDFS (a file system) at the bottom, Hadoop (the open source … Continue reading

Posted in Big Data Architecture, Databases and Analytics, DBMS, ISTC for Big Data Blog | Tagged , , | Leave a comment