Tag Archives: Data Science

ModelDB: A System for Managing Machine Learning Models

By Manasi Vartak, Harihar Subramanyam, Wei-En Lee, Srinidhi Viswanathan, Saadiyah Husnoo, Sam Madden and Matei Zaharia, MIT CSAIL

Building real-world machine learning (ML) algorithms is an iterative process. A data scientist will build tens to hundreds of models before arriving … Continue reading

Posted in Big Data Applications, Big Data Architecture, ISTC for Big Data Blog, Tools for Big Data | 4 Comments

Decibel: Dataset Branching for Collaborative Data Management

By Michael Maddox, MIT CSAIL, and Aaron J. Elmore, University of Chicago

In Big Data’s wake has come demand for tools to curate, manage and analyze shared datasets collaboratively. For instance, consider researchers in a social media company concurrently working … Continue reading

Posted in Benchmarks, Data Management, Databases and Analytics, ISTC for Big Data Blog | Leave a comment