
September 7, 2016
ActiveClean: a tool that uses machine learning to clean dirty data in big data sets
AMPLab researchers Sanjay Krishnan, Prof. Michael Franklin, Prof. Ken Goldberg, Eugene Wu, and Jiannan Wang have developed ActiveClean, a system that uses machine learning to improve the cleaning of dirty data by analyzing a user’s prediction model to decide which mistakes to…