Query Processing with People and Machines: CrowdDB
The challenge of "Big Data" analytics is not just data size -- it's about issues such as diversity, ambiguity, and incompleteness in both queries and the underlying data.
While advances in scalable data processing are helping to address the data size problem, there remain many important data-centric tasks where humans are more proficient than current state-of-the art algorithms. Crowdsourcing has emerged as a major problem-solving and data-gathering paradigm that provides the ability to leverage human intelligence and activity at large scale. Emerging popular crowdsourcing platforms have programmatic interfaces (APIs) that provide the opportunity to create hybrid human/computer systems for data-intensive applications. An ongoing effort to better understand the development of such hybrid computation systems, the CrowdDB project uses human input via crowdsourcing to process queries that neither database systems nor search engines can adequately answer. While CrowdDB leverages many aspects of traditional database systems, there are also important differences from both an implementation and conceptual perspective.
In this talk, I'll share an overview of CrowdDB (built with colleagues at ETH Zurich and developed as part of the U.C. Berkeley AMPLab) and developing hybrid human/machine query processing systems. I'll also share an overview of the broader AMPLab research agenda. AMPLab's research is supported in part by 18 leading technology companies, including founding sponsors Google and SAP.
Speaker: Michael Franklin, AMPLab, UC Berkeley & Truviso
Thursday, 05/03/12
Contact:
Website: Click to VisitCost:
FreeSave this Event:
iCalendarGoogle Calendar
Yahoo! Calendar
Windows Live Calendar
PARC Forum
Palo Alto Research Center, George E. Pake Auditorium
Palo Alto, CA 94304
USA
Phone: 650-812-4000
Website: Click to Visit
