Updated on Dec 02, 2014
1
Papers on Crowdsourced Query Processing
2
Note: if you find some interesting papers missing in the list, please fill out this form. As a contributor, your name will be shown below.
3
Contributors: Jiannan Wang, Xu Chu
4
AuthorsTitleVenueYearKeywords
5
Barzan Mozafari, Purnamrita Sarkar, Michael J. Franklin, Michael I. Jordan, Samuel MaddenScaling Up Crowd-Sourcing to Very Large Datasets: A Case for Active LearningPVLDB2015Active learning
6
Jiannan Wang, Sanjay Krishnan, Michael J. Franklin, Ken Goldberg, Tim Kraska, Tova MiloA sample-and-clean framework for fast and accurate query processing on dirty dataSIGMOD2014Data cleaning
7
Chaitanya Gokhale, Sanjib Das, AnHai Doan, Jeffrey F. Naughton, Narasimhan Rampalli, Jude W. Shavlik, Xiaojin ZhuCorleone: hands-off crowdsourcing for entity matchingSIGMOD2014Entity resolution
8
Hyunjung Park, Jennifer WidomCrowdFill: collecting structured data from the crowdSIGMOD2014Data collection
9
Chen Jason Zhang, Ziyuan Zhao, Lei Chen, H. V. Jagadish, Caleb Chen CaoCrowdMatcher: crowd-assisted schema matchingSIGMOD2014Schema matching (demo)
10
Aditya G. Parameswaran, Ming Han Teh, Hector Garcia-Molina, Jennifer WidomDataSift: a crowd-powered search toolkitSIGMOD2014Information retrieval (demo)
11
Yael Amsterdamer, Susan B. Davidson, Tova Milo, Slava Novgorodov, Amit SomechOASSIS: query driven crowd miningSIGMOD2014Data mining
12
Chong Sun, Narasimhan Rampalli, Frank Yang, AnHai DoanChimera: Large-Scale Classification using Machine Learning, Rules, and CrowdsourcingPVLDB2014Classification
13
Norases Vesdapunt, Kedar Bellare, Nilesh N. DalviCrowdsourcing Algorithms for Entity ResolutionPVLDB2014Entity resolution
14
Yael Amsterdamer, Susan B. Davidson, Tova Milo, Slava Novgorodov, Amit SomechOntology Assisted Crowd MiningPVLDB2014Data mining
15
Aditya G. Parameswaran, Stephen Boyd, Hector Garcia-Molina, Ashish Gupta, Neoklis Polyzotis, Jennifer WidomOptimal Crowd-Powered Rating and Filtering AlgorithmsPVLDB2014Filtering
16
Fu Fan, Meiyu Lu, Beng Chin Ooi, Wang-Chiew Tan, Meihui ZhangA hybrid machine-crowdsourcing system for matching web tablesICDE2014Schema matching
17
Leihao Xia, Caleb Chen Cao, Lei Chen, Zhao ChenC-DMr: Crowd-powered Decision Maker for real world Knapsack ProblemsICDE2014Knapsack problem
18
Sarath Kumar Kondreddi, Peter Triantafillou, Gerhard WeikumCombining information extraction and human computing for crowdsourced knowledge acquisitionICDE2014Knowledge base
19
Anish Das Sarma, Aditya G. Parameswaran, Hector Garcia-Molina, Alon Y. HalevyCrowd-powered find algorithmsICDE2014Data collection
20
Yongxin Tong, Caleb Chen Cao, Chen Jason Zhang, Yatao Li, Lei ChenCrowdCleaner: Data cleaning for multi-version data on the web via crowdsourcingICDE2014Data cleaning
21
Han Su, Kai Zheng, Jiamin Huang, Hoyoung Jeung, Lei Chen, Xiaofang ZhouCrowdPlanner: A crowd-based route recommendation systemICDE2014Recommendation (demo)
22
Jinyang Gao, Xuan Liu, Beng Chin Ooi, Haixun Wang, Gang ChenAn online cost sensitive decision-making method in crowdsourcing systemsSIGMOD2013Human v.s. machine decision
23
Yael Amsterdamer, Yael Grossman, Tova Milo, Pierre SenellartCrowd miningSIGMOD2013Data mining
24
Jiannan Wang, Guoliang Li, Tim Kraska, Michael J. Franklin, Jianhua FengLeveraging transitive relations for crowdsourced joinsSIGMOD2013Entity resolution
25
Haim Kaplan, Ilia Lotosh, Tova Milo, Slava NovgorodovAnswering Planning Queries with the CrowdPVLDB2013Question selection
26
Yael Amsterdamer, Yael Grossman, Tova Milo, Pierre SenellartCrowdMiner: Mining association rules from the crowdPVLDB2013Data mining
27
Hyunjung Park, Jennifer WidomQuery Optimization over Crowdsourced DataPVLDB2013Query optimization
28
Steven Euijong Whang, Peter Lofgren, Hector Garcia-MolinaQuestion Selection for Crowd Entity ResolutionPVLDB2013Entity resolution
29
Susan B. Davidson, Sanjeev Khanna, Tova Milo, Sudeepa RoyUsing the crowd for top-k and group-by queriesICDT2013Top-k and group-by queries
30
Beth Trushkowsky, Tim Kraska, Michael J. Franklin, Purnamrita SarkarCrowdsourced enumeration queriesICDE2013Data collection
31
Sean Louis Goldberg, Daisy Zhe Wang, Tim KraskaCASTLE: Crowd-Assisted System for Text Labeling and ExtractionHCOMP2013Information extraction
32
Aditya G. Parameswaran, Ming Han Teh, Hector Garcia-Molina, Jennifer WidomDataSift: An Expressive and Accurate Crowd-Powered Search ToolkitHCOMP2013Information retrieval
33
Hannes Heikinheimo, Antti UkkonenThe Crowd-Median AlgorithmHCOMP2013Median query
34
Shawn R. Jeffery, Liwen Sun, Matt DeLand, Nick Pendar, Rick Barber, Andrew GaldiArnold: Declarative Crowd-Machine Data IntegrationCIDR2013Data integration and cleaning
35
Michael Stonebraker, Daniel Bruckner, Ihab F. Ilyas, George Beskales, Mitch Cherniack, Stanley B. Zdonik, Alexander Pagan, Shan XuData Curation at Scale: The Data Tamer SystemCIDR2013Data integration and cleaning
36
Petros Venetis, Hector Garcia-Molina, Kerui Huang, Neoklis PolyzotisMax algorithms in crowdsourcing environmentsWWW2012Max query
37
Aditya G. Parameswaran, Hector Garcia-Molina, Hyunjung Park, Neoklis Polyzotis, Aditya Ramesh, Jennifer WidomCrowdScreen: algorithms for filtering data with humansSIGMOD2012Filtering
38
Chris Van Pelt, Alex SorokinDesigning a scalable crowdsourcing platformSIGMOD2012CrowdFlower
39
Stephen Guo, Aditya G. Parameswaran, Hector Garcia-MolinaSo who won?: dynamic max discovery with the crowdSIGMOD2012Max query
40
Xuan Liu, Meiyu Lu, Beng Chin Ooi, Yanyan Shen, Sai Wu, Meihui ZhangCDAS: A Crowdsourcing Data Analytics SystemPVLDB2012CDAS system (quality model)
41
Adam Marcus, David R. Karger, Samuel Madden, Rob Miller, Sewoong OhCounting with the CrowdPVLDB2012Selectivity estimation
42
Jiannan Wang, Tim Kraska, Michael J. Franklin, Jianhua FengCrowdER: Crowdsourcing Entity ResolutionPVLDB2012Entity resolution
43
Hyunjung Park, Richard Pang, Aditya G. Parameswaran, Hector Garcia-Molina, Neoklis Polyzotis, Jennifer WidomDeco: A System for Declarative CrowdsourcingPVLDB2012Deco system (demo)
44
Joachim Selke, Christoph Lofi, Wolf-Tilo BalkePushing the Boundaries of Crowd-enabled Databases with Query-driven Schema ExpansionPVLDB2012Schema expansion
45
Aditya G. Parameswaran, Hyunjung Park, Hector Garcia-Molina, Neoklis Polyzotis, Jennifer WidomDeco: declarative crowdsourcingCIKM2012Deco system
46
Michael J. Franklin, Donald Kossmann, Tim Kraska, Sukriti Ramesh, Reynold XinCrowdDB: answering queries with crowdsourcingSIGMOD2011CrowdDB system
47
Adam Marcus, Eugene Wu, David R. Karger, Samuel Madden, Robert C. MillerDemonstration of Qurk: a query processor for humanoperatorsSIGMOD2011Qurk system (demo)
48
Amber Feng, Michael J. Franklin, Donald Kossmann, Tim Kraska, Samuel Madden, Sukriti Ramesh, Andrew Wang, Reynold XinCrowdDB: Query Processing with the VLDB CrowdPVLDB2011CrowdDB system (demo)
49
Aditya G. Parameswaran, Anish Das Sarma, Hector Garcia-Molina, Neoklis Polyzotis, Jennifer WidomHuman-assisted graph search: it's okay to ask questionsPVLDB2011Graph search
50
Adam Marcus, Eugene Wu, David R. Karger, Samuel Madden, Robert C. MillerHuman-powered Sorts and JoinsPVLDB2011Sort and join queries
51
Aditya G. Parameswaran, Neoklis PolyzotisAnswering Queries using Humans, Algorithms and DatabasesCIDR2011Vision paper
52
Adam Marcus, Eugene Wu, Samuel Madden, Robert C. MillerCrowdsourced Databases: Query Processing with PeopleCIDR2011Qurk system
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239