Database on New Hardware

Solid State Drive (SSD)

  • Manos Athanassoulis, Shimin Chen, Anastasia Ailamaki, Phillip B. Gibbons, Radu Stoica: Online Updates on Data Warehouses via Judicious Use of Solid-State Storage. ACM Trans. Database Syst. 40(1): 6 (2015)
  • Prashanth Menon, Tilmann Rabl, Mohammad Sadoghi, Hans-Arno Jacobsen: CaSSanDra: An SSD boosted key-value store. ICDE 2014: 1162-1167

Multicore CPU

  • Jana Giceva, Gustavo Alonso, Timothy Roscoe, Tim Harris: Deployment of Query Plans on Multicores. PVLDB 8(3): 233-244 (2014)

GPU

  • Kai Zhang, Kaibo Wang, Yuan Yuan, Lei Guo, Rubao Lee, Xiaodong Zhang: Mega-KV: A Case for GPUs to Maximize the Throughput of In-Memory Key-Value Stores. PVLDB 8(11): 1226-1237 (2015)
  • Max Heimel, Martin Kiefer, Volker Markl: Self-Tuning, GPU-Accelerated Kernel Density Models for Multidimensional Selectivity Estimation. SIGMOD Conference 2015: 1477-1492
  • Kenneth S. Bøgh, Sean Chester, Ira Assent: Work-Efficient Parallel Skyline Computation for the GPU. PVLDB 8(9): 962-973 (2015) Assigned to Vasileios Zois

Non-volatile RAM (NVRAM)

  • Joy Arulraj, Andrew Pavlo, Subramanya Dulloor: Let's Talk About Storage & Recovery Methods for Non-Volatile Memory Database Systems. SIGMOD Conference 2015: 707-722
  • Steven Pelley, Thomas F. Wenisch, Brian T. Gold, Bill Bridge: Storage Management in the NVRAM Era. PVLDB 7(2): 121-132 (2013)
  • Andreas Chatzistergiou, Marcelo Cintra, Stratis D. Viglas: REWIND: Recovery Write-Ahead System for In-Memory Non-Volatile Data-Structures. PVLDB 8(5): 497-508 (2015)
  • Gihwan Oh, Sangchul Kim, Sang-Won Lee, Bongki Moon: SQLite Optimization with Phase Change Memory for Mobile Applications. PVLDB 8(12): 1454-1465 (2015)

Main-memory

  • Darko Makreshanski, Georgios Giannikis, Gustavo Alonso, Donald Kossmann: MQJoin: Efficient Shared Execution of Main-Memory Joins. PVLDB 9(6): 480-491 (2016)
  • Pedro Pedreira, Chris Croswhite, Luis Bona: Cubrick: Indexing Millions of Records per Second for Interactive Analytics. PVLDB 9(13): 1305-1316 (2016)
  • Hao Zhang, Gang Chen, Beng Chin Ooi, Kian-Lee Tan, Meihui Zhang: In-Memory Big Data Management and Processing: A Survey. IEEE Trans. Knowl. Data Eng. 27(7): 1920-1948 (2015)

Hardware Transactional Memory (HTM)

  • Darko Makreshanski, Justin J. Levandoski, Ryan Stutsman: To Lock, Swap, or Elide: On the Interplay of Hardware Transactional Memory and Lock-Free Indexing. PVLDB 8(11): 1298-1309 (2015)

Non-uniform Memory Access (NUMA)

  • Viktor Leis, Peter A. Boncz, Alfons Kemper, Thomas Neumann: Morsel-driven parallelism: a NUMA-aware query evaluation framework for the many-core age. SIGMOD Conference 2014: 743-754
  • Iraklis Psaroudakis, Tobias Scheuer, Norman May, Abdelkader Sellami, Anastasia Ailamaki: Scaling Up Concurrent Main-Memory Column-Store Scans: Towards Adaptive NUMA-aware Data and Task Placement. PVLDB 8(12): 1442-1453 (2015)

In-storage computation

  • Insoon Jo, Duck-Ho Bae, Andre S. Yoon, Jeong-Uk Kang, Sangyeun Cho, Daniel D. G. Lee, Jaeheon Jeong: YourSQL: A High-Performance Database System Leveraging In-Storage Computing. PVLDB 9(12): 924-935 (2016) Assigned to Payas Rajan

FPGA

  • Ildar Absalyamov, Prerna Budhkar, Skyler Windh, Robert J. Halstead, Walid A. Najjar, Vassilis J. Tsotras: FPGA-accelerated group-by aggregation using synchronizing caches. DaMoN 2016: 11:1-11:9 Assigned to Vasileios Zois

Big Data Management

Big Data Indexing

  • Sarath Lakshman, Sriram Melkote, John Liang, Ravi Mayuram: Nitro: A Fast, Scalable In-Memory Storage Engine for NoSQL Global Secondary Index. PVLDB 9(13): 1413-1424 (2016)
  • Peng Lu, Gang Chen, Beng Chin Ooi, Hoang Tam Vo, Sai Wu: ScalaGiST: Scalable Generalized Search Trees for MapReduce Systems. PVLDB 7(14): 1797-1808 (2014)

Streaming Data

  • Matei Zaharia, Tathagata Das, Haoyuan Li, Timothy Hunter, Scott Shenker, Ion Stoica: Discretized streams: fault-tolerant streaming computation at scale. SOSP 2013: 423-438

Fault Tolerance

  • Jorge-Arnulfo Quiané-Ruiz, Christoph Pinkel, Jörg Schad, Jens Dittrich: RAFTing MapReduce: Fast recovery on the RAFT. ICDE 2011: 589-600
  • Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave, Justin Ma, Murphy McCauly, Michael J. Franklin, Scott Shenker, Ion Stoica: Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing. NSDI 2012: 15-28

Task Execution

  • Botong Huang, Nicholas W. D. Jarrett, Shivnath Babu, Sayan Mukherjee, Jun Yang: Cumulon: Matrix-Based Data Analytics in the Cloud with Spot Instances. PVLDB 9(3): 156-167 (2015)

Query Optimization

  • Jack Chen, Samir Jindel, Robert Walzer, Rajkumar Sen, Nika Jimsheleishvilli, Michael Andrews: The MemSQL Query Optimizer: A modern optimizer for real-time analytics in a distributed database. PVLDB 9(13): 1401-1412 (2016) Assigned to Christina Pavlopoulou
  • Konstantinos Kloudas, Rodrigo Rodrigues, Nuno M. Preguiça, Margarida Mamede: PIXIDA: Optimizing Data Parallel Jobs in Wide-Area Data Analytics. PVLDB 9(2): 72-83 (2015)

Provenance in Distributed Execution

  • Matteo Interlandi, Kshitij Shah, Sai Deep Tetali, Muhammad Ali Gulzar, Seunghyun Yoo, Miryung Kim, Todd D. Millstein, Tyson Condie: Titian: Data Provenance Support in Spark. PVLDB 9(3): 216-227 (2015) Assigned to Zacharias Chasparis

Comparative Execution

  • Jennie Duggan, Aaron J. Elmore, Michael Stonebraker, Magdalena Balazinska, Bill Howe, Jeremy Kepner, Sam Madden, David Maier, Tim Mattson, Stanley B. Zdonik: The BigDAWG Polystore System. SIGMOD Record 44(2): 11-16 (2015)
  • Juwei Shi, Yunjie Qiu, Umar Farooq Minhas, Limei Jiao, Chen Wang, Berthold Reinwald, Fatma Özcan: Clash of the Titans: MapReduce vs. Spark for Large Scale Data Analytics. PVLDB 8(13): 2110-2121 (2015)

Big Data Management System (BDMS)

  • Sattam Alsubaiee, Yasser Altowim, Hotham Altwaijry, Alexander Behm, Vinayak R. Borkar, Yingyi Bu, Michael J. Carey, Inci Cetindil, Madhusudan Cheelangi, Khurram Faraaz, Eugenia Gabrielova, Raman Grover, Zachary Heilbron, Young-Seok Kim, Chen Li, Guangqiang Li, Ji Mahn Ok, Nicola Onose, Pouria Pirzadeh, Vassilis J. Tsotras, Rares Vernica, Jian Wen, Till Westmann: AsterixDB: A Scalable, Open Source BDMS. PVLDB 7(14): 1905-1916 (2014)
  • Sattam Alsubaiee, Alexander Behm, Vinayak R. Borkar, Zachary Heilbron, Young-Seok Kim, Michael J. Carey, Markus Dreseler, Chen Li: Storage Management in AsterixDB. PVLDB 7(10): 841-852 (2014)
  • E. Preston Carman Jr., Till Westmann, Vinayak R. Borkar, Michael J. Carey, Vassilis J. Tsotras: A scalable parallel XQuery processor. Big Data 2015: 164-173 Assigned to Christina Pavlopoulou

Big Graphs

  • Joseph E. Gonzalez, Reynold S. Xin, Ankur Dave, Daniel Crankshaw, Michael J. Franklin, Ion Stoica: GraphX: Graph Processing in a Distributed Dataflow Framework. OSDI 2014: 599-613
  • Grzegorz Malewicz, Matthew H. Austern, Aart J. C. Bik, James C. Dehnert, Ilan Horn, Naty Leiser, Grzegorz Czajkowski: Pregel: a system for large-scale graph processing. SIGMOD Conference 2010: 135-146 Assigned to Xiu Zhang

Database for Emerging Applications

Social Networks

  • John Krumm, Eric Horvitz: Eyewitness: identifying local events via space-time signals in twitter feeds. SIGSPATIAL/GIS 2015: 20:1-20:10 Assigned to Yi Wu
  • Hongzhi Yin, Zhiting Hu, Xiaofang Zhou, Hao Wang, Kai Zheng, Nguyen Quoc Viet Hung, Shazia Wasim Sadiq: Discovering interpretable geo-social communities for user behavior prediction. ICDE 2016: 942-953 Assigned to Xiu Zhang

Graph Processing

  • Angen Zheng, Alexandros Labrinidis, Panos K. Chrysanthis: Planar: Parallel lightweight architecture-aware adaptive graph repartitioning. ICDE 2016: 121-132

RDF/SPARQL

  • Nikolaos Papailiou, Dimitrios Tsoumakos, Panagiotis Karras, Nectarios Koziris: Graph-Aware, Workload-Adaptive SPARQL Query Caching. SIGMOD Conference 2015: 1777-1792
  • Gunes Aluc, M. Tamer Özsu, Khuzaima Daudjee: Workload Matters: Why RDF Databases Need a New Design. PVLDB 7(10): 837-840 (2014)

Data cleaning

  • Xu Chu, John Morcos, Ihab F. Ilyas, Mourad Ouzzani, Paolo Papotti, Nan Tang, Yin Ye: KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing. SIGMOD Conference 2015: 1247-1261
  • Michele Dallachiesa, Amr Ebaid, Ahmed Eldawy, Ahmed K. Elmagarmid, Ihab F. Ilyas, Mourad Ouzzani, Nan Tang: NADEEF: a commodity data cleaning system. SIGMOD Conference 2013: 541-552
  • Ziawasch Abedjan, John Morcos, Ihab F. Ilyas, Mourad Ouzzani, Paolo Papotti, Michael Stonebraker: DataXFormer: A robust transformation discovery system. ICDE 2016: 1134-1145 Assigned to Zacharias Chasparis

Crowdsourcing

  • Daniel Haas, Jiannan Wang, Eugene Wu, Michael J. Franklin: CLAMShell: Speeding up Crowds for Low-latency Data Labeling. PVLDB 9(4): 372-383 (2015)
  • Ju Fan, Guoliang Li, Beng Chin Ooi, Kian-Lee Tan, Jianhua Feng: iCrowd: An Adaptive Crowdsourcing Framework. SIGMOD Conference 2015: 1015-1030
  • Sibo Wang, Xiaokui Xiao, Chun-Hee Lee: Crowd-Based Deduplication: An Adaptive Approach. SIGMOD Conference 2015: 1263-1277

Visualization

  • Ahmed Eldawy, Mohamed F. Mokbel, Christopher Jonathan: HadoopViz: A MapReduce framework for extensible visualization of big spatial data. ICDE 2016: 601-612 Assigned to Saheli Ghosh
  • Yongjoo Park, Michael J. Cafarella, Barzan Mozafari: Visualization-aware sampling for very large databases. ICDE 2016: 755-766

Machine Learning

  • Edward Ma, Vishrut Gupta, Meichun Hsu, Indrajit Roy: dmapply: A functional primitive to express distributed machine learning algorithms in R. PVLDB 9(13): 1293-1304 (2016)
  • Matthias Boehm, Michael Dusenberry, Deron Eriksson, Alexandre V. Evfimievski, Faraz Makari Manshadi, Niketan Pansare, Berthold Reinwald, Frederick Reiss, Prithviraj Sen, Arvind Surve, Shirish Tatikonda: SystemML: Declarative Machine Learning on Spark. PVLDB 9(13): 1425-1436 (2016)
  • Jaeho Shin, Sen Wu, Feiran Wang, Christopher De Sa, Ce Zhang, Christopher Ré: Incremental Knowledge Base Construction Using DeepDive. PVLDB 8(11): 1310-1321 (2015)
  • U. Kang, Charalampos E. Tsourakakis, Christos Faloutsos: PEGASUS: A Peta-Scale Graph Mining System. ICDM 2009: 229-238

Privacy/Security

  • Zhao Chang, Dong Xie, Feifei Li: Oblivious RAM: A Dissection and Experimental Evaluation. PVLDB 9(12): 1113-1124 (2016) Assigned to Minying Meng

Recommendation

  • Aneesh Sharma, Jerry Jiang, Praveen Bommannavar, Brian Larson, Jimmy J. Lin: GraphJet: Real-Time Content Recommendations at Twitter. PVLDB 9(13): 1281-1292 (2016)

Key-value stores

  • Dipti Shankar, Xiaoyi Lu, Md. Wasi-ur-Rahman, Nusrat S. Islam, Dhabaleswar K. Panda: Benchmarking key-value stores on high-performance storage and interconnects for web-scale workloads. Big Data 2015: 539-544

Human Interaction with Database Systems

  • Arnab Nandi, Lilong Jiang, Michael Mandel: Gestural Query Specification. PVLDB 7(4): 289-300 (2013)
  • Fei Li, H. V. Jagadish: Understanding Natural Language Queries over Relational Databases. SIGMOD Record 45(1): 6-13 (2016)

Spatial and Spatio-temporal Data

Indexing

  • Lu Wang, Robert Christensen, Feifei Li, Ke Yi: Spatial Online Sampling and Aggregation. PVLDB 9(3): 84-95 (2015)
  • Ying Lu, Cyrus Shahabi, Seon Ho Kim: Efficient indexing and retrieval of large-scale geo-tagged video databases. GeoInformatica 20(4): 829-857 (2016)
  • Abdeltawab M. Hendawi, Jie Bao, Mohamed F. Mokbel, Mohamed H. Ali: Predictive tree: An efficient index for predictive queries on road networks. ICDE 2015: 1215-1226

Selectivity Estimation

  • Ning An, Zhen-Yu Yang, Anand Sivasubramaniam: Selectivity Estimation for Spatial Joins. ICDE 2001: 368-375 Assigned to Saheli Ghosh
  • Xiaoyang Wang, Ying Zhang, Wenjie Zhang, Xuemin Lin, Wei Wang: Selectivity Estimation on Streaming Spatio-Textual Data Using Local Correlations. PVLDB 8(2): 101-112 (2014)

Road Networks

  • Shangfu Peng, Hanan Samet: Analytical queries on road networks: an experimental evaluation of two system architectures. SIGSPATIAL/GIS 2015: 1:1-1:10 Assigned to Payas Rajan
  • Abdeltawab M. Hendawi, Amruta Khot, Aqeel Rustum, Anas Basalamah, Ankur Teredesai, Mohamed H. Ali: COMA: Road Network Compression for Map-Matching. MDM (1) 2015: 104-109 Assigned to Minying Meng

Big Spatial Data

  • Ricardo Fernandes, Piotr Zaczkowski, Bernd Göttler, Conor Ettinoffe, Anis Moussa: TrafficDB: HERE's High Performance Shared-Memory Data Store. PVLDB 9(13): 1365-1376 (2016)
  • Ahmed Eldawy, Mohamed F. Mokbel: SpatialHadoop: A MapReduce framework for spatial data. ICDE 2015: 1352-1363 Assigned to Tin Vu
  • Ahmed M. Aly, Ahmed R. Mahmood, Mohamed S. Hassan, Walid G. Aref, Mourad Ouzzani, Hazem Elmeleegy, Thamir Qadah: AQWA: Adaptive Query-Workload-Aware Partitioning of Big Spatial Data. PVLDB 8(13): 2062-2073 (2015) Assigned to Tin Vu
  • Dong Xie, Feifei Li, Bin Yao, Gefei Li, Liang Zhou, Minyi Guo: Simba: Efficient In-Memory Spatial Analytics. SIGMOD Conference 2016: 1071-1085 Assigned to Andres Calderon Romero

Joins

  • Suprio Ray, Bogdan Simion, Angela Demke Brown, Ryan Johnson: Skew-resistant parallel in-memory spatial join. SSDBM 2014: 6:1-6:12
  • Farhan Tauheed, Thomas Heinis, Anastasia Ailamaki: THERMAL-JOIN: A Scalable Spatial Join for Dynamic Workloads. SIGMOD Conference 2015: 939-950

Volunteered Geographic Information (VGI)

  • Dingxiong Deng, Cyrus Shahabi, Ugur Demiryurek, Linhong Zhu: Task selection in spatial crowdsourcing from worker's perspective. GeoInformatica 20(3): 529-568 (2016)

Ridesharing

  • Bin Cao, Louai Alarabi, Mohamed F. Mokbel, Anas Basalamah: SHAREK: A Scalable Dynamic Ride Sharing System. MDM (1) 2015: 4-13 Assigned to Yi Wu

Computational Geometry (CG)

  • Ahmed Eldawy, Yuan Li, Mohamed F. Mokbel, Ravi Janardan: CG_Hadoop: computational geometry in MapReduce. SIGSPATIAL/GIS 2013: 284-293 Assigned to Andres Calderon Romero
  • Samuel Audet, Cecilia Albertsson, Masana Murase, Akihiro Asahara: Robust and efficient polygon overlay on parallel stream processors. SIGSPATIAL/GIS 2013: 294-303

Spatial Keyword Queries

  • Taesung Lee, Jin-Woo Park, Sanghoon Lee, Seung-won Hwang, Sameh Elnikety, Yuxiong He: Processing and Optimizing Main Memory Spatial-Keyword Queries. PVLDB 9(3): 132-143 (2015)

Indoor Environment

  • Heba Aly, Moustafa Youssef: Dejavu: an accurate energy-efficient outdoor localization system. SIGSPATIAL/GIS 2013: 154-163
  • Kenneth Fuglsang Christensen, Lasse Linnerup Christiansen, Torben Bach Pedersen, Jeppe Pihl: Searchlight: Context-aware predictive Continuous Querying of moving objects in symbolic space. ICDE 2015: 687-698

Spatial Data Mining

  • Reem Y. Ali, Venkata M. V. Gunturi, Andrew J. Kotz, Shashi Shekhar, William F. Northrop: Discovering Non-compliant Window Co-Occurrence Patterns: A Summary of Results. SSTD 2015: 391-410
  • Luan Tran, Liyue Fan, Cyrus Shahabi: Distance-based Outlier Detection in Data Streams. PVLDB 9(12): 1089-1100 (2016)