Database on New Hardware
Solid State Drive (SSD)
- Manos Athanassoulis, Shimin Chen, Anastasia Ailamaki, Phillip B. Gibbons, Radu Stoica: Online Updates on Data Warehouses via Judicious Use of Solid-State Storage. ACM Trans. Database Syst. 40(1): 6 (2015)
- Prashanth Menon, Tilmann Rabl, Mohammad Sadoghi, Hans-Arno Jacobsen: CaSSanDra: An SSD boosted key-value store. ICDE 2014: 1162-1167
Multicore CPU
- Jana Giceva, Gustavo Alonso, Timothy Roscoe, Tim Harris: Deployment of Query Plans on Multicores. PVLDB 8(3): 233-244 (2014)
GPU
- Kai Zhang, Kaibo Wang, Yuan Yuan, Lei Guo, Rubao Lee, Xiaodong Zhang: Mega-KV: A Case for GPUs to Maximize the Throughput of In-Memory Key-Value Stores. PVLDB 8(11): 1226-1237 (2015)
- Max Heimel, Martin Kiefer, Volker Markl: Self-Tuning, GPU-Accelerated Kernel Density Models for Multidimensional Selectivity Estimation. SIGMOD Conference 2015: 1477-1492
Kenneth S. Bøgh, Sean Chester, Ira Assent: Work-Efficient Parallel Skyline Computation for the GPU. PVLDB 8(9): 962-973 (2015)Assigned to Vasileios Zois
Non-volatile RAM (NVRAM)
- Joy Arulraj, Andrew Pavlo, Subramanya Dulloor: Let's Talk About Storage & Recovery Methods for Non-Volatile Memory Database Systems. SIGMOD Conference 2015: 707-722
- Steven Pelley, Thomas F. Wenisch, Brian T. Gold, Bill Bridge: Storage Management in the NVRAM Era. PVLDB 7(2): 121-132 (2013)
- Andreas Chatzistergiou, Marcelo Cintra, Stratis D. Viglas: REWIND: Recovery Write-Ahead System for In-Memory Non-Volatile Data-Structures. PVLDB 8(5): 497-508 (2015)
- Gihwan Oh, Sangchul Kim, Sang-Won Lee, Bongki Moon: SQLite Optimization with Phase Change Memory for Mobile Applications. PVLDB 8(12): 1454-1465 (2015)
Main-memory
- Darko Makreshanski, Georgios Giannikis, Gustavo Alonso, Donald Kossmann: MQJoin: Efficient Shared Execution of Main-Memory Joins. PVLDB 9(6): 480-491 (2016)
- Pedro Pedreira, Chris Croswhite, Luis Bona: Cubrick: Indexing Millions of Records per Second for Interactive Analytics. PVLDB 9(13): 1305-1316 (2016)
- Hao Zhang, Gang Chen, Beng Chin Ooi, Kian-Lee Tan, Meihui Zhang: In-Memory Big Data Management and Processing: A Survey. IEEE Trans. Knowl. Data Eng. 27(7): 1920-1948 (2015)
Hardware Transactional Memory (HTM)
- Darko Makreshanski, Justin J. Levandoski, Ryan Stutsman: To Lock, Swap, or Elide: On the Interplay of Hardware Transactional Memory and Lock-Free Indexing. PVLDB 8(11): 298-1309 (2015)
Non-uniform Memory Access (NUMA)
- Viktor Leis, Peter A. Boncz, Alfons Kemper, Thomas Neumann: Morsel-driven parallelism: a NUMA-aware query evaluation framework for the many-core age. SIGMOD Conference 2014: 743-754
- Iraklis Psaroudakis, Tobias Scheuer, Norman May, Abdelkader Sellami, Anastasia Ailamaki: Scaling Up Concurrent Main-Memory Column-Store Scans: Towards Adaptive NUMA-aware Data and Task Placement. PVLDB 8(12): 1442-1453 (2015)
In-storage computation
Insoon Jo, Duck-Ho Bae, Andre S. Yoon, Jeong-Uk Kang, Sangyeun Cho, Daniel D. G. Lee, Jaeheon Jeong: YourSQL: A High-Performance Database System Leveraging In-Storage Computing. PVLDB 9(12): 924-935 (2016)Assigned to Payas Rajan
FPGA
Ildar Absalyamov, Prerna Budhkar, Skyler Windh, Robert J. Halstead, Walid A. Najjar, Vassilis J. Tsotras: FPGA-accelerated group-by aggregation using synchronizing caches. DaMoN 2016: 11:1-11:9Assigned to Vasileios Zois
Big Data Management
Big Data Indexing
- Sarath Lakshman, Sriram Melkote, John Liang, Ravi Mayuram: Nitro: A Fast, Scalable In-Memory Storage Engine for NoSQL Global Secondary Index. PVLDB 9(13): 1413-1424 (2016)
- Peng Lu, Gang Chen, Beng Chin Ooi, Hoang Tam Vo, Sai Wu: ScalaGiST: Scalable Generalized Search Trees for MapReduce Systems . PVLDB 7(14): 1797-1808 (2014)
Streaming Data
- Matei Zaharia, Tathagata Das, Haoyuan Li, Timothy Hunter, Scott Shenker, Ion Stoica: Discretized streams: fault-tolerant streaming computation at scale. SOSP 2013: 423-438
Fault Tolerance
- Jorge-Arnulfo Quiané-Ruiz, Christoph Pinkel, Jörg Schad, Jens Dittrich: RAFTing MapReduce: Fast recovery on the RAFT. ICDE 2011: 589-600
- Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave, Justin Ma, Murphy McCauly, Michael J. Franklin, Scott Shenker, Ion Stoica: Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing. NSDI 2012: 15-28
Task Execution
- Botong Huang, Nicholas W. D. Jarrett, Shivnath Babu, Sayan Mukherjee, Jun Yang: Cumulon: Matrix-Based Data Analytics in the Cloud with Spot Instances. PVLDB 9(3): 156-167 (2015)
Query Optimization
Jack Chen, Samir Jindel, Robert Walzer, Rajkumar Sen, Nika Jimsheleishvilli, Michael Andrews: The MemSQL Query Optimizer: A modern optimizer for real-time analytics in a distributed database. PVLDB 9(13): 1401-1412 (2016)Assigned to Christina Pavlopoulou- Konstantinos Kloudas, Rodrigo Rodrigues, Nuno M. Preguiça, Margarida Mamede: PIXIDA: Optimizing Data Parallel Jobs in Wide-Area Data Analytics. PVLDB 9(2): 72-83 (2015)
Provenance in Distributed Execution
Matteo Interlandi, Kshitij Shah, Sai Deep Tetali, Muhammad Ali Gulzar, Seunghyun Yoo, Miryung Kim, Todd D. Millstein, Tyson Condie: Titian: Data Provenance Support in Spark. PVLDB 9(3): 216-227 (2015)Assigned to Zacharias Chasparis
Comparative Execution
- Jennie Duggan, Aaron J. Elmore, Michael Stonebraker, Magdalena Balazinska, Bill Howe, Jeremy Kepner, Sam Madden, David Maier, Tim Mattson, Stanley B. Zdonik: The BigDAWG Polystore System. SIGMOD Record 44(2): 11-16 (2015)
- Juwei Shi, Yunjie Qiu, Umar Farooq Minhas, Limei Jiao, Chen Wang, Berthold Reinwald, Fatma Özcan: Clash of the Titans: MapReduce vs. Spark for Large Scale Data Analytics. PVLDB 8(13): 2110-2121 (2015)
Big Data Management System (BDMS)
- Sattam Alsubaiee, Yasser Altowim, Hotham Altwaijry, Alexander Behm, Vinayak R. Borkar, Yingyi Bu, Michael J. Carey, Inci Cetindil, Madhusudan Cheelangi, Khurram Faraaz, Eugenia Gabrielova, Raman Grover, Zachary Heilbron, Young-Seok Kim, Chen Li, Guangqiang Li, Ji Mahn Ok, Nicola Onose, Pouria Pirzadeh, Vassilis J. Tsotras, Rares Vernica, Jian Wen, Till Westmann: AsterixDB: A Scalable, Open Source BDMS. PVLDB 7(14): 1905-1916 (2014)
- Sattam Alsubaiee, Alexander Behm, Vinayak R. Borkar, Zachary Heilbron, Young-Seok Kim, Michael J. Carey, Markus Dreseler, Chen Li: Storage Management in AsterixDB. PVLDB 7(10): 841-852 (2014)
E. Preston Carman Jr., Till Westmann, Vinayak R. Borkar, Michael J. Carey, Vassilis J. Tsotras: A scalable parallel XQuery processor. Big Data 2015: 164-173Assigned to Christina Pavlopoulou
Big Graphs
- Joseph E. Gonzalez, Reynold S. Xin, Ankur Dave, Daniel Crankshaw, Michael J. Franklin, Ion Stoica: GraphX: Graph Processing in a Distributed Dataflow Framework. OSDI 2014: 599-613
Grzegorz Malewicz, Matthew H. Austern, Aart J. C. Bik, James C. Dehnert, Ilan Horn, Naty Leiser, Grzegorz Czajkowski: Pregel: a system for large-scale graph processing. SIGMOD Conference 2010: 135-146Assigned to Xiu Zhang
Database for Emerging Applications
Social Networks
John Krumm, Eric Horvitz: Eyewitness: identifying local events via space-time signals in twitter feeds. SIGSPATIAL/GIS 2015: 20:1-20:10Assigned to Yi WuHongzhi Yin, Zhiting Hu, Xiaofang Zhou, Hao Wang, Kai Zheng, Nguyen Quoc Viet Hung, Shazia Wasim Sadiq: Discovering interpretable geo-social communities for user behavior prediction. ICDE 2016: 942-953Assigned to Xiu Zhang
Graph Processing
- Angen Zheng, Alexandros Labrinidis, Panos K. Chrysanthis: Planar: Parallel lightweight architecture-aware adaptive graph repartitioning. ICDE 2016: 121-132
RDF/SPARQL
- Nikolaos Papailiou, Dimitrios Tsoumakos, Panagiotis Karras, Nectarios Koziris: Graph-Aware, Workload-Adaptive SPARQL Query Caching. SIGMOD Conference 2015: 1777-1792
- Gunes Aluc, M. Tamer Özsu, Khuzaima Daudjee: Workload Matters: Why RDF Databases Need a New Design. PVLDB 7(10): 837-840 (2014)
Data cleaning
- Xu Chu, John Morcos, Ihab F. Ilyas, Mourad Ouzzani, Paolo Papotti, Nan Tang, Yin Ye: KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing. SIGMOD Conference 2015: 1247-1261
- Michele Dallachiesa, Amr Ebaid, Ahmed Eldawy, Ahmed K. Elmagarmid, Ihab F. Ilyas, Mourad Ouzzani, Nan Tang: NADEEF: a commodity data cleaning system. SIGMOD Conference 2013: 541-552
Ziawasch Abedjan, John Morcos, Ihab F. Ilyas, Mourad Ouzzani, Paolo Papotti, Michael Stonebraker: DataXFormer: A robust transformation discovery system. ICDE 2016: 1134-1145Assigned to Zacharias Chasparis
Crowdsourcing
- Daniel Haas, Jiannan Wang, Eugene Wu, Michael J. Franklin: CLAMShell: Speeding up Crowds for Low-latency Data Labeling. PVLDB 9(4): 372-383 (2015)
- Ju Fan, Guoliang Li, Beng Chin Ooi, Kian-Lee Tan, Jianhua Feng: iCrowd: An Adaptive Crowdsourcing Framework. SIGMOD Conference 2015: 1015-1030
- Sibo Wang, Xiaokui Xiao, Chun-Hee Lee: Crowd-Based Deduplication: An Adaptive Approach. SIGMOD Conference 2015: 1263-1277
Visualization
Ahmed Eldawy, Mohamed F. Mokbel, Christopher Jonathan: HadoopViz: A MapReduce framework for extensible visualization of big spatial data. ICDE 2016: 601-612Assigned to Saheli Ghosh- Yongjoo Park, Michael J. Cafarella, Barzan Mozafari: Visualization-aware sampling for very large databases. ICDE 2016: 755-766
Machine Learning
- Edward Ma, Vishrut Gupta, Meichun Hsu, Indrajit Roy: dmapply: A functional primitive to express distributed machine learning algorithms in R. PVLDB 9(13): 1293-1304 (2016
- Matthias Boehm, Michael Dusenberry, Deron Eriksson, Alexandre V. Evfimievski, Faraz Makari Manshadi, Niketan Pansare, Berthold Reinwald, Frederick Reiss, Prithviraj Sen, Arvind Surve, Shirish Tatikonda: SystemML: Declarative Machine Learning on Spark. PVLDB 9(13): 1425-1436 (2016)
- Jaeho Shin, Sen Wu, Feiran Wang, Christopher De Sa, Ce Zhang, Christopher Ré: Incremental Knowledge Base Construction Using DeepDive. PVLDB 8(11): 1310-1321 (2015)
- U. Kang, Charalampos E. Tsourakakis, Christos Faloutsos: PEGASUS: A Peta-Scale Graph Mining System. ICDM 2009: 229-238
Privacy/Secutiry
Zhao Chang, Dong Xie, Feifei Li: Oblivious RAM: A Dissection and Experimental Evaluation. PVLDB 9(12): 1113-1124 (2016)Assigned to Minying Meng
Recommendation
- Aneesh Sharma, Jerry Jiang, Praveen Bommannavar, Brian Larson, Jimmy J. Lin: GraphJet: Real-Time Content Recommendations at Twitter. PVLDB 9(13): 1281-1292 (2016)
Key-value stores
- Dipti Shankar, Xiaoyi Lu, Md. Wasi-ur-Rahman, Nusrat S. Islam, Dhabaleswar K. Panda: Benchmarking key-value stores on high-performance storage and interconnects for web-scale workloads. Big Data 2015: 539-544
Human Interaction with Database Systems
- Arnab Nandi, Lilong Jiang, Michael Mandel: Gestural Query Specification. PVLDB 7(4): 289-300 (2013)
- Fei Li, H. V. Jagadish: Understanding Natural Language Queries over Relational Databases. SIGMOD Record 45(1): 6-13 (2016)
Spatial and Spatio-temporal Data
Indexing
- Lu Wang, Robert Christensen, Feifei Li, Ke Yi: Spatial Online Sampling and Aggregation. PVLDB 9(3): 84-95 (2015)
- Ying Lu, Cyrus Shahabi, Seon Ho Kim: Efficient indexing and retrieval of large-scale geo-tagged video databases. GeoInformatica 20(4): 829-857 (2016)
- Abdeltawab M. Hendawi, Jie Bao, Mohamed F. Mokbel, Mohamed H. Ali: Predictive tree: An efficient index for predictive queries on road networks. ICDE 2015: 1215-1226
Selectivity Estimation
Ning An, Zhen-Yu Yang, Anand Sivasubramaniam: Selectivity Estimation for Spatial Joins. ICDE 2001: 368-375Assigned to Saheli Ghosh- Xiaoyang Wang, Ying Zhang, Wenjie Zhang, Xuemin Lin, Wei Wang: Selectivity Estimation on Streaming Spatio-Textual Data Using Local Correlations. PVLDB 8(2): 101-112 (2014)
Road Networks
Shangfu Peng, Hanan Samet: Analytical queries on road networks: an experimental evaluation of two system architectures. SIGSPATIAL/GIS 2015: 1:1-1:10Assigned to Payas RajanAbdeltawab M. Hendawi, Amruta Khot, Aqeel Rustum, Anas Basalamah, Ankur Teredesai, Mohamed H. Ali: COMA: Road Network Compression for Map-Matching. MDM (1) 2015: 104-109Assigned to Minying Meng
Big Spatial Data
- Ricardo Fernandes, Piotr Zaczkowski, Bernd Göttler, Conor Ettinoffe, Anis Moussa: TrafficDB: HERE's High Performance Shared-Memory Data Store. PVLDB 9(13): 1365-1376 (2016)
Ahmed Eldawy, Mohamed F. Mokbel: SpatialHadoop: A MapReduce framework for spatial data. ICDE 2015: 1352-1363Assigned to Tin VuAhmed M. Aly, Ahmed R. Mahmood, Mohamed S. Hassan, Walid G. Aref, Mourad Ouzzani, Hazem Elmeleegy, Thamir Qadah: AQWA: Adaptive Query-Workload-Aware Partitioning of Big Spatial Data. PVLDB 8(13): 2062-2073 (2015)Assigned to Tin VuDong Xie, Feifei Li, Bin Yao, Gefei Li, Liang Zhou, Minyi Guo: Simba: Efficient In-Memory Spatial Analytics. SIGMOD Conference 2016: 1071-1085Assigned to Andres Calderon Romero
Joins
- Suprio Ray, Bogdan Simion, Angela Demke Brown, Ryan Johnson: Skew-resistant parallel in-memory spatial join. SSDBM 2014: 6:1-6:12
- Farhan Tauheed, Thomas Heinis, Anastasia Ailamaki: THERMAL-JOIN: A Scalable Spatial Join for Dynamic Workloads. SIGMOD Conference 2015: 939-950
Volunteer Geographic Information (VGI)
- Dingxiong Deng, Cyrus Shahabi, Ugur Demiryurek, Linhong Zhu: Task selection in spatial crowdsourcing from worker's perspective. GeoInformatica 20(3): 529-568 (2016)
Ridesharing
Bin Cao, Louai Alarabi, Mohamed F. Mokbel, Anas Basalamah: SHAREK: A Scalable Dynamic Ride Sharing System. MDM (1) 2015: 4-13Assigned to Yi Wu
Computational Geometry (CG)
Ahmed Eldawy, Yuan Li, Mohamed F. Mokbel, Ravi Janardan: CG_Hadoop: computational geometry in MapReduce. SIGSPATIAL/GIS 2013: 284-293Assigned to Andres Calderon Romero- Samuel Audet, Cecilia Albertsson, Masana Murase, Akihiro Asahara: Robust and efficient polygon overlay on parallel stream processors. SIGSPATIAL/GIS 2013: 294-303
Spatial Keyword Queries
- Taesung Lee, Jin-Woo Park, Sanghoon Lee, Seung-won Hwang, Sameh Elnikety, Yuxiong He: Processing and Optimizing Main Memory Spatial-Keyword Queries. PVLDB 9(3): 132-143 (2015)
Indoor Environment
- Heba Aly, Moustafa Youssef: Dejavu: an accurate energy-efficient outdoor localization system. SIGSPATIAL/GIS 2013: 154-163
- Kenneth Fuglsang Christensen, Lasse Linnerup Christiansen, Torben Bach Pedersen, Jeppe Pihl: Searchlight: Context-aware predictive Continuous Querying of moving objects in symbolic space. ICDE 2015: 687-698
Spatial Data Mining
- Reem Y. Ali, Venkata M. V. Gunturi, Andrew J. Kotz, Shashi Shekhar, William F. Northrop: Discovering Non-compliant Window Co-Occurrence Patterns: A Summary of Results. SSTD 2015: 391-410
- Luan Tran, Liyue Fan, Cyrus Shahabi: Distance-based Outlier Detection in Data Streams. PVLDB 9(12): 1089-1100 (2016)