別紙3 データストア関連論文リスト 著者 出典 タイトル Benjamin H.Sigelman, Luiz Barroso, 1 Mike Burrows, Pat Stephenson, Manoj 発表年略称 Google Dapper, a Large-Scale Distributed Systems Technical Tracing Infrastructure Report PODC Pregel: A System for Large-Scale Graph Processing Plakal, Donald Beaver, Saul Jaspan, Grzegorz Malewicz, Matthew H. 2 Austern, Aart J.C. Bik, James C. Dehnert, Ilan Horn, Naty Leiser, Fay Chang, Jeffrey Dean, Sanjay Bigtable: A Distributed Storage System for 3 Ghemawat, Wilson C. Hsieh, Deborah A. Structured Data. Wallach, Mike Burrows, Tushar Philip Bernstein, Colin Reid, Sudipto Hyder - A Transactional Record Manager 4 Das ACM Trans. Comput. CIDR for Shared Flash Jason Baker, Chris Bond, James Megastore: Providing Scalable, Highly Available Storage for Interactive Services James Larson, Jean-Michel Leon, Yawei 5 Corbett, JJ Furman, Andrey Khorlin, Carlo Curino, Evan Jones, Raluca Popa, 6 Nirmesh Malviya, Eugene Wu, Sam 7 8 9 10 11 12 13 14 15 Madden, Har Balakrishnan, Nickolai John Ousterhout, Parag Agrawal, David Erickson, Christos Kozyrakis, Jacob Leverich, David Mazi?res, Subhasish Peter Alvaro, Tyson Condie, Neil Conway, Khaled Elmeleegy, Joseph M. Hellerstein, Russell Sears Alysson Bessani, Miguel Correia, Bruno Quaresma, Fernando Andre, Paulo Sousa Vinayak R. Borkar, Michael J. Carey, Raman Grover, Nicola Onose, Rares Vernica Roger S. Barga, Yogesh L. Simmhan, Eran Chinthaka, Satya Sanket Sahoo, Jared Jackson, Nelson Araujo Haryadi S. Gunawi, Abhishek Rajimwale, Andrea C. Arpaci-Dusseau, and Remzi H. Arpaci-Dusseau Ashok Anand and Sayandeep Sen,Andrew Krioukov, Florentina Popovici, Aditya Akella, Andrea ArpaciDaniel Ford, Francois Labelle, Florentina I. Popovici, Murray Stokely, Van-Anh Truong, Luiz Barroso, Carrie Pradeep Kumar Gunda, Lenin Ravindranath, Chandramohan A. Thekkath, Yuan Yu, and Li Zhuang, Daniel Peng and Frank Dabek, 16 Relational Cloud: a Database Service for the cloud The case for RAMCloud. Boom analytics: exploring data-centric, declarative programming for the cloud. DepSky: Dependable and Secure Storage in a Cloud-of-Clouds Hyracks: A flexible and extensible foundation for data-intensive computing. Provenance for Scientific Workflows Towards Reproducible Research. SQCK: A Declarative File System Checker Avoiding File System Micromanagement with Range Writes Availability in Globally Distributed Storage Systems Nectar: Automatic Management of Data and Computation in Datacenters Large-scale Incremental Processing Using Distributed Transactions and Notifications Prince Mahajan, Srinath Setty, Sangmin 17 Lee, Allen Clement, Lorenzo Alvisi, Mike Depot: Cloud Storage with Minimal Trust Dahlin, and Michael Walfish, Doug Beaver, Sanjeev Kumar, Harry C. 18 Li, Jason Sobel, and Peter Vajgel, Johnson, R., Pandis, I., Hardavellas, N., 19 Ailamaki, A. and Falsafi, B. Finding a Needle in Haystack: Facebook's Photo Storage Shore-MT: a scalable storage manager for the multicore era. R. Ikeda and J. Widom. 20 Panda: A System for Provenance and Data. 1 CIDR CIDR CACM EuroSys EuroSys ICDE IEEE Data Eng. Bull. OSDI OSDI OSDI OSDI OSDI OSDI OSDI EDBT TaPP '10 2010 Dapper 2009 Pregel 2008 BigTabl e 2011 Hyder 2011 Megasto re 2011 Relation al Cloud 2011 RamClo ud 2010 Boom 2011 DepSky 2011 Hyracks 2010 Trident 2008 SQCK 2008 RangeW rite data 2010 avilabili ty 2010 Nectar 2010 Percolat or 2010 Depot 2010 Haystac k 2009 ShoreM T 2010 Panda 別紙3 データストア関連論文リスト Asit Mishra, Joseph L Hellerstein, 21 Walfredo Cirne Daniel Abadi and Samuel Madden 22 Daniel Abadi, Samuel Madden, and 23 Nabil Hachem Stavros Harizopoulos, Daniel Abadi, 24 Samuel Madden, and Michael Towards Characterizing Cloud Backend Workloads: Insights from Google Compute Clusters Compression in Column Oriented Databases Column-Stores vs. Row-Stores: How Different Are They Really? OLTP Through the Looking Glass, And What We Found There Stonebraker Parag Agrawal, Adam Silberstein, Brian Asynchronous view maintenance for VLSD 25 F. Cooper, Utkarsh Srivastava, Raghu databases. Ramakrishnan Evan Jones, Daniel Abadi, and Samuel Low Overhead Concurrency Control for 26 Madden SIGMETRI Google 2009 CS Perf. Metrics Eval. C-storeSIGMOD 2006 compres s SIGMOD 2008 C-store2 SIGMOD SIGMOD SIGMOD Partitioned Main Memory Databases Donald Kossmann, Tim Kraska, Simon 27 Loesing Tuan Cao, Marcos Antonio Vaz Salles, 28 Benjamin Sowell, Yao Yue, Alan J. Demers, Johannes Gehrke, Walker M. Adam Silberstein, Russell Sears, 29 Wenchao Zhou, Brian F. Cooper Emad Soroush, Magdalena Balazinska, 30 Daniel L. Wang Sudipto Das, Divyakant Agrawal , Amr 31 El Abbadi Alon Halevy, Hector Gonzalez, Jayant 32 Madhavan, Christian Jensen, Jonathan Goldberg-Kidon, Warren Shen, Rebecca Steve Ko, Imranul Hoque, Brian Cho, 33 Indranil Gupta Kashi Vishwanath, Nachi Nagappan 34 Hrishikesh Amur, James Cipar, Varun 35 Gupta, Michael Kozuch, Gregory Ganger, Karsten Schwan Lonnie Princehouse, Hussam Abu36 Libdeh, Hakim Weatherspoon Peter Bodik, Armando Fox, Michael 37 Franklin, Michael Jordan, David Patterson Brian F. Cooper, Adam Silberstein, 38 Erwin Tam, Raghu Ramakrishnan, Russell Sears Timothy Wood, H. Andres Lagar-Cavilla 39 and K.K. Ramakrishnan, Prashant Shenoy, and Jacobus Van der Merwe Herodotos Herodotou, Fei Dong, and 40 Shivnath Babu Zhiming Shen, Sethuraman Subbiah, 41 and Xiaohui Gu and John Wilkes An evaluation of alternative architectures for transaction processing in the cloud. Fast checkpoint recovery algorithms for frequently consistent applications. A batch of PNUTS: experiences connecting cloud batch and serving systems. ArrayStore: a storage manager for complex parallel array processing. G-Store: A Scalable Data Store for Transactional Multi key Access in the Cloud Google Fusion Tables: Data Management, Integration and Collaboration in the Cloud Making Cloud Intermediate Data FaultTolerant Characterizing Cloud Computing Hardware Reliability Robust and Flexible Power-Proportional Storage RACS: A Case for Cloud Storage Diversity Characterizing, Modeling, and Generating Workload Spikes for Stateful Services Benchmarking cloud serving systems with YCSB PipeCloud: Using Causality to Overcome Speed-of-Light Delays in Cloud-Based Disaster Recovery No One (Cluster) Size Fits All: Automatic Cluster Sizing for Data-intensive Analytics. CloudScale: Elastic Resource Scaling for Multi-Tenant Cloud Systems. 2 SIGMOD SIGMOD SIGMOD SIGMOD SOCC SOCC SOCC SOCC SOCC SOCC SOCC SOCC SOCC SOCC SOCC 2008 Shore 2009 PNUTSView 2010 HStore2 2010 Eval Cloud 2011 CheckRecover 2011 PNUTSbatch 2011 ArraySt ore 2010 G-Store 2010 FusionT able Interme 2010 diateDa ta HW 2010 reliabilit y PowerPr 2010 opotiona l 2010 RACS 2010 workloa d spike 2010 YCSB 2011 PipeClo ud 2011 Elastisiz er 2011 CloudSc ale 別紙3 データストア関連論文リスト Swapnil Patil, Milo Polte, Kai Ren, 42 Wittawat Tantisiriroj, Lin Xiao, Julio Lopez, and Garth Gibson Bikash Sharma, Victor Chudnovsky, 43 Joseph L. Hellerstein, and Rasekh Rifaat, and Chita R. Das Giuseppe DeCandia, Deniz Hastorun, 44 Madan Jampani, Gunavardhan Kakulapati, Avinash Lakshman, Alex Mike Mammarella, Shant Hovsepian, 45 and Eddie Kohler. YCSB++: Benchmarking and Performance Debugging Advanced Features in Scalable Table Stores. Modeling and Synthesizing Task Placement Constraints in Google Compute Clusters. Dynamo: Amazon's Highly Available KeyValue Store Modular data storage with Anvil. Michael Stonebraker, Paul Brown, Alex 46 Poliakov, Suchi Raman John C. McCullough, John Dunagan and 47 Alec Wolman, Alex C. Snoeren, Nguyen Tran, Marcos K. Aguilera, and 48 Mahesh Balakrishnan, Microsoft Research Silicon Valley Ashish Chawla, Benjamin Reed, Karl 49 Juhnke, and Ghousuddin Syed, Yahoo! Inc Dennis Fetterly, Maya Haridasan, and 50 Michael Isard, Swaminathan Sundararaman Peter Macko and Margo Seltzer, Keith A. 51 Smith, Marc Eshel, Roger Haskin, Dean 52 Hildebrand, Manoj Naik, Frank Schmuck, and Renu Tewari, Beth Trushkowsky, Peter Bod?k, 53 Armando Fox, Michael J. Franklin, Michael I. Jordan, and David A. Swapnil Patil and Garth Gibson, 54 Nitin Agrawal, Leo Arulraj, Andrea C. 55 Arpaci-Dusseau, and Remzi H. Arpaci- 56 57 58 59 60 61 62 Dusseau Michael Stonebraker, Daniel Abadi, Adam Batkin, Xuedong Chen, Mitch Cherniack, Miguel Ferreira, Edmond Michael Stonebraker, Samuel Madden, Daniel J. Abadi, Stavros Harizopoulos, Nabil Hachem, Pat Helland Brian F. Cooper, Raghu Ramakrishnan, Utkarsh Srivastava, Adam Silberstein, Philip Bohannon, Hans-Arno Jacobsen, Ronnie Chaiken, Bob Jenkins, Per-?ke Larson, Bill Ramsey, Darren Shakib, Simon Weaver, and Jingren Zhou. Azza Abouzeid, Kamil BajdaPawlikowski, Daniel J. Abadi, Alexander Rasin, Avi Silberschatz Donald Kossmann, Tim Kraska, Simon Loesing, Stephan Merkli, Raman Mittal, Flavio Pfaffhauser Johnson, R., Pandis, I., Stoica, R., Athanassoulis, M. and Ailamaki, A. The Architecture of SciDB Stout: An Adaptive Interface to Scalable Cloud Storage Online Migration for Geo-distributed Storage Systems Semantics of Caching with SPOCA: A Stateless, Proportional, OptimallyConsistent Addressing Algorithm TidyFS: A Simple and Small Distributed File System Tracking Back References in a WriteAnywhere File System Panache: A Parallel File System Cache for Global File Access SOCC SOCC SOSP SOSP SSDBM C-Store: A Column-oriented DBMS The End of an Architectural Era (It's Time for a Complete Rewrite). PNUTS: Yahoo!'s hosted data serving platform. SCOPE: easy and efficient parallel processing of massive data sets. HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads. Cloudy: A Modular Cloud Storage System. Aether: A scalable approach to logging. 3 2011 UM 2007 Dynamo 2009 Anvil 2011 SciDB USENIX ATC 2010 Stout USENIX ATC 2011 Nomad USENIX ATC 2011 SPOCA USENIX ATC 2011 TidyFS USENIX FAST 2010 Backlog USENIX FAST 2010 Panache The SCADS Director: Scaling a Distributed USENIX Storage System Under Stringent FAST Performance Requirements USENIX Scale and Concurrency of GIGA+: File FAST System Directories with Millions of Files Emulating Goliath Storage Systems with David 2011 YCSB++ USENIX FAST VLDB VLDB VLDB VLDB VLDB VLDB VLDB 2011 SCADS Director 2011 GIGA+ 2011 David 2005 C-store 2007 H-Store 2008 PNUTS 2008 SCOPE 2009 Hadoop DB 2010 Cloudy 2010 Aether 別紙3 データストア関連論文リスト Pandis, I., Johnson, R., Hardavellas, N. 63 and Ailamaki, A. Biplob Debnath, Sudipta Sengupta, Jin 64 Li Francesco Fusco, Marc Stoecklin, 65 Michaol Vlachos Mustafa Canim, George Mihaila, 66 Bishwaranjan Bhattacharjee, Kenneth Ross, Christian Lang Hoang Tam Vo, Chun Chen, Beng Chin 67 Ooi Data-Oriented Transaction Execution. FlashStore: High Throughput Persistent Key-Value Store Net-Fli: On-the-fly Compression, Archiving and Indexing of Streaming Network Traffic SSD Bufferpool Extensions for Database Systems Towards Elastic Transactional Cloud Storage with Range Query Support Umar Farooq Minhas, Shriram RemusDB: Transparent High-Availability for Database Systems Aboulnaga, Ken Salem, Andrew Warfield 68 Rajagopalan, Brendan Cully, Ashraf Gang Chen, Hoang Tam Vo, Sai Wu, 69 Beng Chin Ooi, M. Tamer zsu Hyungsoo Jung Hyuck Han, Alan 70 Fekete, Uwe Roehm Avrilia Floratou, Jignesh Patel, Eugene 71 Shekita, Sandeep Tata V. Srinivasan, Brian Bulkowski 72 Ippokratis Pandis, Pinar Tozun, Ryan 73 Johnson, Anastasia Ailamaki Philip Bernstein, Colin Reid, Ming Wu, 74 Xinhao Yuan Roxana Geambasu, Amit A. Levy, 75 Tadayoshi Kohno, Arvind Krishnamurthy, and Henry M. Levy, Jun Rao, Eugene Shekita, Sandeep Tata 76 Mary Baker, Mehul Shah, David S. H. 77 Rosenthal, Mema Roussopoulos, Petros Maniatis, TJ Giuli, and Prashanth Mark W. Storer, Kevin M. Greenan, 78 Ethan L. Miller, and Kaladhar Voruganti A Framework for Supporting DBMS-like Indexes in the Cloud Serializable Snapshot Isolation for Replicated Databases in High-Update Scenarios Column-Oriented Storage Techniques for MapReduce Citrusleaf: A Real-Time NoSQL DB which Preserves ACID PLP: Page Latch-free Shared-everything OLTP Optimistic Concurrency Control by Melding Trees Comet: An Active Distributed Key-Value Store Using Paxos to Build a Scalable, Consistent, and Highly Available Datastore A fresh look at the reliability of long-term digital storage A secure, recoverable, long-term archival storage system 4 VLDB VLDB VLDB VLDB VLDB VLDB VLDB VLDB VLDB VLDB VLDB VLDB OSDI VLDB SIGOPS Oper. Syst. Rev. 40, 4 ACM Trans. Storage 2010 DORA 2010 FlashSt ore 2010 Net-Fli 2010 SSDDB2 2010 ecStore 2011 RemusD B 2011 CloudIn dexFW 2011 RSSI 2011 Column Hadoop 2011 Citrusle af 2011 PLP 2011 Meld 2010 Comet 2011 Spinnak er 2006 Fleshloo k 2009 Potshar ds