Selecting a Paper
2013 Papers (Already Selected)
- Note: How about Google File System, Bigtable, or Dynamo? Or something on networking? Or Gordon? Or cross-VM side channel attacks?
- Richard Snider: "Finding a needle in Haystack: Facebook’s photo storage" by Doug Beaver et. al. (OSDI 2010)
- Juan Vega: "Windows Azure Storage: A Highly Available Cloud Storage Service with Strong Consistency" by Brad Calder et. al. (SOSP 2011)
- Xin Feng: "Spanner: Google's Globally-Distributed Database" by J. Corbett et. al. (OSDI 2012)
- Timothy Lindberg: "Xen and the Art of Virtualization" by P. Barham et. al. (SOSP 2003)
- Peter Hodson: "The evolution of an x86 virtual machine monitor" by O. Agesen et. al. (ACM SIGOPS Operating Systems Review 2010)
- John McGill: "Virtualizing I/O Devices on VMWare Workstation's Hosted Virtual Machine Monitor" by J. Sugerman et. al. (Usenix 2001)
- Juan Mendez: "SnowFlock: Rapid Virtual Machine Cloning for Cloud Computing" by H. Andres Lagar-Cavilla et al. (EuroSys 2009)
- Jason O'Reilly: "An Architecture for Modular Data Centers" by James Hamilton (CIDR 2007)
- Jennifer Johnson: "It’s Not Easy Being Green" by P. Gao et. al. (SIGCOMM 2012)
- Angela Cheng: "Power Routing: Dynamic Power Provisioning in the Data Center" by Steven Pelley et. al. (ASPLOS 2010)
- Michael Morelli: "Hey, You, Get Off of My Cloud: Exploring Information Leakage in Third-Party Compute Clouds" by Thomas Ristenpart et. al. (CCS 2009)
- Alejandro Dominguez: "CloudAV: N-Version Antivirus in the Network Cloud" by Oberheide et al. (USENIX Security 2008)
- Scott Snow: "A New Form of DOS Attack in a Cloud and Its Avoidance Mechanism" by Huan Liu (CCSW 2010)
- Evan Glick: "Chatty Tenants and the Cloud Network Sharing Problem" by H. Ballani et. al. (NSDI 2013)
- Josh Butler: "Camdoop: Exploiting In-network Aggregation for Big Data Applications" by P. Costa et. al. (NSDI 2012)
- Eddie Escamilla: "Cross-VM Side Channels and Their Use to Extract Private Keys", Y. Zhang et. al., CCS 2012
Highlighted Papers
The general rule of thumb is that papers discussed in the class should be published after 2010. Here is a list of "selected papers" that, although they may be a few years older than our typical 2010+ standard, provide a strong introduction to cloud computing and related datacenter/architecture topics.
Systems:
- [2012 class] "Above the Clouds: A Berkeley View of Cloud Computing", by Michael Armbrust, et al. Technical Report, University of Berkeley, 2009
- "Data-Intensive Supercomputing: The case for DISC", by Randal Bryant, Technical Report, Carnegie Mellon University, 2007
- [2012 class] "Web search for a planet: the Google cluster architecture" by Luiz Barroso, et al. (IEEE Computer 2003)
- [2012 class] "MapReduce: Simplified Data Processing on Large Clusters" by Jeffrey Dean and Sanjay Ghemawat (OSDI 2004)
- [2012 class] "The Google File System" by Sanjay Ghemawat et al. (SOSP 2003)
- [2012 class] "Bigtable: A Distributed Storage System for Structured Data", by Fay Chang et. al. (OSDI 2006)
- "Spanner: Google's Globally-Distributed Database" by J. Corbett et. al. (OSDI 2012)
- [2012 class] "Windows Azure Storage: A Highly Available Cloud Storage Service with Strong Consistency" by Brad Calder et. al. (SOSP 2011)
- "Dynamo: Amazon's Highly Available Key-value Store" by G. DeCandia et. al. (SOSP 2007)
- [2012 class] "Pig Latin: A Not-So-Foreign Language for Data Processing", by Christopher Olston, et. al (SIGMOD 2008)
- "Building a High-Level Dataflow System on top of Map-Reduce: The Pig Experience" by Alan Gates et al. (VLDB 2009)
- [2012 class] "Finding a needle in Haystack: Facebook’s photo storage" by Doug Beaver et. al. (OSDI 2010)
- "Dryad: Distributed Data-Parallel Programs from Sequential Building Blocks" by M. Isard et. al. (EuroSys 2007)
- "HAIL: A High-Availability and Integrity Layer for Cloud Storage" by Kevin Bowers et. al. (CCS 2009)
- "Flat Datacenter Storage" by E. Nightengale et. al. (OSDI 2012)
Security:
- [2012 class] "Hey, You, Get Off of My Cloud: Exploring Information Leakage in Third-Party Compute Clouds" by Thomas Ristenpart et. al. (CCS 2009)
- "Provable data possession at untrusted stores" by Giuseppe Ateniese et. al. (CCS 2007)
- [2012 class] "CloudAV: N-Version Antivirus in the Network Cloud" by Oberheide et al. (USENIX Security 2008)
- [2012 class] "A New Form of DOS Attack in a Cloud and Its Avoidance Mechanism" by Huan Liu (CCSW 2010)
- "Cross-VM Side Channels and Their Use to Extract Private Keys", Y. Zhang et. al., CCS 2012
Virtualization:
- "SnowFlock: Rapid Virtual Machine Cloning for Cloud Computing" by H. Andres Lagar-Cavilla et al. (EuroSys 2009)
- "Virtualizing I/O Devices on VMWare Workstation's Hosted Virtual Machine Monitor" by J. Sugerman et. al. (Usenix 2001)
- [2012 class] "Xen and the Art of Virtualization" by P. Barham et. al. (SOSP 2003)
- "The evolution of an x86 virtual machine monitor" by O. Agesen et. al. (ACM SIGOPS Operating Systems Review 2010)
- [2012 class] "Live Migration of Virtual Machines" by C. Clark et. al., (NSDI 2005)
- "Black-box and Gray-box Strategies for Virtual Machine Migration" by T. Wood et. al., (NDSI 2007)
- "Post-Copy Based Live Virtual Machine Migration Using Adaptive Pre-Paging and Dynamic Self-Balloning" by Michael Hines et. al. (VEE 2009)
- "Difference Engine: Harnessing Memory Redundancy in Virtual Machines" by Diwaker Gupta (OSDI 2008)
Networking:
- "F10: A Fault-Tolerant Engineered Network" by V. Liu et. al. (NSDI 2013)
- "Chatty Tenants and the Cloud Network Sharing Problem" by H. Ballani et. al. (NSDI 2013)
- "BCube: a high performance, server-centric network architecture for modular data centers" by C. Guo et. al. (SIGCOMM 2009)
- "NetPilot: Automating Datacenter Network Failure Mitigation" by X. Wu et. al. (SIGCOMM 2012)
- "It’s Not Easy Being Green" by P. Gao et. al. (SIGCOMM 2012)
- "FairCloud: Sharing the Network in Cloud Computing" by L. Popa et. al. (SIGCOMM 2012)
- "Leveraging Endpoint Flexibility in Data-Intensive Clusters" by M. Chowdhury et. al. (SIGCOMM 2013)
- "How Hard Can It Be? Designing and Implementing a Deployable Multipath TCP" by C. Raiciu et. al. (NDSI 2012)
- "Camdoop: Exploiting In-network Aggregation for Big Data Applications" by P. Costa et. al. (NSDI 2012)
- "VL2: A Scalable and Flexible Data Center Network" by A. Greenberg et. al. (SIGCOMM 2009)
- "PortLand: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric" by R. Mysore et. al (SIGCOMM 2009)
- "Floodless in SEATTLE: A Scalable Ethernet Architecture for Large Enterprises" by C. Kim (SIGCOMM 2008)
- "SPAIN: COTS Data-Center Ethernet for Multipathing over Arbitrary Topologies" by J. Mudigonda (NSDI 2010)
Power:
- [2012 class] "Power Routing: Dynamic Power Provisioning in the Data Center" by Steven Pelley et. al. (ASPLOS 2010)
- [2012 class] "Gordon: Using Flash Memory to Build Fast, Power-efficient Clusters for Data-intensive Applications" by Adrian Caulfield et. al. (ASPLOS 2009)
- "FAWN: A Fast Array of Wimpy Nodes" (SOSP 2009)
Misc:
- "Clearing the Clouds: A Study of Emerging Scale-out Workloads on Modern Hardware" by M. Ferdman et. al. (ASPLOS 2012)
- [2012 class] Shipping containers: "An Architecture for Modular Data Centers" by James Hamilton (CIDR 2007)
- Not a paper, but useful background information on this topic: http://7x24exchangerm.org/Modular%20Power%20Centers%20_RMH_2011.pdf
Conference / Workshops on Cloud Computing Topics
You may also select papers from the following list of "approved" conferences and workshops, provided that the paper was published after 2010. Want to add additional conferences to this list? Email Dr. Shafer for approval.
- Cloud Computing (and Parallel / Distributed Systems)
- USENIX Symposium on Operating Systems Design and Implementation (OSDI) - See OSDI'12, OSDI'10
- ACM/IEEE International Symposium on Cluster, Cloud, and Grid Computing (CCGRID)
- ACM Symposium on Cloud Computing (SOCC)
- ACM Cloud Computing Security Workshop (CCSW) - See CCSW'13, CCSW'12, CCSW'11, CCSW'10
- ACM Conference on Computer and Communications Security (CCS) - See CCS'13, CCS'12, CCS'11, CCS'10
- USENIX HotCloud Workshop - See HotCloud 2013, HotCloud 2012, HotCloud 2011, HotCloud 2010
- Other IEEE TCSC Cloud Computing Conferences
- Computer Networking (try to limit papers to topics tangentially related to cloud computing, such as large-scale datacenter networks, etc...)
- ACM Special Interest Group on Data Communication (SIGCOMM) - See SIGCOMM 2013, SIGCOMM 2012, SIGCOMM 2011, SIGCOMM 2010
- ACM/IEEE Symposium on Architectures for Networking and Communications Systems (ANCS) - See ANCS 2013, ANCS 2012, ANCS 2011, ANCS 2010
- USENIX Symposium on Networked System Design and Implementation (NDSI) - See NSDI'13, NSDI'12, NSDI'11, NSDI'10
- Computer Architecture (try to limit papers to topics tangentially related to cloud computing)
- International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS) - See ASPLOS 2013, ASPLOS 2012, ASPLOS 2011, ASPLOS 2010
- International Symposium on Computer Architecture (ISCA) - See ISCA 2013, ISCA 2012, ISCA 2011, ISCA 2010, etc..
- IEEE/ACM International Symposium on Microarchitecture (MICRO)
- International Symposium on High-Performance Computer Architecture (HPCA)
- ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments (VEE) - See VEE 2011, VEE 2010
You can search for papers in Google Scholar, the ACM Digital Library (on campus only!), or IEEE Explore (on campus only!).