Sign up or log in to bookmark your favorites and sync them to your phone or calendar.

Workshop on Expo Day [clear filter]
Monday, October 1


Rackspace Cloud: An Overview

Learn how the Rackspace Cloud has helped companies scale from a startup to a fast-growing company.

2012 Keynote & Breakout Sessio...
avatar for Arrian Mehis

Arrian Mehis

Enterprise Architect, Rackspace Hosting
Enterprise Architect, Rackspace Hosting Arrian Mehis is an Enterprise Architect at Rackspace Hosting, the world’s leading specialist in hosting and cloud computing. Mehis joined Rackspace in his current capacity in May 2011, bringing with him nearly 20 years of IT industry experience... Read More →

Monday October 1, 2012 9:00am - 10:00am
Ballroom West


Build A Cloud Day - All Day Workshop

Build a Cloud Day is dedicated to teaching users how to build and manage a cloud computing environment using free and open source software. The program is designed to expose attendees to the concepts and best practices around deploying cloud computing infrastructure.

09:00 - 09:30 Welcome: Joe Brockmeier, Open Source Cloud Computing Evangelist, Citrix

Will kick off the event with a review of the program and short talk on cloud computing.

09:30 - 10:30 Delivering IaaS with CloudStack: David Nalley, Apache CloudStack

Apache CloudStack is arguably one of the most mature, frequently deployed IaaS platforms - having been used in multi-national public clouds, and private clouds scaling to tens of thousands of physical compute nodes, we’ll look at considerations for deploying CloudStack and how to gain the most efficiency from your deployment.

10:30 - 10:45 Break

10:45 - 11:30 Scaling Storage with Ceph: Ross Turk, VP Community at InkTank

Ceph is an open source distributed object store, network block device, and file system designed for reliability, performance, and scalability. It runs on commodity hardware, has no single point of failure, and is supported by the Linux kernel. This talk will describe the Ceph architecture, share its design principles, and discuss how it can be part of a cost-effective, reliable cloud stack.

11:30 - 12:15 Deploying Private PaaS with ActiveState Stackato: John Wetherill, Developer Evangelist, ActiveState

This presentation covers building and deploying a Private Platform-as-a-Service (PaaS) on CloudStack. Diane Mueller, ActiveState’s Cloud Evangelist shows how to deploy ActiveState’s Stackato, an enterpriseready multi-lingual Private PaaS that runs on any cloud and supports deploying and managing web & mobile applications in any language including Java, .Net, Python, Perl, PHP Ruby, Node.js, Clojure, Scala and Erlang - to name a few. Using the CloudStack UI, Diane demonstrates how to configure and deploy the PaaS and then shows how easy it is to push a live application in under an hour.

12:15 - 1:30 Lunch

1:30 - 2:15 Scalable Networking in Apache CloudStack: Chiradeep Vittal, Citrix

We will discuss approaches to providing network security and isolation to tenants in a large-scale public cloud. A scale-up solution that uses powerful hardware devices may be suitable in some cases while a scale-out solution that uses a federation of finely tuned firewalls may be more suitable in other cases. Apache CloudStack is flexible enough to provide the cloud operator with solutions at either end of the spectrum.

2:15 - 3:00 Building the App Cloud with BitNami: Erica Brescia, CEO BitRock

This presentation will introduce the BitNami library of applications and tools and explain how you can leverage it to provide your users with one-click deployment of their favorite apps. As cloud deployment becomes mainstream, focus is expanding from infrastructure topics such as high-availability and elasticity to application-level concerns. BitNami makes it easy to deploy and manage popular open source applications so your end-users can take advantage of your bright shiny cloud from day one.

3:00 - 3:15 Break

3:15 - 4:00 Challenges in Supporting Multi-Hypervisors in CloudStack: Kelven Yang, Citrix

Kelven will discuss the CloudStack support for multiple hypervisors in a single cloud. Kelven will discuss the two integration strategies: directly managed hypervisor host clusters and externally managed hypervisor host clusters. He will also describe the challenges to be solved when giving storage volumes first-class citizenship in different hypervisor environments. Kelven will describe how CloudStack-managed HA and hypervisor-native HA and DRS can work together, and how console access is uniformly supported across hypervisors.

4:00 - 4:45 Practice your Cloud-Fu with Scalr: Sebastian Stadil, CEO Scalr

Multi-cloud this! Orchestrate that! In this amazing presentation of Cloud-Fu, Sebastian Stadil, founder of the open source cloud management software Scalr, will demonstrate basic to advanced katas for building applications on the cloud that are resilient to pesky failures.

4:45 - 5:00 Close: Joe Brockmeier, Citrix


2012 Keynote & Breakout Sessio...
avatar for Erica Brescia

Erica Brescia

CEO, BitRock
CEO, BitRock Erica is the CEO of BitRock, makers of the popular BitNami cloud application library and management tools. BitNami Cloud Images drive millions of hours of usage across the leading cloud providers. Prior to joining BitRock, Erica managed multiple sales teams for T-Mobile... Read More →
avatar for Joe Brockmeier

Joe Brockmeier

Open Source Cloud Computing Evangelist, Citrix
Joe Brockmeier is a open source evangelist for Citrix, working on the Apache CloudStack (incubating) project. Joe has a long history of involvement with open source, and has also worked for Novell as the openSUSE community manager. He has also spent many years working as a technology... Read More →
avatar for David Nalley

David Nalley

Apache CloudStack
David Nalley is a committer and is currently on the PPMC for Apache CloudStack In addition he\’s a long time contributor to the Fedora Project, where among other things he is currently serving on the Fedora Project Board. He’s also contributed to in various forms to Cobbler... Read More →
avatar for Sebastian Stadil

Sebastian Stadil

Founder, Scalr, Scalr
Sebastian Stadil has been a Cloud developer since 2004, starting with web services for e-commerce and then for computational resources. He founded the Silicon Valley Cloud Computing Group, a user group of over 8000 members that meets monthly to present the latest developments in the... Read More →
avatar for Ross Turk

Ross Turk

VP of Community, InkTank
Ross Turk is Vice President of Community for Inktank, responsible for building a strategic relationship with users, contributors, and the open source community. Ross brings more than 15 years of experience creating software, managing complex IT systems, and helping companies understand... Read More →

Chiradeep Vittal

Principal Architect, Cloud Platforms Group, Citrix
Chiradeep Vittal is Principal Architect in the Cloud Platforms Group at Citrix Systems. He is a maintainer in the Apache CloudStack project where he contributes to networking and storage parts of the Infrastructureas- a-Service (IAAS) management system. He was a founding engineer... Read More →
avatar for John Wetherill

John Wetherill

Developer Evangelist, ActiveState
Originally from Canada, John has spent much of his career designing and building software at a handful of startups, at Sun Microsystems, NeXT Inc., and more recently in the smart grid and energy space. His biggest passion is for developer tools, or more generally any tool, language... Read More →

Kelven Yang

Software Architect, Cloud Platforms Group, Citrix

Monday October 1, 2012 9:00am - 5:00pm
Ballroom East


Presentation Mobility: The Freedom of the Cloud

"Presentation Mobility:  The Freedom of the Cloud"
Prezi Workshop will train users how to use Prezi in the cloud, on the desktop, and on the iPad.  I will also discuss how to use Prezi Meeting for collaborative presentation development.

2012 Keynote & Breakout Sessio...
avatar for Drew Banks

Drew Banks

Head of Marketing, Sales and Support, Prezi
Drew Banks is Head of Marketing, Sales, and Support @ Prezi, the zooming presentation software.  Backed by TED Conference and Accel Partners, Prezi has been widely adopted across education and business. In its 3+ years, Prezi has amassed over 14 million users and is growing at over... Read More →

Monday October 1, 2012 10:00am - 11:00am
Ballroom West


All Day Training Class on "Hadoop Ecosystem" (Separate Registration is Required)

One Day Intensive Hands-on Hadoop Training Class. Topics Covered Are:

Hadoop BI Developer Hadoop – Architecture, HDFS, EcoSystem & Map Reduce

Time: 10am – noon, 1pm – 4:30pm


Audience: Engineers, Programmers, Networking specialists, Managers, Executives


Software covered: HDFS, MapReduce, Pig, Hive, HBase


Labs: 5 labs, 20 mins each



  • Introduce students to the core concepts of Hadoop

  • Deep dive into the critical architecture paths of HDFS, MapReduce and HBase

  • Teach the basics of how to effectively write Pig and Hive scripts

  • Explain how to choose the correct use cases for Hadoop

  • Give each student access to an individual 1-node Hadoop cluster in Rackspace to run through some hands-on labs for the 5 software components: HDFS, MapReduce, Pig, Hive, HBase

  • Provide links to the best books, blog posts and videos for students to learn more about Hadoop on their own


Summary: This is a fast paced, vendor agnostic, technical overview of the Hadoop landscape. No prior knowledge of databases or programming is assumed. This survey course is targeted towards both technical and non-technical people who want to understand the emerging world of Big Data, with a specific focus on Hadoop. In each sub-topic, the instructor will provide links and resource recommendations for students who want to explore that area further (for example, YouTube videos, books, blog posts). Students will be given a ~100 page PDF slide deck which can be used as reference material after the course. PDFs will also be given out for the 5 short labs in the course.

Course structure:

 10am – 10:30am: Introduction to Big Data and Hadoop

 10:30am – 11:15am: HDFS Lecture

11:15am – 11:40am: HDFS lab

 11:40am – noon: MapReduce Introduction Lecture

 Noon – 1pm: Lunch

1pm – 1:20pm: MapReduce Advanced Lecture

1:20pm – 1:40pm: MapReduce Lab

1:40pm – 2pm: Pig Lecture

2pm – 2:20pm: Pig lab

2:20pm – 2:40pm: Hive Lecture

2:40pm – 3pm: Hive Lab

3pm – 3:40pm: HBase Lecture

3:40pm – 4pm – HBase Lab

4pm – 4:30pm: Next-gen Hadoop (2.0) Lecture

Session 1: Intro to Hadoop (10am to 10:30am)

  • Parallel Computer vs. Distributed Computing

  • Brief history of Hadoop

  • Scaling with Hadoop

  • Hadoop clusters at Yahoo! and Facebook

  • RDBMS/SQL vs. Hadoop

  • Hadoop Daemons introduction: NameNode, DataNode, JobTracker, TaskTracker

  • Intro to the Hadoop ecosystem: HDFS, MapReduce, Pig, Hive, HBase, ZooKeeper

  • Vendor Comparison (Cloudera vs. Hortonworks vs. Amazon EMR)

  • Hardware + Software recommendations for Hadoop


Session 2: HDFS (10:30am – 11:40am)

  • Linux File system options

  • Sample HDFS commands

  • HDFS sample architecture at Yahoo!

  • Data Locality

  • Rack Awareness

  • Write Pipeline

  • Read Pipeline

  • NameNode architecture (EditLog, FsImage, location of replicas, safe mode)

  • Secondary NameNode architecture

  • DataNode architecture

  • Heartbeats

  • Block Scanner

  • Fsck Health Check + file breakdown

  • Balancer

  • LAB #1: Exploring the HDFS cmd line


Session 3: MapReduce (11:40am to 1:40pm, minus one hour lunch)

  • MapReduce Architecture

  • JobTracker/TaskTracker

  • Combiner

  • Partitioner (shuffle)

  • Thinking in the MapReduce way (examples of Mappers & Reducers)

  • Counters

  • Hadoop Streaming (with python)

  • Hadoop Java example

  • Input/output formats

  • Speculative Execution

  • Distributed Cache

  • Job Scheduling (FIFO, Fair Scheduler, Capacity Scheduler)

  • LAB #2: Running MapReduce wordcount in Python & Java


Session 4: Pigs Eat Anything (1:40pm to 2:20pm)

  • Pig philosophy and architecture

  • Pig Latin and the Grunt shell

  • Loading data

  • Data types and schemas

  • Pig Latin details: structure, functions, expressions, relational operators

  • Intro to User Defined Functions and Scripts

  • LAB #3: Exploring Pig Latin commands


Session 5: Hive for Structured Data (2:20pm to 3pm)

  • Hive philosophy and architecture

  • Hive vs. RDBMS

  • HiveQL and Hive Shell

  • Managing tables

  • Data types and schemas

  • Querying data

  • LAB #4: Analyzing movie reviews with Hive


Session 6: Real-time I/O with HBase (3pm – 4pm)

  • HBase versions and origins

  • HBase architecture

  • HBase core concepts

  • HBase vs. RDBMS

  • HBase Master and Region Servers

  • Data Modeling

  • Column Families and Regions

  • HBase Internals: Bloom Filters and Block Indexes

  • Write Pipeline / Read Pipeline

  • Compactions

  • LAB #5: Intro to the HBase command line


Session 7: Next-gen Hadoop (4pm – 4:30pm)

  • HDFS improvements: HDFS Federation, NameNode HA, Snapshots

  • MapReduce improvements: YARN, Performance


Monday October 1, 2012 10:00am - 3:00pm
Redwood Room


Navigating the Transition from Relational Technology to NoSQL

NoSQL databases have emerged as a better match than relational systems for modern interactive applications, offering cost-effective data management at “Big Data” scale. But there are significant differences between structured and schema-less database technology. What should architects and technical managers know as they explore NoSQL solutions for their teams?
In this workshop you will learn:

  • How to evaluate NoSQL (both technical advantages and limitations) as a potential data management approach
  • Critical differences between NoSQL and RDBMS for designing, building and running production applications
  • Ideal use cases for NoSQL technology and sample reference architectures


2012 Keynote & Breakout Sessio...
avatar for Dipti Borkar

Dipti Borkar

Director of Product Management, COUCHBASE
Director of Product Management, Couchbase Dipti Borkar is director of product management at Couchbase where she is responsible for the company’s flagship product, Couchbase Server, and works with customers and users to understand emerging requirements for low-latency, scalable... Read More →

Monday October 1, 2012 11:00am - 12:00pm
Ballroom West


Open Source Cloud Hardware - 10x lower cost storage

You've got open source cloud software. What about open source cloud HARDWARE?
Storing petabytes of data in the cloud for your company can cost millions of dollars. Have you decided to keep the data on your infrastructure? Come learn about open source hardware options to save 10x on your storage costs.


2012 Keynote & Breakout Sessio...
avatar for Gleb Budman

Gleb Budman

Co-founder and CEO, Backblaze
Gleb Budman is co-founder and CEO of Backblaze, an online backup company that provides the only "backup all data" service with unlimited storage for just $5 per month. Previously, Gleb led product strategy, management, and marketing for venture-funded companies Kendara and MailFrontier from inception through acquisition by Exci... Read More →

Monday October 1, 2012 1:00pm - 1:45pm
Ballroom West


Tao of Data Science

Learn how to use your favorite programming language, like Ruby, to program complex batch and real-time data pipelines in your Big Data environment.

2012 Keynote & Breakout Sessio...
avatar for Flip Kromer

Flip Kromer

CTO, President, Co-Founder, Infochimps
CTO, President, Co-Founder Philip (Flip) Kromer is co-founder and CTO of Infochimps, a Big Data infrastructure provider. At Infochimps, Flip built a cloud architecture capable of spinning up clusters as large as 60 nodes with the push of a button, using devops to make this possible... Read More →

Monday October 1, 2012 1:45pm - 2:30pm
Ballroom West


Unified Eco-System for Effective Service offering and Revenue Maximization

Cloud Service Providers have to constantly drive two major business objectives: Service Excellence and Revenue Maximization. Neither can be compromised in today's competitive business landscape.

It is of paramount importance to set up a strong unified eco-system that enables the achieving of both goals. An ideal approach to accomplish them is to automate the billing process, which opens up newer revenue streams and ensures uninterrupted quality services to customers.


2012 Keynote & Breakout Sessio...
avatar for Satish Garikipati

Satish Garikipati

Director, Product Marketing (Cloud Billing)  Satish Garikipati is Director, Product Marketing (Cloud Billing) at Magnaquest, a market leader in providing subscription management solutions to global customers. Satish Garikipati has over 15 years experience in IT industry. He is a... Read More →
avatar for Rama Krishna Mashetty

Rama Krishna Mashetty

CMO, Magnaquest
Chief Marketing Officer, Magnaquest Rama krishna Mashetty, Chief Marketing Officer at Magnaquest, a market leader in providing subscription management solutions to global customers. Rama Krishna comes with a rich experience of over 18 years spanning across Product engineering, Product... Read More →

Monday October 1, 2012 3:00pm - 4:00pm
Ballroom West


Goodbye ETL, Hello Hadoop

Hadoop isn’t just big data, it can run on a single machine or a large cluster, it can process structured data and unstructured data or even plain byte arrays.  But it also removes the requirement that time consuming and expensive  data models be built prior to performing analytics.The presentation discusses why current agility in data analytics is critical and how the removal of the traditional ETL process significantly increases the success of analytic  projects.

2012 Keynote & Breakout Sessio...
avatar for Stefan Groschupf

Stefan Groschupf

CEO and Co-Founder, Datameer
CEO, Datameer Stefan Groschupf is the co-founder and CEO of Datameer and is well known for his entrepreneurial accomplishments in data management and large-scale distributed computing. Before Datameer, Stefan was the co-founder and CEO of Scale Unlimited, a leading provider of educational... Read More →

Monday October 1, 2012 4:00pm - 5:00pm
Ballroom West