Loading…
This event has ended. Create your own event → Check it out
This event has ended. Create your own
View analytic

Sign up or log in to bookmark your favorites and sync them to your phone or calendar.

Tuesday, October 14
 

9:30am

Check In + Breakfast
Check in to the event, get schedule, name badge, swag and grab breakfast.

Tuesday October 14, 2014 9:30am - 10:00am
Lunch Hall

10:00am

Keynote
Introduction to Cassandra Day Seattle 2014 by Christian Hasker

Speakers

Tuesday October 14, 2014 10:00am - 10:15am
Track A

10:15am

Getting Started with Apache Cassandra — From Zero to Hero
This is a crash course introduction to Cassandra. You'll step away understanding how it's possible to to utilize this distributed database to achieve high availability across multiple data centers, scale out as your needs grow, and not be woken up at 3am just because a server failed. We'll cover the basics of data modeling with CQL, and understand how that data is stored on disk. We'll wrap things up by setting up Cassandra locally, so bring your laptops!

Speakers
avatar for Jonathan Haddad

Jonathan Haddad

Evangelist for Apache Cassandra, DataStax
Jon has 15 years experience in both development and operations. For 10 years he’s worked at various startups in southern California. For 2 years he had been the maintainer of cqlengine, the Python object mapper for Cassandra, now integrated into the native Cassandra driver. He’s now a Technical Evangelist at Datastax, continuing to focus on advancing Cassandra in the Python, operations and data science communities. Jon holds a degree in... Read More →


Tuesday October 14, 2014 10:15am - 12:20pm
Track B

10:20am

So You Want To Use Cassandra?
This talk discusses things to consider when considering Cassandra through the purview of a Pearson’s team’s recent Cassandra adoption after coming from a .NET/SQL world. Topics covered include data model design, operationalization of a cluster, and other best practices along with what happens when they aren’t followed.

Speakers
avatar for Codey Whitt

Codey Whitt

Senior Software Engineer, Pearson eCollege
Codey Whitt is a Senior Software Engineer on the GRID Platform Services team with Pearson eCollege. Having helped design, deploy, and operationalize Cassandra clusters in the cloud for use with new Java applications, he experienced the trials and triumphs of implementing Cassandra in an enterprise environment. While being a valuable resource for his Cassandra knowledge, Codey also helped define best practices for cloud application management... Read More →


Tuesday October 14, 2014 10:20am - 11:00am
Track A

11:00am

AntiPattern Jeopardy
Don't put your Cassandra application in "jeopardy'! Come learn from real examples from the field on how NOT to do Cassandra. Prizes might be involved!

Speakers
avatar for Rachel Pedreschi

Rachel Pedreschi

Lead Evangelist, Datastax
Rachel is Lead Evangelist at DataStax. A "Big Data Geek-ette," Rachel is no stranger to the world of high performance databases and data warehouses. She is a Vertica, Informix and Redbrick certified DBA on top of her work with Cassandra and has more than 15 years of business intelligence and ETL tool experience. Rachel has an MBA from San Francisco State University and a BA in Mathematics from University of California, Santa Cruz. She loves... Read More →


Tuesday October 14, 2014 11:00am - 11:40am
Track A

11:30am

Getting Started with Cassandra and Python
We'll be doing a deep dive into working with Cassandra using Python. We'll cover a wide range of tools, starting with the native driver to cqlengine. We'll also explore alternatives to the cqlsh repl to quickly explore our databases.

Speakers
avatar for Jonathan Haddad

Jonathan Haddad

Evangelist for Apache Cassandra, DataStax
Jon has 15 years experience in both development and operations. For 10 years he’s worked at various startups in southern California. For 2 years he had been the maintainer of cqlengine, the Python object mapper for Cassandra, now integrated into the native Cassandra driver. He’s now a Technical Evangelist at Datastax, continuing to focus on advancing Cassandra in the Python, operations and data science communities. Jon holds a degree in... Read More →


Tuesday October 14, 2014 11:30am - 12:20pm
Track B

11:40am

Setting up a DataStax Enterprise Instance on Microsoft Azure.
There are many options for Cloud Providers, but according to the Gartner Magic Quadrant 2014 for IaaS Solutions, Amazon AWS and Microsoft Azure are both leaders and visionaries. DataStax provides instructions for provisioning an Amazon Machine Image. This discussion will provide guidance on setting up a single-node DataStax Enterprise cluster using an Ubuntu 14.04 Server and a Windows Azure Virtual Machine. Using the DataStax Enterprise production installation in text mode, we will install DSE end to end during the presentation.

Speakers
avatar for Joey Filichia

Joey Filichia

Joey Filichia has been writing code since 1982, turning a hobby into a career in 1997. Over the course of his career, Joey has worked as a business intelligence architect, software architect, systems architect, web architect, database architect, technical lead, technical manager, database administrator, database developer, CIO, CTO, Research and Development Lead, teacher and preacher. He has even officiated a wedding or two! Although... Read More →


Tuesday October 14, 2014 11:40am - 12:20pm
Track A

12:20pm

Lunch Break
Tuesday October 14, 2014 12:20pm - 1:00pm
Lunch Hall

1:00pm

Reading Cassandra SSTables Directly for Offline Data Analysis
Here at FullContact we have lots and lots of contact data. In particular we have more than a billion profiles over which we would like to perform ad hoc data analysis. Much of this data resides in Cassandra, and we have many analytics MapReduce jobs that require us to iterate across terabytes of Cassandra data. To solve this problem we've implemented our own splittable input format which allows us to quickly process large SSTables for downstream analytics.

Speakers
avatar for Ben Vanberg

Ben Vanberg

Software Engineer, FullContact
Professional Software Engineer since 1999, and | working on big data solutions for the past 5 years. Currently working | at FullContact where Cassandra is at the center of our ecosystem.


Tuesday October 14, 2014 1:00pm - 1:40pm
Track A

1:00pm

Getting Started with Cassandra and Python (continued)
We'll be doing a deep dive into working with Cassandra using Python. We'll cover a wide range of tools, starting with the native driver to cqlengine. We'll also explore alternatives to the cqlsh repl to quickly explore our databases.

Speakers
avatar for Jonathan Haddad

Jonathan Haddad

Evangelist for Apache Cassandra, DataStax
Jon has 15 years experience in both development and operations. For 10 years he’s worked at various startups in southern California. For 2 years he had been the maintainer of cqlengine, the Python object mapper for Cassandra, now integrated into the native Cassandra driver. He’s now a Technical Evangelist at Datastax, continuing to focus on advancing Cassandra in the Python, operations and data science communities. Jon holds a degree in... Read More →


Tuesday October 14, 2014 1:00pm - 1:40pm
Track B

1:40pm

Feelin' the Flow: Analyzing Data with Spark and Cassandra
In the world of Big Data it's crucial that your data is accessible. Cassandra provides us with a means to reliably store our data, but how can we keep it flowing? That's where Spark steps up to provide a powerful one-two punch with Cassandra to get your data flowing in all the right directions.

Speakers
avatar for Rich Beaudoin

Rich Beaudoin

Senior Software Engineer, Pearson eCollege
Rich Beaudoin is a Sr. Software Engineer at Pearson working on the Data Channels team. When he's not presenting at a local technology Meetup, he is listening to Bob Dylan while hacking on the JVM, Cassandra, and Spark.


Tuesday October 14, 2014 1:40pm - 2:20pm
Track A

1:40pm

Data Modeling for Cassandra 101
This brief course will introduce you to the concepts that are important to creating a data model that is appropriate for your Cassandra database. You will learn how data is physically stored and how that translates to creating the appropriate logical and physical layers to make the most of your system.  

Speakers
avatar for Rachel Pedreschi

Rachel Pedreschi

Lead Evangelist, Datastax
Rachel is Lead Evangelist at DataStax. A "Big Data Geek-ette," Rachel is no stranger to the world of high performance databases and data warehouses. She is a Vertica, Informix and Redbrick certified DBA on top of her work with Cassandra and has more than 15 years of business intelligence and ETL tool experience. Rachel has an MBA from San Francisco State University and a BA in Mathematics from University of California, Santa Cruz. She loves... Read More →


Tuesday October 14, 2014 1:40pm - 3:55pm
Track B

2:20pm

A Cassandra Data Model for Serving up Cat Videos
Keyboard Cat, Nyan Cat, and of course the world famous Grumpy Cat--it seems like the Internet can’t get enough cat videos. If you were building an application to let users share and consume their fill of videos, how would you go about it? In this talk, we’ll take a look at the data model for KillrVideo, a sample video sharing application similar to YouTube where users can share videos, comment, rate them, and more. You’ll learn get a practical introduction to Cassandra data modelling, querying with CQL, how the application drives the data model, and how to shift your thinking from the relational world you probably have experience with.

Speakers
avatar for Luke Tillman

Luke Tillman

Language Evangelist for Apache Cassandra, DataStax
Luke is a Language Evangelist for Apache Cassandra at DataStax, with a focus on the .NET community but a passion for polyglot. He’s spent over 15 years as a software engineer, writing code for web applications both large and small. Prior to DataStax, he worked as a software engineer at Hobsons on systems used by hundreds of colleges and universities across the U.S. and the World.


Tuesday October 14, 2014 2:20pm - 3:00pm
Track A

3:00pm

Transitioning to Cassandra for an Already Giant Product
Data modeling, cluster sizing, and planning can be difficult when transitioning an existing product to Cassandra. Especially when the new Cassandra deployment needs to handle millions of operations per second on day one! In this talk I'll discuss our strategy for data modeling, cluster sizing, and our novel approach to data replication across data centers.

Speakers
avatar for Andrew Kuttig

Andrew Kuttig

Director of Software Engineering, SpotXchange, Inc.
Andrew has been the lead developer in charge of data processing at SpotXchange for the last 7 years. In that time SpotXchange has grown from tens of thousands of ads served per day to hundreds of millions. Today Andrew's team is responsible for processing the data from billions of events per day.


Tuesday October 14, 2014 3:00pm - 3:40pm
Track A

3:40pm

Break
Tuesday October 14, 2014 3:40pm - 3:55pm
Lunch Hall

3:55pm

Using Cassandra to Support Crisis Informatics Research
Crisis Informatics is an area of research that investigates how members of the public make use of social media during times of crisis. The amount of social media data generated by a single event is significant: millions of tweets and status updates accompanied by gigabytes of photos and video. To investigate the types of digital behaviors that occur around these events requires a significant investment in designing, developing, and deploying large-scale software infrastructure for both data collection and analysis. Project EPIC at the University of Colorado has been making use of Cassandra since Spring 2012 to provide a solid foundation for Project EPIC's data collection and analysis activities. Project EPIC has collected terabytes of social media data associated with hundreds of disaster events that must be stored, processed, analyzed, and visualized. This talk will cover how Project EPIC makes use of Cassandra and discuss some of the architectural, modeling, and analysis challenges encountered while developing the Project EPIC software infrastructure.

Speakers
avatar for Ken Anderson

Ken Anderson

Associate Professor, Department of Computer Science, University of Colorado Boulder
Ken Anderson is an Associate Professor of the Department of Computer Science at the University of Colorado Boulder. His research interests include software architecture and the design of scalable, reliable, and robust software infrastructure. Prof. Anderson is Co-Director of Project EPIC and CU’s Center for Software and Society. He recently led the effort to create a BA in Computer Science degree program at CU Boulder, significantly... Read More →


Tuesday October 14, 2014 3:55pm - 4:35pm
Track A

3:55pm

Building Java Applications With Cassandra
So you’re a JVM developer, you understand Cassandra’s architecture, and you’re on your way to knowing its data model well enough to build descriptive data models that perform well. What you need now is to know the Java Driver.

What seems like an inconsequential library that proxies your application’s queries to your Cassandra cluster is actually a sophisticated piece of code that solves a lot of problems for you that early Cassandra developers had to code by hand. Come to this session to see features you might be missing and examples of how to use the Java driver in real applications.

Speakers
avatar for Tim Berglund

Tim Berglund

Global Director of Training, DataStax
| Tim is a teacher, author, and technology leader with DataStax. He is a conference speaker internationally and in the United States, and contributes to the Denver, Colorado tech community as president of the Denver Open Source User Group. He is the co-presenter of various O’Reilly training videos on topics ranging from Git to Mac OS X Productivity Tips to Apache Cassandra, and is the author of Gradle Beyond the Basics. He blogs very... Read More →


Tuesday October 14, 2014 3:55pm - 4:35pm
Track B

4:35pm

Q&A Panel Discussion
Speakers
avatar for Tim Berglund

Tim Berglund

Global Director of Training, DataStax
| Tim is a teacher, author, and technology leader with DataStax. He is a conference speaker internationally and in the United States, and contributes to the Denver, Colorado tech community as president of the Denver Open Source User Group. He is the co-presenter of various O’Reilly training videos on topics ranging from Git to Mac OS X Productivity Tips to Apache Cassandra, and is the author of Gradle Beyond the Basics. He blogs very... Read More →
avatar for Jonathan Haddad

Jonathan Haddad

Evangelist for Apache Cassandra, DataStax
Jon has 15 years experience in both development and operations. For 10 years he’s worked at various startups in southern California. For 2 years he had been the maintainer of cqlengine, the Python object mapper for Cassandra, now integrated into the native Cassandra driver. He’s now a Technical Evangelist at Datastax, continuing to focus on advancing Cassandra in the Python, operations and data science communities. Jon holds a degree in... Read More →
avatar for Rachel Pedreschi

Rachel Pedreschi

Lead Evangelist, Datastax
Rachel is Lead Evangelist at DataStax. A "Big Data Geek-ette," Rachel is no stranger to the world of high performance databases and data warehouses. She is a Vertica, Informix and Redbrick certified DBA on top of her work with Cassandra and has more than 15 years of business intelligence and ETL tool experience. Rachel has an MBA from San Francisco State University and a BA in Mathematics from University of California, Santa Cruz. She loves... Read More →
avatar for Luke Tillman

Luke Tillman

Language Evangelist for Apache Cassandra, DataStax
Luke is a Language Evangelist for Apache Cassandra at DataStax, with a focus on the .NET community but a passion for polyglot. He’s spent over 15 years as a software engineer, writing code for web applications both large and small. Prior to DataStax, he worked as a software engineer at Hobsons on systems used by hundreds of colleges and universities across the U.S. and the World.


Tuesday October 14, 2014 4:35pm - 5:00pm
Track A