GOTO is a vendor independent international software development conference with more that 90 top speaker and 1300 attendees. The conference cover topics such as .Net, Java, Open Source, Agile, Architecture and Design, Web, Cloud, New Languages and Processes

Jean-Daniel Cryans, Software Engineer at Cloudera

Jean-Daniel Cryans

Biography: Jean-Daniel Cryans

Jean-Daniel Cryans works as a software engineer at Cloudera on the Storage team where he spends his days making Apache HBase better. Previous to that, he worked at StumbleUpon where he also worked on HBase while maintaining its production deployment. Jean-Daniel enjoys teaching HBase to new comers and old timers alike in the open source community or by giving presentations at Big Data and Apache Hadoop-related conferences and meetups. He became a committer and PMC member on the project in 2008 when he was still an undergrad student at ETS Montreal. Jean-Daniel now lives in San Francisco with his wife.

Twitter: @jdcryans

Presentation: Apache HBase, the scaling machine

Track: Big Data NoSQL Search / Time: Tuesday 13:20 - 14:10 / Location: Glazen Zaal

Apache HBase is a database that is designed to scale up to thousands of machines and manage billions of rows. It is successfully being used by web companies, banks, medical companies, only to name a few.

The first part of this presentation will be about HBase’s basics. We will start with the data model, which is quite different from the more common relational model, but very close to Google’s Bigtable with tables and families of columns. Then we will investigate the architecture and see what makes it so scalable at both the file system layer with Hadoop and at the database level with HBase.

The second part of the talk will be about use cases. The presenter will showcase the work done at his previous company where he was managing multiple HBase clusters of over a hundred nodes that were used for live web serving and batch processing. The challenges faced included not only doubling the hardware to fit the growing data set but also to accommodate a 5x increase in queries. Use cases from other companies that the presenter interacted with will also be presented in the interest of exploring different workloads and environments.