Loading…

Sign up or log in to bookmark your favorites and sync them to your phone or calendar.

NoSQL [clear filter]
Monday, September 28
 

10:30

HBase: State of the Database - Nick Dimiduk, Hortonworks
HBase is a mature, low-latency, distributed "big data" store. It is used in production by companies large and small, in all manner of industries. There is a vibrant and active developer and user community supporting HBase, which means it's constantly improving, adapting to user need and challenging deployments. In this talk, Nick provides an update of the latest happenings in core HBase, recent and pending releases.

Speakers
avatar for Nick Dimiduk

Nick Dimiduk

Hortonworks
Nick Dimiduk is a committer and PMC member on both Apache HBase and Apache Phoenix. He's Release Manager for the HBase 1.1 branch and an author of the book HBase in Action, on Manning Press. Nick has also contributed to a number of Apache projects around HBase, including, HTrace... Read More →



Monday September 28, 2015 10:30 - 11:20
Huba

14:00

Spark/Cassandra Integration, Theory and Practice - DuyHai DOAN, Datastax
Apache Spark is a general data processing framework which allows you perform map-reduce tasks (but not only) in memory. Apache Cassandra is a highly available and massively scalable NoSQL data-store.

By combining Spark flexible API and Cassandra performance, we get an interesting alternative to the Hadoop eco-system for both real-time and batch processing.

During this talk we will highlight the tight integration between Spark & Cassandra and demonstrate some usages with live code demo using Apache Zeppelin.

Speakers
avatar for DuyHai Doan

DuyHai Doan

Technical Advocate, Datastax
DuyHai DOAN is an Apache Cassandra Evangelist at DataStax and committer for Apache Zeppelin. He spends his time between technical presentations/meetups on Cassandra, coding on open source projects like Achilles or Apache Zeppelin to support the community and helping all companies... Read More →


Monday September 28, 2015 14:00 - 14:50
Huba

15:00

A Tale of Two Graphs: Property Graphs and RDF - Andy Seaborne and Paolo Castagna, Cloudera
Property Graphs and the Resource Description Framework (RDF) are both graph data models. Property Graphs originated with data practitioners while RDF was developed at W3C as an information model for the web. Both graph data models are "schema-neutral", there is no rigid organization of data up-front. Applications decide which part of the data graph to use and how to view it. New data and new applications can be introduced at any time without disturbing existing usage.

In this talk we will introduce both data models and look at some uses to show where they are (and are not) used. We will look at contrasting features of the data models by looking at use cases.

We will then look at two systems, Apache Spark/GraphX, for property graphs, and Apache Jena, for RDF databases, and how they deal with graph structure and how they can scale to big data.

Speakers
avatar for Paolo Castagna

Paolo Castagna

Systems Engineer, Cloudera
Paolo works as a Systems Engineer at Cloudera (EMEA). Before joining Cloudera he worked at HP Labs. Paolo is a PMC member and committer of the Apache Jena project (http://jena.apache.org/), 'addicted' to data (mostly RDF or graph shaped datasets).
avatar for Andy Seaborne

Andy Seaborne

Andy works on infrastructure for linked data graph systems. He was lead editor for SRARQL, the RDF Query Language. Andy developed the ARQ query engine which is released as part of Apache Jena where he is a committer. He has spoken at conferences and at developer events about linked... Read More →


Monday September 28, 2015 15:00 - 15:50
Huba

16:00

S2Graph : A Large-Scale Graph Database with HBase - Doyung Yoon, Daumkakao
As dominant social network service provider, Daumkakao confronted several technical challenges to store and traverse large graph data.
First, our social network has 10 billion edges and 200 million vertices and users create 1 billion new edges everyday by interacting with our services so our system needed to be distributed and scalable.
Second, our system needed to provide low-latency and high concurrency to meet our quality of service.
Third, for viral effect, user`s activities should be delivered to right place at right time in real time so simple result cache can`t be used.
Lastly, there are about a hundred of services that daumkakao is operating and our system needed to provide common way to store and traverse data for synergy between services.
S2Graph successfully solved these technical challenges, so we'd like to introduce the methodology and architecture we used.

Speakers
avatar for Daewon Jeong

Daewon Jeong

Programmer, kakao
Works on S2Graph team
avatar for Doyung Yoon

Doyung Yoon

Software Engineer, Kakao
Doyung works in a distributed graph database team at Kakao as software engineer, where his focus is on performance and usability. He developed Apache S2Graph, an open-source distributed graph database, and has previously presented it at ApacheCon BigData Europe and ApacheCon BigData... Read More →



Monday September 28, 2015 16:00 - 16:50
Huba
 
Tuesday, September 29
 

14:00

CouchDB 2.0: The Awkward Bits - Mike Wallace, IBM
Apache CouchDB 2.0 will bring Dynamo-style clustering to CouchDB, allowing data and applications to scale out over hundreds of nodes for increased throughput and storage. While operation of a single-node CouchDB instance will remain largely unaffected, the addition of clustering and sharding introduces some new annoyances which have the potential to complicate operational life. In this talk, Mike will identify the parts of clustered CouchDB 2.0 that could be considered to be particularly awkward from an operations perspective so that we can be better prepared when things start to get real.

Speakers
MW

Mike Wallace

Software Engineer and Systems Operator, IBM
Mike Wallace is a software engineer and systems operator with a particular interest in distributed systems and the many and varied ways they can fail. He has been an engineer at IBM (formerly Cloudant) for the last two years as both a developer and operator of their globally distributed... Read More →


Tuesday September 29, 2015 14:00 - 14:50
Krudy/Jokai

15:00

Decentralized Document Delivery - Benjamin Young, The Hypothesis Project
Apache CouchDB is a document-centric database. It also replicates. It can make exact copies of a database and keep them in sync--even as the network comes and goes. Both (or more) databases can be actively written too and changes synchronized across them. There is no center.

Add PouchDB, a CouchDB compatible database that lives inside the browser or node.js, and you have an architecture ready to survive the fickleness of technology, businesses, and other regimes. The end result is data where you need it.

We'll take a look at architecting for a decentralized future built of documents delivered democratically across all the divides.

Speakers
avatar for Benjamin Young

Benjamin Young

Strategic Architect, John Wiley & Sons
Benjamin Young is a Strategic Architect at John Wiley & Sons, Inc. His work for Wiley includes collaborating to build out internal policy and standards for open source, APIs, and privacy enhancing technology. Benjamin also works extensively with standards organizations such as the... Read More →



Tuesday September 29, 2015 15:00 - 15:50
Krudy/Jokai

16:00

Hands-On with Apache CouchDB 2.0 - Mike Wallace, IBM; Michelle Phung, IBM; Glynn Bird, IBM
This is a hands-on introduction to Apache CouchDB. We'll tour the user-facing API and Fauxton dashboard while looking at the concepts behind it. You'll learn what it means to build an application on top of CouchDB and make it shine within an hour's time.

You will learn basic data storage and retrieval, data-design, querying, replication and various neat features on the edges of CouchDB. If you are coming from the relational world, this talk will help you understand how to "think in CouchDB".

Speakers
GB

Glynn Bird

IBM
Before joining IBM Cloud Data Services, Glynn served as the Head of IT and Development for Central Index, creating a white-label frontend for a NoSQL business directory (using PHP, Node.js, MySQL, Redis, Cloudant, and Redshift). His experience includes writing CRM systems, "find my... Read More →
avatar for Michelle Phung

Michelle Phung

Software Engineer, IBM, Cloudant
Apache CouchDB Contributes to Admin Console (Project Fauxton) I'm from SF!
MW

Mike Wallace

Software Engineer and Systems Operator, IBM
Mike Wallace is a software engineer and systems operator with a particular interest in distributed systems and the many and varied ways they can fail. He has been an engineer at IBM (formerly Cloudant) for the last two years as both a developer and operator of their globally distributed... Read More →


Tuesday September 29, 2015 16:00 - 16:50
Krudy/Jokai
 
Wednesday, September 30
 

10:00

Upholstering Apache CouchDB - Benjamin Young, The Hypothesis Project
Apache CouchDB does two things other databases don't: it replicates and it speaks HTTP as it's primary protocol. These unique qualities all you to build applications that are "of the Web" but that can also move "off the Web"--into your local network, your farm equipment, or the phone you're carrying.

In this tutorial we'll take a look at CouchApps. CouchApps are application logic (index definitions, document and result templates, validation functions) that live inside a CouchDB database along with the static HTML, JS, CSS, and images needed for the UI.

In this tutorial we'll take a look at various tools for building, integrating, and deploying CouchApps. We'll take a deep dive into building a CouchApp: both the thought process and the code. Near the end, we'll throw a replication party--moving the app between attendee devices sharing the app and it's accumulated data.

Speakers
avatar for Benjamin Young

Benjamin Young

Strategic Architect, John Wiley & Sons
Benjamin Young is a Strategic Architect at John Wiley & Sons, Inc. His work for Wiley includes collaborating to build out internal policy and standards for open source, APIs, and privacy enhancing technology. Benjamin also works extensively with standards organizations such as the... Read More →


Wednesday September 30, 2015 10:00 - 10:50
Huba

11:00

HBase and Drill: How Loosely Typed SQL is Ideal for NoSQL - Tugdual Grall, MapR Technologies
The Apache HBase approach to data has a huge potential for expressing NoSQL-y, non-relational programs. Apache Drill supports SQL for non-relational data. Paradoxically, combining this NoSQL with this SQL tool results in something even better.

Using concrete examples such as Time Series and Music Database applications, I will show how and why you should combine HBase and Drill to create highly scalable and available applications exposing NoSQL data to any SQL compliant tool.

Speakers
avatar for Tugdual Grall

Tugdual Grall

Technical Evangelist, MapR
Tugdual Grall Bio: Tugdual Grall, est Chief Technical Evangelist EMEA chez MapR. Il travaille avec les clients et les communautés de développeurs européennes, pour faciliter l’adoption de MapR, Hadoop et NoSQL. Avant de travailler chez MapR, “Tug”, était Technical Evangelist... Read More →


Wednesday September 30, 2015 11:00 - 11:50
Huba