Monday, September 28 • 14:00 - 14:50
Overcoming the Many-to-Many Data Mapping Mess With Apache Streams - Steve Blackmon, People Pattern

These days we have the tools and resources to collect and wrangle data at unprecedented scale, yet we remain plagued by compatibility gaps and semantic nuances with every new source we invite into our domain. Despite decades of effort by well-meaning folks, data integration remains a many-to-many problem.

Apache Streams (incubating) is an open-source real-time reference implementation for the Activity Streams specification. Streams contains libraries and patterns for specifying, publishing, and inter-linking schemas, and assists with conversion of activities and objects between the representation, format, and encoding preferred by supported data providers, processors, and indexes.
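As a rough illustration of the conversion idea described above, the following minimal Python sketch maps two made-up provider payloads onto a shared, Activity Streams 1.0-style shape (actor, verb, object, published, provider). It does not use the Apache Streams API; the provider names, payload fields, and function names are all hypothetical and chosen only for this example.

```python
from datetime import datetime, timezone

# Hypothetical provider payloads: the field names below are invented for
# illustration and do not correspond to any real network's API.

def from_microblog(post):
    """Map a made-up microblog post onto Activity Streams 1.0-style fields."""
    published = datetime.fromtimestamp(post["created_ts"], tz=timezone.utc)
    return {
        "id": f"microblog:{post['id']}",
        "verb": "post",
        "published": published.isoformat(),
        "actor": {"objectType": "person", "displayName": post["user"]["handle"]},
        "object": {"objectType": "note", "content": post["text"]},
        "provider": {"displayName": "microblog"},
    }

def from_photo_site(item):
    """Map a made-up photo-sharing item onto the same shared shape."""
    return {
        "id": f"photosite:{item['photo_id']}",
        "verb": "post",
        "published": item["uploaded_at"],  # assumed to be ISO 8601 already
        "actor": {"objectType": "person", "displayName": item["owner_name"]},
        "object": {
            "objectType": "image",
            "url": item["image_url"],
            "content": item.get("caption", ""),
        },
        "provider": {"displayName": "photosite"},
    }

# One converter per source; every consumer reads only the shared shape.
CONVERTERS = {"microblog": from_microblog, "photosite": from_photo_site}

def normalize(source, payload):
    """Convert a provider-specific payload into the shared activity shape."""
    return CONVERTERS[source](payload)
```

With a shared intermediate shape like this, each new source needs one converter rather than one per downstream processor or index, which is one way to read the many-to-many problem the abstract describes.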

In this talk I will explain what Streams does, how it works (more or less), and how it can be used to compile a real-time, multi-network, polyglot content repository of profiles, posts, etc.

Steve Blackmon

VP Technology, People Pattern, Inc.
VP Technology at People Pattern, previously Director of Data Science at W2O Group, co-founder of Ravel, with stints at Boeing, Lockheed Martin, and Accenture. Committer and PMC member for Apache Streams (incubating). Experienced user of Spark, Storm, Hadoop, Pig, Hive, Nutch, Cassandra, Tinkerpop...

Monday September 28, 2015 14:00 - 14:50 CEST
