Wildcard
Monday, September 28
 

14:00

Overcoming the Many-to-Many Data Mapping Mess With Apache Streams - Steve Blackmon, People Pattern
These days we have the tools and resources to collect and wrangle data at unprecedented scale, yet we remain plagued by compatibility gaps and semantic nuances with every new source we invite into our domain. Despite decades of effort by well-meaning people, data integration remains a many-to-many problem.

Apache Streams (incubating) is an open-source real-time reference implementation for the Activity Streams specification. Streams contains libraries and patterns for specifying, publishing, and inter-linking schemas, and assists with conversion of activities and objects between the representation, format, and encoding preferred by supported data providers, processors, and indexes.

In this talk I will explain what Streams does, how it works (more or less), and how it can be used to compile a real-time, multi-network, polyglot content repository of profiles, posts, etc.

Speakers
Steve Blackmon

VP Technology, People Pattern, Inc.
VP Technology at People Pattern, previously Director of Data Science at W2O Group, co-founder of Ravel, stints at Boeing, Lockheed Martin, and Accenture. Committer and PMC for Apache Streams (incubating). Experienced user of Spark, Storm, Hadoop, Pig, Hive, Nutch, Cassandra, Tinkerpop...


Monday September 28, 2015 14:00 - 14:50
Tas

15:00

Unified Access to All Your Data Points With Apache MetaModel - Kasper Sørensen, Human Inference
The wave of Big Data has overwhelming potential, but it has also revealed an overwhelming challenge: the need to combine multiple sources. The variety of data representations is growing immensely, just as the amount of data is; you might well be ingesting from sources as varied as relational databases, NoSQL stores, Hadoop, XML/JSON/CSV files, cloud/SaaS systems, and search indexes. In this presentation, Kasper Sørensen will introduce Apache MetaModel. The project puts metadata first, and querying is built on that concept too. Apache MetaModel provides a uniform view of data from many sources and, just as importantly, enables Data Federation and Data Integration patterns that adapt automatically to the metadata available in the source. The talk will be practically oriented, showing running code with MetaModel and examples of production usage in multiple business cases.

Speakers
Kasper Sørensen

Principal Tech Lead, Human Inference / Neopost
Kasper Sørensen is PMC of Apache MetaModel and Principal Tech Lead of Human Inference, a Neopost company. Having founded several open source projects, including Apache MetaModel and DataCleaner, he is passionate about building and sharing products for the Data Quality, Big Data and...


Monday September 28, 2015 15:00 - 15:50
Tas