Wednesday, September 30 • 14:30 - 15:20
Integrating Fully-Managed Data Streaming Services with Apache Samza - Renato Marroquin, ETH Zurich

Interest in highly scalable stream processing engines has risen recently, and many projects have appeared as a result. Apache Samza is a distributed stream processing framework that uses Apache Kafka for messaging and Apache Hadoop YARN for fault tolerance and resource management. It is one of the most popular stream processing engines and is used by many high-profile companies. Amazon Kinesis, by contrast, is a fully managed service for real-time processing of streaming data that lets users scale the amount of data they ingest without worrying about infrastructure details. This presentation gives a brief introduction to the popular Samza-Kafka integration, then focuses on the new Samza-Kinesis integration and the opportunities it opens up for users.

Renato Marroquin

PhD student, ETH Zurich
PhD student at ETH Zurich working on distributed databases. He holds a Master's in Computer Science from the Pontifical University of Rio de Janeiro and has worked with Apache Pig. Google Summer of Code participant, Apache Gora PMC member and committer, open source and big data enthusiast. Renato has spoken...

Wednesday September 30, 2015 14:30 - 15:20 CEST