Back To Schedule
Monday, September 28 • 11:30 - 12:20
Apache Tez - Helping You Build Your Hadoop Big Data Engines - Bikas Saha, Hortonworks

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

YARN has opened up Hadoop to a variety of high performance purpose-built applications specialized for specific domains. Many of these need a common set of capabilities like scheduling, fault tolerance & scalability while not giving up on important aspects like multi-tenancy & security. We will provide an overview of how Apache Tez provides these capabilities via a dataflow based API to model these applications and an extensible orchestration framework for optimal performance. We will cover broad ecosystem adoption by Apache Hive, Pig, Cascading, Scalding, Flink & commercial vendors and provide some experiment results. We will look at the Tez Web UI for progress monitoring and performance debugging tools. Finally, we will look ahead at upcoming Tez features like hybrid execution which enables new types of integration with existing systems.


Bikas Saha

Bikas is an active Apache community member and has contributed to the Apache Hadoop and Tez projects and focuses mainly on the distributed compute stack on Hadoop. He works for Hortonworks, a company that supports an open source based Apache Hadoop distribution. Bikas has spoken widely... Read More →

Monday September 28, 2015 11:30 - 12:20 CEST

Attendees (0)