Oozie is an open-source workflow / coordination service to manage data processing jobs for Apache Hadoop™. It is an extensible, scalable and data-aware service to orchestrate dependencies between jobs running on Hadoop (including HDFS, Pig and MapReduce).
In this talk, we will introduce oozie and share experience in Yahoo!