Apache Oozie Essentials

Oozie concepts

Before we move further, let's look at a few basic concepts of Oozie. In each chapter, we will learn some new Oozie concepts alongside the working examples.

Workflows

A Workflow tells Oozie what to do. It is a DAG (directed acyclic graph): a collection of actions arranged in the required dependency order. As part of a Workflow's definition, we write some actions and call them in a certain order.

Actions come in various types, one for each kind of task that a Workflow can perform, for example, Fs (Hadoop filesystem) action, Pig action, Hive action, MapReduce action, Spark action, and so on. We will discuss the Fs action in this chapter.
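To make the idea of actions called in a certain order concrete, here is a minimal sketch of a Workflow definition with a single Fs action. The application name, the node names, the directory path, and the nameNode property are illustrative assumptions and are not taken from this book's examples.

```xml
<!-- Minimal sketch of a Workflow: one Fs action between start and end. -->
<!-- The names, the path, and the ${nameNode} property are hypothetical. -->
<workflow-app xmlns="uri:oozie:workflow:0.5" name="fs-demo-wf">
    <!-- The start node names the first action to run -->
    <start to="make-dir"/>

    <!-- An Fs action that creates a directory on HDFS -->
    <action name="make-dir">
        <fs>
            <mkdir path="${nameNode}/tmp/oozie-demo/output"/>
        </fs>
        <!-- The transitions define the edges of the DAG -->
        <ok to="end"/>
        <error to="fail"/>
    </action>

    <!-- Reached only if the action fails -->
    <kill name="fail">
        <message>Fs action failed: [${wf:errorMessage(wf:lastErrorNode())}]</message>
    </kill>

    <end name="end"/>
</workflow-app>
```

Each action declares where to go on success (ok) and on failure (error), and these transitions are what arrange the actions into a dependency graph.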

Coordinator

A Coordinator tells Oozie when to do a task. The "when" in the Oozie world is decided either by time or by the availability of a given input dataset. We will discuss Coordinators later in this book.
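As a rough sketch of both kinds of trigger, the following Coordinator definition runs a Workflow once a day, and only after that day's input directory is available. The names, dates, HDFS paths, and the nameNode and workflowAppPath properties are assumptions made up for illustration.

```xml
<!-- Minimal sketch of a Coordinator: a daily time trigger plus a data trigger. -->
<!-- All names, dates, paths, and ${...} job properties are hypothetical. -->
<coordinator-app name="daily-demo-coord" frequency="${coord:days(1)}"
                 start="2015-01-01T00:00Z" end="2015-12-31T00:00Z"
                 timezone="UTC" xmlns="uri:oozie:coordinator:0.4">
    <datasets>
        <!-- One dataset instance (directory) is expected per day -->
        <dataset name="logs" frequency="${coord:days(1)}"
                 initial-instance="2015-01-01T00:00Z" timezone="UTC">
            <uri-template>${nameNode}/data/logs/${YEAR}/${MONTH}/${DAY}</uri-template>
        </dataset>
    </datasets>
    <input-events>
        <!-- Wait until the current day's instance of the dataset exists -->
        <data-in name="input" dataset="logs">
            <instance>${coord:current(0)}</instance>
        </data-in>
    </input-events>
    <action>
        <workflow>
            <!-- HDFS path of the Workflow application to run -->
            <app-path>${workflowAppPath}</app-path>
        </workflow>
    </action>
</coordinator-app>
```

Removing the datasets and input-events elements would leave a purely time-based Coordinator that starts the Workflow every day regardless of input data.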

Bundles

Bundles tell Oozie which things to do together as a group. For example, a set of Coordinators that can be run together to satisfy a given business requirement can be combined into a Bundle.
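A Bundle definition is essentially a list of Coordinators. The sketch below groups two of them; the Bundle name, the Coordinator names, and the ingestCoordPath and reportCoordPath properties are hypothetical and would be supplied by the job's properties.

```xml
<!-- Minimal sketch of a Bundle that groups two Coordinators. -->
<!-- The names and the ${...} path properties are hypothetical. -->
<bundle-app name="demo-bundle" xmlns="uri:oozie:bundle:0.2">
    <coordinator name="ingest-coord">
        <!-- HDFS path of the first Coordinator application -->
        <app-path>${ingestCoordPath}</app-path>
    </coordinator>
    <coordinator name="report-coord">
        <!-- HDFS path of the second Coordinator application -->
        <app-path>${reportCoordPath}</app-path>
    </coordinator>
</bundle-app>
```

Submitting the Bundle lets the two Coordinators be started, suspended, or killed together as one unit.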