The Oozie workflow is written in [[XML]] using Hadoop Process Definition language

== Editors ==
* see http://gethue.com/new-apache-oozie-workflow-coordinator-bundle-editors/

== Sources ==
* [[Hadoop: The Definitive Guide (book)]]

[[Category:Hadoop]]
[[Category:Workflow Management]]
[[Category:ETL]]
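Apache Oozie is a workflow manager designed especially for running Hadoop MapReduce jobs.
It contains 2 parts:
* workflow engine: stores and runs workflows composed of Hadoop jobs
* coordinator engine: runs workflow jobs based on predefined schedules and data availability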
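It's a service: unlike JobControl, which runs on the client machine that submits the jobs, Oozie runs in the cluster, so the client doesn't submit the tasks itself.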
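A workflow is a DAG of action nodes and control-flow nodes:
* Action nodes: perform a workflow task, such as running a MapReduce job or moving files in HDFS
* Control-flow nodes: govern execution between actions, e.g. conditional logic or parallel execution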
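
The DAG structure can be illustrated with a small Python sketch. This is not Oozie code: the node names, the `WORKFLOW` dictionary and the helper functions are all hypothetical, chosen only to show how action nodes and control-flow nodes form a graph that can be ordered when it has no cycles.

```python
from graphlib import CycleError, TopologicalSorter

# Hypothetical workflow: node name -> (kind, names of successor nodes).
WORKFLOW = {
    "start": ("control", ["fork"]),
    "fork": ("control", ["import-logs", "import-users"]),
    "import-logs": ("action", ["join"]),
    "import-users": ("action", ["join"]),
    "join": ("control", ["aggregate"]),
    "aggregate": ("action", ["end"]),
    "end": ("control", []),
}


def execution_order(workflow):
    """Return one valid node order; raises CycleError if there is a cycle."""
    # TopologicalSorter expects node -> predecessors, so invert the edges.
    predecessors = {name: set() for name in workflow}
    for name, (_kind, successors) in workflow.items():
        for nxt in successors:
            predecessors[nxt].add(name)
    return list(TopologicalSorter(predecessors).static_order())


def is_dag(workflow):
    """True if the workflow graph contains no cycles."""
    try:
        execution_order(workflow)
        return True
    except CycleError:
        return False


def action_nodes(workflow):
    """Names of the nodes that perform work (as opposed to control flow)."""
    return [name for name, (kind, _succ) in workflow.items() if kind == "action"]


order = execution_order(WORKFLOW)
print(order)
print(action_nodes(WORKFLOW))
```

The two import actions sit between a fork and a join, so either may come first in the printed order; only the constraints implied by the edges are guaranteed.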
Oozie workflows are written in XML using the Hadoop Process Definition Language (hPDL).
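
To make the XML form concrete, here is a rough Python sketch that parses a trimmed workflow definition and lists its transitions. The workflow name, the action name `word-count`, the kill message and the `transitions` helper are made up for illustration, and the definition is deliberately incomplete (a real MapReduce action also needs job configuration), so treat it as a sketch rather than a deployable workflow.

```python
import xml.etree.ElementTree as ET

# Namespace used by the workflow elements in this example.
NS = {"wf": "uri:oozie:workflow:0.1"}

# Trimmed, illustrative workflow definition (names are hypothetical).
WORKFLOW_XML = """\
<workflow-app xmlns="uri:oozie:workflow:0.1" name="example-wf">
  <start to="word-count"/>
  <action name="word-count">
    <map-reduce>
      <job-tracker>${jobTracker}</job-tracker>
      <name-node>${nameNode}</name-node>
    </map-reduce>
    <ok to="end"/>
    <error to="fail"/>
  </action>
  <kill name="fail">
    <message>MapReduce failed</message>
  </kill>
  <end name="end"/>
</workflow-app>
"""


def transitions(xml_text):
    """Return (from, to) pairs for the start node and every action node."""
    root = ET.fromstring(xml_text)
    edges = [("start", root.find("wf:start", NS).get("to"))]
    for action in root.findall("wf:action", NS):
        name = action.get("name")
        # Each action names where to go on success and on failure.
        edges.append((name, action.find("wf:ok", NS).get("to")))
        edges.append((name, action.find("wf:error", NS).get("to")))
    return edges


edges = transitions(WORKFLOW_XML)
for src, dst in edges:
    print(f"{src} -> {dst}")
```

Here `start`, `kill` and `end` are control-flow nodes, while `word-count` is the only action node; its `ok` and `error` elements are the outgoing edges of the graph.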
Hue has an Oozie workflow editor, so it is possible to design workflows in a web UI instead of writing the XML by hand.