Skip to content

Commit 0d73d42

Browse files
author
mafju
committed
Improving style in the INSTALL file
1 parent a30efe8 commit 0d73d42

File tree

1 file changed

+28
-25
lines changed

1 file changed

+28
-25
lines changed

INSTALL.md

+28-25
Original file line numberDiff line numberDiff line change
@@ -1,26 +1,26 @@
1-
In order to run the application you have to have Cloudera's Hadoop installed. The steps of installation procedure are described below.
1+
In order to run the application, you need to have Cloudera's Hadoop installed. The steps of the installation procedure are given below.
22

33
Hadoop: installation and configuration
44
======================================
5-
Warning: Because of a bug in the Oozie version provided with Cloudera's Hadoop (which is removed in the version available in the source code repository), **you have to have (Oracle) Java JDK 1.6 installed**. This version of Oozie does not work with JDK 1.7.
5+
IMPORTANT: Because of a bug in the Oozie version provided with Cloudera's Hadoop (by the way: this bug is removed in the version of Oozie available in the source code repository), **you need to have Oracle Java JDK 1.6 installed**. Oozie **does not** work with JDK 1.7.
66

77
---
88

9-
Install Cloudera Hadoop CDH4 with MRv1 in accordance with the instructions given in [Cloudera CDH4 intallation guide](https://ccp.cloudera.com/display/CDH4DOC/CDH4+Installation+Guide) , to be more specific:
9+
The instructions below show how to install Cloudera Hadoop CDH4 with MRv1 in accordance with the instructions given in [Cloudera CDH4 intallation guide](https://ccp.cloudera.com/display/CDH4DOC/CDH4+Installation+Guide).
1010

11-
Hadoop can be run in one of three modes:
11+
It is important to know that Hadoop can be run in one of three modes:
1212

1313
- **standalone mode** - runs all of the Hadoop processes in a single JVM which makes it easy to debug the application.
14-
- **pseudo-distributed mode** - runs a full-fledged Hadoop on your local computer
15-
- **distributed mode** - runs on a cluster consisting of many nodes/hosts
14+
- **pseudo-distributed mode** - runs a full-fledged Hadoop on your local computer.
15+
- **distributed mode** - runs the application on a cluster consisting of many nodes/hosts.
1616

17-
Below we will show how to install Hadoop initially in pseudo-distributed mode but we will be able to switch between standalone and pseudo-distributed modes.
17+
Below we will show how to install Hadoop initially in the pseudo-distributed mode but with a possibility to switch between the standalone and the pseudo-distributed mode.
1818

19-
Install Hadoop in pseudo-distributed mode, see [Cloudera CDH4 pseudo distributed mode installation guide](https://ccp.cloudera.com/display/CDH4DOC/Installing+CDH4+on+a+Single+Linux+Node+in+Pseudo-distributed+Mode)
19+
Hadoop: installation
20+
--------------------
21+
Installing Hadoop in pseudo-distributed mode (based on [Cloudera CDH4 pseudo distributed mode installation guide](https://ccp.cloudera.com/display/CDH4DOC/Installing+CDH4+on+a+Single+Linux+Node+in+Pseudo-distributed+Mode)) in case of 64-bit Ubuntu 12.04:
2022

21-
In case of Ubuntu 12.04:
22-
23-
- create new file `/etc/apt/sources.list.d/cloudera.list` with contents:
23+
- create a new file `/etc/apt/sources.list.d/cloudera.list` with contents:
2424

2525
deb http://archive.cloudera.com/cdh4/ubuntu/precise/amd64/cdh precise-cdh4 contrib
2626
deb-src http://archive.cloudera.com/cdh4/ubuntu/precise/amd64/cdh precise-cdh4 contrib
@@ -37,24 +37,27 @@ In case of Ubuntu 12.04:
3737

3838
sudo apt-get install hadoop-0.20-conf-pseudo
3939
40-
- next, follow the steps described in the Cloudera's guide for installing Hadoop in pseudo-distributed mode starting from the step "Step 1: Format the NameNode." This is available at [Cloudera CDH4 pseudo distributed mode installation guide - "Step 1: Format the Namenode"](https://ccp.cloudera.com/display/CDH4DOC/Installing+CDH4+on+a+Single+Linux+Node+in+Pseudo-distributed+Mode#InstallingCDH4onaSingleLinuxNodeinPseudo-distributedMode-Step1%3AFormattheNameNode.).
40+
- next, follow the steps described in the Cloudera's guide to installing Hadoop in the pseudo-distributed mode starting from the step "Step 1: Format the NameNode." This is available at [Cloudera CDH4 pseudo distributed mode installation guide - "Step 1: Format the Namenode"](https://ccp.cloudera.com/display/CDH4DOC/Installing+CDH4+on+a+Single+Linux+Node+in+Pseudo-distributed+Mode#InstallingCDH4onaSingleLinuxNodeinPseudo-distributedMode-Step1%3AFormattheNameNode.).
4141

42-
---
42+
Hadoop: after install
43+
---------------------
4344

44-
You can **switch between standalone and pseudo-distributed configurations** (or others) of Hadoop using the `update-alternatives` command, e.g.:
45+
### Switching between Hadoop modes
46+
When you have Hadoop installed, you can **switch between standalone and pseudo-distributed configurations** (or other kinds of configurations) of Hadoop using the `update-alternatives` command, e.g.:
4547

46-
- `update-alternatives --display hadoop-conf` for list of available configurations and information which is the active one
48+
- `update-alternatives --display hadoop-conf` for list of available configurations and information which one is currently active
4749
- `sudo update-alternatives --set hadoop-conf /etc/hadoop/conf.empty` to set the active configuration to `/etc/hadoop/conf.empty` which corresponds to Hadoop standalone mode.
4850

49-
You can view the web interfaces to the following services using the addresses:
51+
### Web interfaces
52+
You can view the web interfaces to the following services using appropriate addresses:
5053

5154
- **NameNode** - provides a web console for viewing HDFS, number of Data Nodes, and logs - [http://localhost:50070/](http://localhost:50070/)
5255
- In the pseudo-distributed configuration, you should see one live DataNode named "localhost".
53-
- **JobTracker** - allows viewing and running completed and failed jobs with logs - [http://localhost:50030/](http://localhost:50030/)
56+
- **JobTracker** - allows viewing the completed, currently running, and failed jobs along with their logs - [http://localhost:50030/](http://localhost:50030/)
5457

5558
Oozie: installation and configuration
5659
-------------------------------------
57-
Based on [Cloudera CDH4 Oozie installation guide](https://ccp.cloudera.com/display/CDH4DOC/Oozie+Installation#OozieInstallation-ConfiguringOozieinstall)
60+
The description below is based on [Cloudera CDH4 Oozie installation guide](https://ccp.cloudera.com/display/CDH4DOC/Oozie+Installation#OozieInstallation-ConfiguringOozieinstall).
5861

5962
- Install Oozie with
6063

@@ -65,7 +68,7 @@ Based on [Cloudera CDH4 Oozie installation guide](https://ccp.cloudera.com/displ
6568
sudo -u oozie /usr/lib/oozie/bin/ooziedb.sh create -run
6669

6770

68-
- this should result in a similar output:
71+
- this should result an output similar to this one:
6972

7073
Validate DB Connection
7174
DONE
@@ -82,7 +85,7 @@ Based on [Cloudera CDH4 Oozie installation guide](https://ccp.cloudera.com/displ
8285

8386
The SQL commands have been written to: /tmp/ooziedb-8221670220279408806.sql
8487

85-
- Install ExtJS version 2.2 library:
88+
- Install version 2.2 of ExtJS library:
8689
- download the zipped library from [http://extjs.com/deploy/ext-2.2.zip](http://extjs.com/deploy/ext-2.2.zip)
8790
- copy the zip file to `/var/lib/oozie` end extract it there
8891
- Install Oozie ShareLib:
@@ -98,15 +101,15 @@ Based on [Cloudera CDH4 Oozie installation guide](https://ccp.cloudera.com/displ
98101

99102
sudo service oozie start
100103

101-
- Check Status of the server:
102-
- from command-line:
104+
- Check the status of the server:
105+
- From command-line:
103106

104107
oozie admin -oozie http://localhost:11000/oozie -status
105108

106-
This should be printed out:
109+
as a result, should be printed out:
107110

108111
System mode: NORMAL
109112

110-
- Through a webpage - use a web browser to open: [http://localhost:11000/oozie/](http://localhost:11000/oozie/)
113+
- Through a webpage - use a web browser to open a webpage at the following address: [http://localhost:11000/oozie/](http://localhost:11000/oozie/)
111114

112-
If you want to check if Oozie correctly executes workflows, you can run some example workflows as described in [Cloudera Oozie example workflows](http://archive.cloudera.com/cdh4/cdh/4/oozie/DG_Examples.html). Note that contrary to what is written there, the Oozie server is not available at `http://localhost:8080/oozie` but at `http://localhost:11000/oozie` address
115+
If you want to check if Oozie correctly executes its workflows, you can run some of the example workflows provided with Oozie as described in [Cloudera Oozie example workflows](http://archive.cloudera.com/cdh4/cdh/4/oozie/DG_Examples.html). Note that contrary to what is written there, the Oozie server is not available at `http://localhost:8080/oozie` but at `http://localhost:11000/oozie` address.

0 commit comments

Comments
 (0)