Thomas Edward Johnson
03/20/2017
--------------------------------------------------------------------------------------------------------
Information
--------------------------------------------------------------------------------------------------------
This project is for my Big Data Tech class. Using a VM provided by Cloudera, we are able to simulate a
distributed system running HDFS.
--------------------------------------------------------------------------------------------------------
Command
--------------------------------------------------------------------------------------------------------
hadoop jar partitioner.jar stubs.ProcessLogs weblog <output directory>
The command above runs my Hadoop job. The jar is partitioner.jar. The driver class is ProcessLogs, but
it lives in the package stubs, so it must be called by its package-qualified name, stubs.ProcessLogs.
The input comes from the weblog directory we uploaded earlier in the semester; the name of the output
directory is up to you.
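
As a rough sketch, the command above can be wrapped in a small script with the output directory filled
in. The name weblog_out is only an assumed example, not something the project requires. The script also
assumes the usual Hadoop behavior where a job fails if its output directory already exists, so it checks
for that first. If no hadoop client is found on the machine, it just prints the command it would run.

```shell
#!/bin/sh
# Assumed example name; any output directory name that does not exist yet should work.
OUTPUT_DIR="weblog_out"

# The command from this README with <output directory> filled in.
JOB_CMD="hadoop jar partitioner.jar stubs.ProcessLogs weblog $OUTPUT_DIR"

if ! command -v hadoop >/dev/null 2>&1; then
    # No Hadoop client available here: only show what would be run.
    echo "$JOB_CMD"
elif hadoop fs -test -d "$OUTPUT_DIR"; then
    # Hadoop jobs normally refuse to write into an existing output directory.
    echo "Output directory '$OUTPUT_DIR' already exists; pick another name." >&2
else
    # Run the job, then list the files it wrote.
    $JOB_CMD && hadoop fs -ls "$OUTPUT_DIR"
fi
```

To use a different output directory, change OUTPUT_DIR at the top and run the script again.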
--------------------------------------------------------------------------------------------------------
______
________________// \\
/ / ^ \\
/ | \\ __
||| \____/_\ \_/ /
||| \\\____//
|/| /__________\ |\\_
^ | | \ | \_/
| | | |
|_/\/| |_/\/|