Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • Local Mode: To run the scripts in local mode, no Hadoop or HDFS installation is required. All files are installed and run from your local host and file system.
  • Hadoop Mode: To run the scripts in hadoop (mapreduce) mode, you need access to a Hadoop cluster and HDFS installation.

The Pig tutorial file ([attachment:pigtutorial.tar.gz] or the tutorial/pigtutorial.tar.gz file in the pig distribution) includes the Pig JAR file (pig.jar) and the tutorial files (tutorial.jar, Pigs scripts, log files). These files work with Hadoop 0.18 and provide everything you need to run the Pig scripts. To get started, follow these basic steps:

...

  1. Download the Pig tutorial file to your local directory ([pigtutorial.tar.gz])
  2. Unzip the Pig tutorial file (the files are stored in a newly created directory, pigtmp).
    Code Block
    $ tar -xzf pigtutorial.tar.gz
    
  3. Move to the pigtmp directory.
  4. Review the contents of the Pig tutorial file.
  5. Copy the pig.jar file to the appropriate directory on your system. For example: /home/me/pig.
  6. Create an environment variable, PIGDIR, and point it to your directory. For example: export PIGDIR=/home/me/pig (bash, sh) or setenv PIGDIR /home/me/pig (tcsh, csh).

...