Big Data - Apache Pig

Back to Course

Lesson Description

Lession - #459 Apache pig Home

Apache Pig is an high level platform for creating programs that run on Apache Hadoop. The language for this platform is called Pig Latin.

What is Apache Pig utilized for?

Apache Pig is a abstration over MapReduce. It is an tool/platform which is utilized to analyze larger sets of data representing them as data streams. Pig is for the most part utilized with Hadoop; we can perform all of the data control activities in Hadoop utilizing Apache Pig.

Apache Pig Features

I. Rich set of operators.
ii. Simplicity of programming.
iii. Advancement potential opportunities.
iv. Extensibility.
v. Udf's(User Defined Functions>
vi. Handles a wide range of information.
vii. Join operation
viii. Multi-query approach.