Home » Pig Latin Concepts

Pig Latin

The Pig Latin is a data flow language used by Apache Pig to analyze the data in Hadoop. It is a textual language that abstracts the programming from the Java MapReduce idiom into a notation.

Pig Latin Statements

The Pig Latin statements are used to process the data. It is an operator that accepts a relation as an input and generates another relation as an output.

  • It can span multiple lines.
  • Each statement must end with a semi-colon.
  • It may include expression and schemas.
  • By default, these statements are processed using multi-query execution

Pig Latin Conventions

Convention Description
( ) The parenthesis can enclose one or more items. It can also be used to indicate the tuple data type.
Example – (10, xyz, (3,6,9))
[ ] The straight brackets can enclose one or more items. It can also be used to indicate the map data type.
Example – [INNER | OUTER]
{ } The curly brackets enclose two or more items. It can also be used to indicate the bag data type
Example – { block | nested_block }
… The horizontal ellipsis points indicate that you can repeat a portion of the code.
Example – cat path [path …]

Latin Data Types

Simple Data Types

Type Description
int It defines the signed 32-bit integer.
Example – 2
long It defines the signed 64-bit integer.
Example – 2L or 2l
float It defines 32-bit floating point number.
Example – 2.5F or 2.5f or 2.5e2f or 2.5.E2F
double It defines 64-bit floating point number.
Example – 2.5 or 2.5 or 2.5e2f or 2.5.E2F
chararray It defines character array in Unicode UTF-8 format.
Example – tutoraspire
bytearray It defines the byte array.
boolean It defines the boolean type values.
Example – true/false
datetime It defines the values in datetime order.
Example – 1970-01- 01T00:00:00.000+00:00
biginteger It defines Java BigInteger values.
Example – 5000000000000
bigdecimal It defines Java BigDecimal values.
Example – 52.232344535345

Complex Types

Type Description
tuple It defines an ordered set of fields.
Example – (15,12)
bag It defines a collection of tuples.
Example – {(15,12), (12,15)}
map It defines a set of key-value pairs.
Example – [open#apache]
Next TopicPig Data Types

You may also like