# pig

**Repository Path**: shareProject/pig

## Basic Information

- **Project Name**: pig
- **Description**: Apache Pig is a platform for analyzing large data sets. It consists of a high-level language for expressing data analysis programs and the infrastructure for evaluating those programs.
- **Primary Language**: Java
- **License**: Apache-2.0
- **Default Branch**: trunk
- **Homepage**: https://www.oschina.net/p/pig
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 1
- **Created**: 2022-04-13
- **Last Updated**: 2022-04-13

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README

Apache Pig
===========

Pig is a dataflow programming environment for processing very large files.
Pig's language is called Pig Latin. A Pig Latin program consists of a directed
acyclic graph in which each node represents an operation that transforms data.
Operations come in two flavors: (1) relational-algebra style operations such as
join, filter, and project; (2) functional-programming style operators such as
map and reduce.

Pig compiles these dataflow programs into (sequences of) map-reduce or Apache
Tez jobs and executes them using Hadoop. It is also possible to execute Pig
Latin programs in a "local" mode (without a Hadoop cluster), in which case all
processing takes place in a single local JVM.

General Info
===============

For the latest information about Pig, please visit our website at:

   http://pig.apache.org/

and our wiki, at:

   http://wiki.apache.org/pig/

Getting Started
===============

1. To learn about Pig, try http://wiki.apache.org/pig/PigTutorial
2. To build and run Pig, try http://wiki.apache.org/pig/BuildPig and
   http://wiki.apache.org/pig/RunPig
3. To check out the function library, try http://wiki.apache.org/pig/PiggyBank

Contributing to the Project
===========================

We welcome all contributions. For details, please visit
https://cwiki.apache.org/confluence/display/PIG/HowToContribute
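
As a rough illustration of the dataflow model described in the overview (each step transforms the output of the previous one), the sketch below chains a filter step into a group-and-aggregate step. It is not Pig Latin and does not use any Pig API: it is plain Java streams over an in-memory list, and the `DataflowSketch` class, the `Visit` type, and the sample data are hypothetical names invented for this example.

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;

public class DataflowSketch {

    // Hypothetical input row for illustration only; not a Pig type.
    static final class Visit {
        final String user;
        final String url;
        final int seconds;

        Visit(String user, String url, int seconds) {
            this.user = user;
            this.url = url;
            this.seconds = seconds;
        }
    }

    // A two-node dataflow: filter rows, then group by user and sum seconds.
    // A TreeMap keeps the result sorted by user name.
    static Map<String, Integer> totalSecondsPerUser(List<Visit> visits, int minSeconds) {
        return visits.stream()
                // relational-style step: keep rows with at least minSeconds
                .filter(v -> v.seconds >= minSeconds)
                // group by user, then reduce each group to a sum
                .collect(Collectors.groupingBy(
                        v -> v.user,
                        TreeMap::new,
                        Collectors.summingInt(v -> v.seconds)));
    }

    public static void main(String[] args) {
        List<Visit> visits = List.of(
                new Visit("alice", "/a", 30),
                new Visit("bob", "/b", 5),
                new Visit("alice", "/c", 12),
                new Visit("bob", "/a", 40));

        // prints {alice=42, bob=40}
        System.out.println(totalSecondsPerUser(visits, 10));
    }
}
```

Everything here runs in a single JVM, which is loosely analogous to the "local" mode mentioned above. In Pig itself the same shape of program would be written in Pig Latin and compiled into map-reduce or Tez jobs.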