# EasyMapReduce **Repository Path**: fanx-dev/EasyMapReduce ## Basic Information - **Project Name**: EasyMapReduce - **Description**: 基于Hadoop的大数据分析工具 - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 1 - **Forks**: 0 - **Created**: 2020-10-04 - **Last Updated**: 2024-12-23 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # EasyMapReduce Write Hadoop MapReduce in high-level API. Inspired by Apache Spark # Word Count ``` Void main() { init ds := c.load(input).map{ it.split(' ').join("\n") }.reduce(0)|Int r,v|{ r + 1 } ds.save(output) } ``` #### Run ``` cd demo fan wordCount.fan ``` #### Config set 'HADOOP_HOME' env vars #### Debug Running in a local simulation env. ``` fan wordCount.fan -debug ``` #### How it works Show more detail by '-debug_show' args. ``` fan wordCount.fan -debug_show ``` #### Passing Hadoop Args ``` fan wordCount.fan -hadoop '-files ./dict,./data' ``` More [Hadoop Streaming Args](https://hadoop.apache.org/docs/r1.2.1/streaming.html)