# spring-hadoop **Repository Path**: xstudio/spring-hadoop ## Basic Information - **Project Name**: spring-hadoop - **Description**: Spring for Apache Hadoop is a framework for application developers to take advantage of the features of both Hadoop and Spring. - **Primary Language**: Java - **License**: Not specified - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 1 - **Created**: 2015-11-10 - **Last Updated**: 2020-12-19 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README The [Spring for Apache Hadoop](http://projects.spring.io/spring-hadoop/) project provides extensions to [Spring](http://projects.spring.io/spring-framework/), [Spring Batch](http://projects.spring.io/spring-batch/), and [Spring Integration](http://projects.spring.io/spring-integration/) to build manageable and robust pipeline solutions around Hadoop. Spring for Apache Hadoop extends Spring Batch by providing support for reading from and writing to HDFS, running various types of Hadoop jobs (Java MapReduce, Streaming, [Hive](http://hive.apache.org), [Pig](http://pig.apache.org)) and [HBase](http://hbase.apache.org). An important goal is to provide excellent support for non-Java based developers to be productive using Spring Hadoop and not have to write any Java code to use the core feature set. Spring for Apache Hadoop also applies the familiar Spring programming model to Java MapReduce jobs by providing support for dependency injection of simple jobs as well as a POJO based MapReduce programming model that decouples your MapReduce classes from Hadoop specific details such as base classes and data types. # Docs You can find out more details from the [user documentation](http://docs.spring.io/spring-hadoop/docs/current/reference/html/) or by browsing the [javadocs](http://docs.spring.io/spring-hadoop/docs/current/api/). If you have ideas about how to improve or extend the scope, please feel free to contribute. # Artifacts For build dependencies to use in your own projects see our [Quick Start](http://projects.spring.io/spring-hadoop/#quick-start) page. # Building Spring for Apache Hadoop uses Gradle as its build system. To build the system simply run: gradlew from the project root folder. This will compile the sources, run the tests and create the artifacts. Note that the tests by default tries to access a localhost single-node Hadoop cluster. ## Supported distros By default Spring for Apache Hadoop compiles against the Apache Hadoop 2.7.x stable relase (hadoop27). The following distros and versions are supported: - Apache Hadoop 2.7.x (hadoop27) default - Apache Hadoop 2.6.x (hadoop26) - Pivotal HD 3.0 (phd30) - Pivotal HD 2.1 (phd21) - Cloudera CDH5 (cdh5) - Hortonworks HDP 2.2 (hdp22) For anyone using older distros and versions we recommend using either one of these: - Spring for Apache Hadoop 2.0.x which is maintained here: https://github.com/spring-projects/spring-hadoop/tree/2.2.x - Spring for Apache Hadoop 2.1.x which is maintained here: https://github.com/spring-projects/spring-hadoop/tree/2.1.x - Spring for Apache Hadoop 2.0.x which is maintained here: https://github.com/spring-projects/spring-hadoop/tree/2.0.x - Spring for Apache Hadoop 1.1.x which is maintained here: https://github.com/spring-projects/spring-hadoop/tree/1.1.x To compile against a specific distro version pass the `-Pdistro=