A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
Apache Gobblin is a highly scalable data management solution for structured and byte-oriented data in heterogeneous data ecosystems.
If building the distribution with tests turned on:
./gradlew rat. Report will be generated under build/rat/rat-report.html
./gradlew build -x findbugsMain -x test -x rat -x checkstyleMainThe distribution will be created in build/gobblin-distribution/distributions directory. (or)
./gradlew buildThe distribution will be created in build/gobblin-distribution/distributions directory.