A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
Fast SHAP value computation for interpreting tree-based models
DynoYARN is a framework to run simulated YARN clusters and workloads for YARN scale testing.
Apache Hive
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization a...
最近更新: 3天前DynoYARN is a framework to run simulated YARN clusters and workloads for YARN scale testing.
最近更新: 3天前The Data Integration Library project provides a library of generic components based on a multi-stage architecture for data ingress and egress.
最近更新: 3天前Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models.
最近更新: 3天前DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks
最近更新: 3天前A mobile interface for linkedin/iris, built for iOS and Android on the Ionic platform
最近更新: 3天前A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Apache Hive,...
最近更新: 3天前