A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
Fast SHAP value computation for interpreting tree-based models
Apache Hive
Multithreaded, gzip-compatible compression and decompression, available as a platform-independent Java library and command-line utilities.
Xinfra Monitor monitors the availability of Kafka clusters by producing synthetic workloads using end-to-end pipelines to obtain derived vital stat...
Framework for defining machine learning models, including feature generation and transformations, as directed acyclic graphs (DAGs).
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.