A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
Fast SHAP value computation for interpreting tree-based models
Apache Hive
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization a...
Last updated: 2 days ago
Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models.
Last updated: 2 days ago
DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks
Last updated: 2 days ago
shiv is a command line utility for building fully self-contained Python zipapps as outlined in PEP 441, but with all their dependencies included.
Last updated: 2 days ago
A plugin-oriented tool for automating the investigation of broken hosts and services.
Last updated: 2 days ago
Gobblin is a distributed big data integration framework (ingestion, replication, compliance, retention) for batch and streaming systems. Gobblin f...
Last updated: 2 days ago
A Python library for building nginx configuration files programmatically
Last updated: 2 days ago
Bluepill is a reliable iOS testing tool that runs UI tests using multiple simulators on a single machine
Last updated: 2 days ago
AdFullSsl is a tool that automatically detects ads that are not SSL-compliant and fixes them
Last updated: 2 days ago