rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Last updated: 6 days ago

xlnet

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Last updated: 6 days ago

hmtl

🌊 HMTL: Hierarchical Multi-Task Learning - A State-of-the-Art neural network model for several NLP tasks based on PyTorch and AllenNLP

Last updated: 6 days ago

pipeline-rl-cmu

Last updated: 6 days ago

PipelineRL

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Last updated: 6 days ago

DeepEP

DeepEP: an efficient expert-parallel communication library

Last updated: 6 days ago

finepdfs

Codebase for FinePDFs

Last updated: 6 days ago

hf-mcp-server

Hugging Face MCP Server

Last updated: 6 days ago

jupyter-agent

Training LLMs to reason and analyze data with notebooks

Last updated: 6 days ago

boomtitan

Fork of torchtitan for the boom project

Last updated: 6 days ago

torchtitan

A PyTorch native platform for training generative AI models

Last updated: 6 days ago

tau2-bench

τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment

Last updated: 6 days ago

lora-fast

Minimal repository to demonstrate fast LoRA inference with Flux family of models.

Last updated: 6 days ago

finetrainers

Scalable and memory-optimized training of diffusion models

Last updated: 6 days ago

responses.js

A lightweight express.js server implementing OpenAI’s Responses API, built on top of Chat Completions, powered by Hugging Face Inference Providers.

Last updated: 6 days ago

arena-hard-auto

Arena-Hard-Auto: An automatic LLM benchmark.

Last updated: 6 days ago

flux-fast

Making Flux go brrr on GPUs.

Last updated: 6 days ago

vllm-dca

A high-throughput and memory-efficient inference and serving engine for LLMs

Last updated: 6 days ago

hf-nix

Last updated: 6 days ago

prime

prime is a framework for efficient, globally distributed training of AI models over the internet.

Last updated: 6 days ago