E2STR

The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer

Last updated: more than 1 year ago

Make-An-Audio-2

A text-conditional diffusion probabilistic model capable of generating high-fidelity audio.

Last updated: more than 1 year ago

GR-1

Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"

Last updated: more than 1 year ago

offline-RL-congestion-control

Last updated: more than 1 year ago

AffineQuant

Official implementation of the ICLR 2024 paper AffineQuant

Last updated: almost 2 years ago

libtpa

Last updated: almost 2 years ago

ByteTransformer

Optimized BERT transformer inference on NVIDIA GPUs. https://arxiv.org/abs/2210.03052

Last updated: almost 2 years ago

VTVQA

Towards Video Text Visual Question Answering: Benchmark and Baseline

Last updated: almost 2 years ago

QSync

Official repository for "QSync: Adaptive Mixed-Precision for Training Synchronization".

Last updated: almost 2 years ago

ImageDream

Code release for https://image-dream.github.io/

Last updated: almost 2 years ago

FreeSeg

Last updated: almost 2 years ago

AvatarVerse

Code repo for the paper "AvatarVerse: High-quality & Stable 3D Avatar Creation from Text and Pose" (AAAI 2024)

Last updated: almost 2 years ago

bitsail

BitSail is a distributed, high-performance data integration engine and provides global data integration solutions in batch, streaming, and incremen...

Last updated: 2 years ago

fc-clip

This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Panoptic Segmentation with Single Frozen Convolutional CLIP

Last updated: 2 years ago

pv3d

Last updated: 2 years ago

DWSF

Code repository for "Practical Deep Dispersed Watermarking with Synchronization"

Last updated: 2 years ago

ColTrack

This repository is an official implementation of Collaborative Tracking Learning for Frame-Rate-Insensitive Multi-Object Tracking

Last updated: 2 years ago

MVDream

Multi-view Diffusion for 3D Generation

Last updated: 2 years ago

MVDream-threestudio

3D generation code for MVDream

Last updated: more than 2 years ago

midi_melody_extraction

Last updated: more than 2 years ago

Achievements
530 Stars
131 Forks

Members (3)
normalcoder (诺墨)
starryc (嘻酱)
dengyiyun (长花天门冬)