Skip to content
View WenmuZhou's full-sized avatar

Block or report WenmuZhou

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more aboutblocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more aboutreporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results
Python 6 1 UpdatedJul 14, 2024

【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling

85 3 UpdatedOct 18, 2024

[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

Python 304 30 UpdatedSep 24, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 5,138 345 UpdatedOct 22, 2024

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

Python 781 37 UpdatedJun 27, 2024

Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

Python 1,168 71 UpdatedJul 16, 2024

An unofficial implementation for "CoSeR: Bridging Image and Language for Cognitive Super-Resolution (CVPR 2024)"

Python 48 5 UpdatedAug 9, 2024

Tạp chứng hòa văn đương kiểm trắc hòa kiểu chính

Python 24 4 UpdatedSep 18, 2024

Tương paddleocr

Python 8 UpdatedJul 28, 2024

A curated list of papers, code and resources pertaining to image harmonization.

424 28 UpdatedJul 23, 2024

Tòng linh học tập, chế tác đổng nhân tình thế cố đích đại ngữ ngôn mô hình

Python 780 56 UpdatedOct 18, 2024
Jupyter Notebook 23 1 UpdatedOct 6, 2024

Cơ vu Yolov5 xa bài kiểm trắc, canh khoái canh chuẩn.

Python 1,201 400 UpdatedFeb 7, 2022
Python 42 3 UpdatedAug 29, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,260 52 UpdatedAug 15, 2024

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,067 141 UpdatedSep 3, 2024

Official pytorch repository for “Guidance with Spherical Gaussian Constraint for Conditional Diffusion”

Python 42 2 UpdatedJul 17, 2024

Code for DesignEdit

Python 305 22 UpdatedJul 21, 2024

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Python 1,391 115 UpdatedJul 17, 2024

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Python 3,366 290 UpdatedOct 11, 2024

[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model…

Python 617 40 UpdatedSep 8, 2024

[ICCV 2023] MI-GAN: A Simple Baseline for Image Inpainting on Mobile Devices

Python 469 44 UpdatedJan 23, 2024

Diffusion Model-Based Image Editing: A Survey (arXiv)

438 29 UpdatedAug 19, 2024

CAMixerSR: Only Details Need More “Attention” (CVPR 2024)

Python 217 11 UpdatedJun 4, 2024

[ECCV2024] IDM-VTON: Improving Diffusion Models for Authentic Virtual Try-on in the Wild

Python 3,792 595 UpdatedJul 30, 2024

Character Animation (AnimateAnyone, Face Reenactment)

Python 3,140 245 UpdatedMay 31, 2024

Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning

Python 9 1 UpdatedMar 6, 2024

Efficient vision foundation models for high-resolution generation and perception.

Python 2,031 175 UpdatedOct 22, 2024

Tại A cổ ( cổ phiếu ) thị tràng thượng huấn luyện cường hóa học tập giao dịch trí năng thể

Jupyter Notebook 196 83 UpdatedMar 27, 2024
Next