huggingface / transformers Public

Notifications You must be signed in to change notification settings
Fork 26.7k
Star 134k

Code
Issues 1k
Pull requests 431
Actions
Projects 1
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Issues: huggingface/transformers

[Modular Transformers] Request for comments

#33916 openedOct 3, 2024by LysandreJik

Open

Labels 123 Milestones 0

New issue

Have a question about this project?Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to ourterms of serviceand privacy statement.We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

1,002 Open 14,954 Closed

Author

Filter by author

Label

Filter by label

Usealt+click/returnto exclude labels

or⇧+click/returnfor logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

Llama model not generating mentioned "max_tokens" number of tokens as output bug

#34296 openedOct 21, 2024by vineel96

2 of 4 tasks

SiglipVisionEmbeddings doesn't cast pixel_values like CLIPVisionEmbeddings does bug

#34294 openedOct 21, 2024by fpgaminer

4 tasks

DinoV2 is incorrectly documented as a default patch size of 16 instead of 14 bug

#34292 openedOct 21, 2024by OFSkean

1 of 4 tasks

LLaVa with multiple image input throws error: Image features and image tokens do not match bug Multimodal Vision

#34284 openedOct 21, 2024by Shruthi42

4 tasks

cross attention mask is always zeros in mllama bug Multimodal Vision

#34280 openedOct 21, 2024by xgal

1 of 4 tasks

[Trainer][Eval] Why the model output for the first element in eval batch is skipped in logits? trainer

#34278 openedOct 21, 2024by konradkalita

bitnet support Feature request

Request for a new feature

Quantization

#34277 openedOct 21, 2024by Darshan2104

Limit number of parametes logged withMLflowCallback Feature request

Request for a new feature

trainer

#34276 openedOct 21, 2024by cecheta

GenerationConfig parameters are deleted from the website on 4.52.2 and main bug Documentation

#34273 openedOct 21, 2024by eyalmazuz

4 tasks

image_transforms preprocess quite slow when run large image with qwen2vl bug Performance Vision

#34272 openedOct 21, 2024by zhjunqin

4 tasks

Access to model outputs inside LogitProcessor Feature request

Request for a new feature

Generation

#34265 openedOct 20, 2024by AdityaMayukhSom

T5 models fail when loaded withtorch_dtype=torch.half bug Core: Modeling

Internals of the library; Models.

Usage

General questions about the library

#34264 openedOct 19, 2024by Rohan138

2 of 4 tasks

New GA fix causes training loss multiple times higher across the board (5x to 10x higher) bug trainer

#34263 openedOct 19, 2024by JianbangZ

4 tasks

Finetuned LLAMA Model is working same old pretrained model after combining LORA weights with old model lora PEFT Quantization

#34262 openedOct 19, 2024by abhi201002

Mixtral manualhead_dim Feature request

Request for a new feature

Usage

General questions about the library

#34261 openedOct 19, 2024by wavy-jung

Add support for Janus model from DeepSeek AI New model

#34249 openedOct 18, 2024by ighoshsubho

2 tasks done

RuntimeError: Failed to import transformers.pipelines because of the following error (look up to see its traceback): dlopen: cannot load any more object with static TLS bug Installation Usage

General questions about the library

#34243 openedOct 18, 2024by liuzhao104

4 tasks

Add DDP token averaging for equivalent non-parallel training similar to #34191 Discussion

Discussion on a topic (keep it focused or open a new issue though)

Feature request

Request for a new feature

#34242 openedOct 18, 2024by sbwww

How to output token by token use transformers? bug Discussion

Discussion on a topic (keep it focused or open a new issue though)

#34241 openedOct 18, 2024by xuanzhangyang

4 tasks

Trainer.hyperparameter_searchkwargsparameter has an inexact definition if using Optuna Documentation

#34239 openedOct 18, 2024by GuillemGSubies

GGUF support for BERT architecture Feature request

Request for a new feature

#34238 openedOct 18, 2024by Dimmension

LlamaRotaryEmbeddinginv_freqbuffer is left uninitialized byinit_empty_weights+load_checkpoint_and_dispatch bug

#34234 openedOct 18, 2024by ringohoffman

3 of 4 tasks

cache wrong code bug

#34232 openedOct 18, 2024by mdy666

4 tasks

Add support for HuBERT batch norm instead of weight norm in pos_conv_emb Feature request

Request for a new feature

#34229 openedOct 17, 2024by gallilmaimon

ISSUES in #TRANSLATING.md

#34227 openedOct 17, 2024by anshumangahlot

Previous12 3 4 5…40 41 Next

PreviousNext

ProTip! Typegion any issue or pull request to go back to the issue listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly