-
Notifications
You must be signed in to change notification settings - Fork 26.7k
Issues: huggingface/transformers
New issue
Have a question about this project?Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to ourterms of serviceand privacy statement.We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Llama model not generating mentioned "max_tokens" number of tokens as output
bug
#34296
openedOct 21, 2024 by
vineel96
2 of 4 tasks
SiglipVisionEmbeddings doesn't cast pixel_values like CLIPVisionEmbeddings does
bug
#34294
openedOct 21, 2024 by
fpgaminer
4 tasks
DinoV2 is incorrectly documented as a default patch size of 16 instead of 14
bug
#34292
openedOct 21, 2024 by
OFSkean
1 of 4 tasks
LLaVa with multiple image input throws error: Image features and image tokens do not match
bug
Multimodal
Vision
#34284
openedOct 21, 2024 by
Shruthi42
4 tasks
cross attention mask is always zeros in mllama
bug
Multimodal
Vision
#34280
openedOct 21, 2024 by
xgal
1 of 4 tasks
[Trainer][Eval] Why the model output for the first element in eval batch is skipped in logits?
trainer
#34278
openedOct 21, 2024 by
konradkalita
bitnet support
Feature request
Request for a new feature
Quantization
#34277
openedOct 21, 2024 by
Darshan2104
Limit number of parametes logged withRequest for a new feature
trainer
MLflowCallback
Feature request
#34276
openedOct 21, 2024 by
cecheta
GenerationConfig parameters are deleted from the website on 4.52.2 and main
bug
Documentation
#34273
openedOct 21, 2024 by
eyalmazuz
4 tasks
image_transforms preprocess quite slow when run large image with qwen2vl
bug
Performance
Vision
#34272
openedOct 21, 2024 by
zhjunqin
4 tasks
Access to model outputs inside LogitProcessor
Feature request
Request for a new feature
Generation
#34265
openedOct 20, 2024 by
AdityaMayukhSom
T5 models fail when loaded withInternals of the library; Models.
Usage
General questions about the library
torch_dtype=torch.half
bug
Core: Modeling
#34264
openedOct 19, 2024 by
Rohan138
2 of 4 tasks
Mixtral manualRequest for a new feature
Usage
General questions about the library
head_dim
Feature request
#34261
openedOct 19, 2024 by
wavy-jung
Add support for Janus model from DeepSeek AI
New model
#34249
openedOct 18, 2024 by
ighoshsubho
2 tasks done
RuntimeError: Failed to import transformers.pipelines because of the following error (look up to see its traceback): dlopen: cannot load any more object with static TLS
bug
Installation
Usage
General questions about the library
#34243
openedOct 18, 2024 by
liuzhao104
4 tasks
Add DDP token averaging for equivalent non-parallel training similar to #34191
Discussion
Discussion on a topic (keep it focused or open a new issue though)
Feature request
Request for a new feature
#34242
openedOct 18, 2024 by
sbwww
How to output token by token use transformers?
bug
Discussion
Discussion on a topic (keep it focused or open a new issue though)
#34241
openedOct 18, 2024 by
xuanzhangyang
4 tasks
Trainer.hyperparameter_search
kwargs
parameter has an inexact definition if using Optuna
Documentation
#34239
openedOct 18, 2024 by
GuillemGSubies
GGUF support for BERT architecture
Feature request
Request for a new feature
#34238
openedOct 18, 2024 by
Dimmension
LlamaRotaryEmbedding
inv_freq
buffer is left uninitialized byinit_empty_weights
+load_checkpoint_and_dispatch
bug
#34234
openedOct 18, 2024 by
ringohoffman
3 of 4 tasks
Add support for HuBERT batch norm instead of weight norm in pos_conv_emb
Feature request
Request for a new feature
#34229
openedOct 17, 2024 by
gallilmaimon
PreviousNext
ProTip!
Typegion any issue or pull request to go back to the issue listing page.