Issues: huggingface/transformers
LayerDrop broken in various Flax models (Whisper/BART/more...) · bug · #35468, opened Dec 31, 2024 by sssshhhhhh
Support SDPA & Flash Attention 2 for LayoutLMv3 · Feature request · #35467, opened Dec 31, 2024 by stancld
"Is it possible for Hugging Face to implement a chat model for quick information retrieval similar to vLLM?" · Feature request · #35464, opened Dec 31, 2024 by BeastyZ
Qwen2-VL used to work with inputs_embeds instead of input_ids, but no more · bug · #35463, opened Dec 31, 2024 by minostauros
How can I disable legacy processing in llava-next · bug · #35457, opened Dec 30, 2024 by foreverpiano
Installation Error for transformers Package (🔥 maturin failed) · bug · #35454, opened Dec 29, 2024 by SauceChord
[Feature Request] Add beam search text streaming visualization feature · Feature request · #35451, opened Dec 29, 2024 by MosheOfer1
Support Constant Learning Rate with Cooldown · Feature request · #35449, opened Dec 29, 2024 by LoserCheems
Tokenizer does not split text according to newly added input tokens · bug, Core: Tokenization · #35447, opened Dec 29, 2024 by jiongjiongli
Should tokenizer be replaced with processing_class in Seq2SeqTrainer? · bug, Core: Tokenization · #35446, opened Dec 29, 2024 by zzaebok
Allow static cache to be larger than sequence length / batch size for encoder-decoder models · Feature request · #35444, opened Dec 29, 2024 by cptspacemanspiff
Compatibility Issue with Python 3.13 · bug, dependencies · #35443, opened Dec 29, 2024 by pocerberus
Missing weights are not properly initialized when using model.from_pretrained() · bug, Core: Modeling · #35437, opened Dec 27, 2024 by YifanXu74
Memory leak on Python 3.10.* · bug, dependencies · #35434, opened Dec 27, 2024 by KhoiTrant68
tokenizers.apply_chat_template with continue_final_message=True with trailing spaces in input · bug, Chat Template · #35433, opened Dec 27, 2024 by chuyishang
apply class transformers.SequenceBiasLogitsProcessor on Qwen model · Feature request, Generation · #35432, opened Dec 27, 2024 by buptspig
GPT2Attention() class with _attn() method when add_cross_attention=True and therefore is_cross_attention=True · bug, Feature request · #35430, opened Dec 27, 2024 by CHLEE-Leo
Cannot customize warmup_min_lr of the DeepSpeed lr scheduler · bug, DeepSpeed · #35428, opened Dec 27, 2024 by SeunghyunSEO
model.config.to_diff_dict() delivers a different result from model.save_pretrained() · bug, Core: Modeling · #35426, opened Dec 27, 2024 by umarbutler