huggingface / transformers Public

Notifications You must be signed in to change notification settings
Fork 33k
Star 160k

Code
Issues 1.1k
Pull requests 1.3k
Actions
Projects
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security and quality
Insights

Pull requests: huggingface/transformers

Labels 139 Milestones 0

New pull request New

1,280 Open 24,727 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Fix local_files_only tokenizer fallback when tokenizer files are missing (Issue 45538)

#45541 opened Apr 21, 2026 by Brianzhengca

Loading…

4 of 7 tasks

Fix cross-attention cache layer type for T5Gemma2 long inputs

#45540 opened Apr 21, 2026 by Beichen-Ma

Loading…

4 of 6 tasks

Revert #45045: changes break modular's purpose

#45539 opened Apr 21, 2026 by Cyrilvallez Member

Loading…

NVFP4 quantization: streaming loader, fused MoE experts (Qwen + Llama…

#45537 opened Apr 20, 2026 by ddreeselogs

Loading…

[Sam3LiteText] Remove unnecessary modules/configs

#45535 opened Apr 20, 2026 by yonigozlan Member

Loading…

ALM base model class

#45534 opened Apr 20, 2026 by eustlb Contributor • Draft

1 of 2 tasks

Fix AMD CI: rebuild torchvision with libjpeg + refresh expectations

#45533 opened Apr 20, 2026 by Abdennacer-Badaoui Member

Loading…

[Model] Add SLANet Model Support

#45532 opened Apr 20, 2026 by zhang-prog Contributor

Loading…

[CB] Changes for long generation

#45530 opened Apr 20, 2026 by remi-or Collaborator • Draft

utils: handle flash_attn missing from importlib packages_distributions without crashing

#45524 opened Apr 20, 2026 by SAY-5

Loading…

Fix Seq2SeqLM ExecuTorch export: add encoder_attention_mask to decoder and use static encoder shapes

#45523 opened Apr 20, 2026 by duyhv-qualgo

Loading…

3 tasks

[Trainer] Add ddp_static_graph option

#45519 opened Apr 20, 2026 by KeitaW

Loading…

4 of 5 tasks

T5Gemma2: fix prepare_decoder_input_ids_from_labels

#45516 opened Apr 19, 2026 by Tokarak

Loading…

2 of 6 tasks

Fix GraniteMoeHybrid _update_mamba_mask crash on attention-only models

#45514 opened Apr 19, 2026 by tianhaocui

Loading…

[Qwen3.5] Fix Qwen3.5 linear attention multi-token cached forward

#45513 opened Apr 19, 2026 by kashif Contributor

Loading…

6 tasks

[OutputRecorder] re.search on layer_name

#45512 opened Apr 19, 2026 by eustlb Contributor

Loading…

cache_utils: fix QuantizedLayer to correctly propagate reorder_cache, crop, and batch ops to quantized buffers

#45510 opened Apr 19, 2026 by GitGlimpse895

Loading…

1 of 6 tasks

Add full GGUF loading support for GPT‑OSS (fixes #43366, supersedes #43757) latest

#45506 opened Apr 18, 2026 by sirzechs66

Loading…

5 of 6 tasks

Add full GGUF loading support for GPT‑OSS (fixes #43366, supersedes #43757) latest

#45500 opened Apr 18, 2026 by sirzechs66 • Draft

5 of 6 tasks

Add V-JEPA 2.1 inference support

#45497 opened Apr 17, 2026 by davevanveen

Loading…

5 of 6 tasks

Fix: propagate quantization_config to text sub-config for composite models in AutoModelForCausalLM

#45494 opened Apr 17, 2026 by lvliang-intel

Loading…

[WIP] Major processing refactor

#45493 opened Apr 17, 2026 by zucchini-nlp Member

Loading…

Add ctsm model

#45490 opened Apr 17, 2026 by kashif Contributor

Loading…

6 tasks

Align gemma3n cache sharing to gemma4

#45489 opened Apr 17, 2026 by Cyrilvallez Member

Loading…

Fix model parallel issue for altclip model and ChineseClip model

#45487 opened Apr 17, 2026 by kaixuanliu Contributor

Loading…

Previous 1 2 3 4 5 … 51 52 Next

Previous Next

ProTip! Exclude everything labeled bug with -label:bug.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!