GitHub · Where software is built

[Community contributions] Model cards
#36979 · stevhliu opened on Mar 25, 2025
155

Labels Milestones New issue

<code>make fixup</code> can't find PLC1802

#39853

· jackzhxng opened

on Aug 1, 2025

Inconsistent Function calling behaviour by Mistral-7B-Instruct-v0.3

#39852

· dvn8weil opened

on Aug 1, 2025

Support topNSigma sampling in <code>generate</code>

Feature request

#39850

· pramodith opened

on Aug 1, 2025

Accelerate seems to default mixed precision to bf16 when passing a DeepSpeed config.

#39849

· alexge233 opened

on Aug 1, 2025

Expected behavior of <code>compute_result</code> is hard to expect and inconsistent

#39842

· MilkClouds opened

on Aug 1, 2025

MistralCommonTokenizer does not match PreTrainedTokenizer

#39841

· Fhrozen opened

on Aug 1, 2025

pack_image_features RuntimeError when vision_feature_select_strategy="full"

#39839

· llnnnnnn opened

on Aug 1, 2025

Crash when running Llama4 on transformers-4.54.1

#39835

· IKACE opened

on Aug 1, 2025

Allow extra outputs from <code>GenerationMixin.generate</code>

Feature request

#39834

· jood-canva opened

on Aug 1, 2025

Flash Attention fails with non aligned position_ids

#39814

· alessiodevoto opened

on Jul 31, 2025

Why <code>lm-head</code> weight still exists with <code>"tie_word_embeddings": true</code>

#39812

· Kelvinlby opened

on Jul 31, 2025

Missing einops dependency causing ModuleNotFoundError

#39811

· iforgetmyname opened

on Jul 31, 2025