🤗Transformers

Topic	Replies	Views	Activity
Seeking Advice on Hysteroscopy Lesion Classification with Transfer Learning 🤗Transformers	0	13	July 20, 2026
Rtx 5090 35b nvfp4 🤗Transformers	3	117	July 18, 2026
Any plans for an INT8 ConvRot version for FireRed Image Edit v1.1? 🤗Transformers	1	86	July 16, 2026
Fine tuning for social media trends 🤗Transformers	3	104	July 13, 2026
Rethinking Transformer Architecture Through the Lens of Group Theory: 🤗Transformers	5	115	July 13, 2026
Contributing Gemma 4 support and ONNX export optimizations 🤗Transformers	1	85	July 7, 2026
How should I balance learning rate and data sampling during CPT on multiple datasets? 🤗Transformers	1	44	July 7, 2026
[Research] From Functional Geometry to Dynamic Grammar: New LIMEN Audits (V23–V24) Across 7 Architectures 🤗Transformers	2	48	July 2, 2026
A comprehensive, bilingual guide to Transformers: From foundations to KV-cache compression & attention dynamics 🤗Transformers	0	25	June 29, 2026
Error fix of the 503 loop 🤗Transformers	1	79	June 25, 2026
Deprecated parameters of pipeline() included in the course 🤗Transformers	0	31	June 12, 2026
A note on interpreting internal dynamics: Stability vs. Semantic Correctness in Transformers 🤗Transformers	0	33	June 2, 2026
How can LLMs be fine-tuned for specialized domain knowledge? 🤗Transformers	3	1609	May 29, 2026
Need generative model, high-quality description generation 🤗Transformers	3	115	May 28, 2026
SFTTrainerflags blocks assistant_only_loss=True 🤗Transformers	3	156	May 26, 2026
Date format for tine-tuning AI models 🤗Transformers	5	120	May 22, 2026
Chatbot Start Prompt for GPT-J 🤗Transformers	5	1405	May 21, 2026
Automatic -100 masking of the questions in Labels 🤗Transformers	1	51	May 21, 2026
PTQ INT8 via TFLiteConverter — encoder-decoder seq2seq model loses encoder context entirely after conversion 🤗Transformers	3	120	May 16, 2026
Fucking hugging face changed the zerogpu 🤗Transformers	0	36	May 14, 2026
Train a fully open SmolLM4-750M model 🤗Transformers	0	251	May 11, 2026
The BPE pre-tokenizer was not recognized! 🤗Transformers	6	364	May 7, 2026
Custom batches in sentence-transformers for MultipleNegativesRankingLoss 🤗Transformers	3	152	May 1, 2026
I developed an experimental Graph-Native Artificial Brain engine 🤗Transformers	4	88	May 1, 2026
When i use tool its pause and restart space not working why DeepSpeed	0	22	April 30, 2026
CPU offloading error scenario 🤗Transformers	11	412	April 27, 2026
Gemma 3 12B: 4-bit Quantization failing/ignored in Transformers v5.1.0 (Gemma3ForConditionalGeneration) 🤗Transformers	9	516	April 24, 2026
Why am I facing this Error while running this code 🤗Transformers	1	126	April 23, 2026
What are the best tutorials to learn Transformers step by step? 🤗Transformers	2	175	April 20, 2026
LLM Course code errors 🤗Transformers	8	396	April 17, 2026