Excited to see Orthogonal Finetuning (OFT) and Quantized OFT (QOFT) now merged into LLaMA-Factory! 🎉
OFT & QOFT are memory/time/parameter-efficient and excel at preserving pretraining knowledge. Try them in:
🔗 LLaMA-Factory: github.com/hiyouga/LLaMA-…
🔗 PEFT:…
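For intuition, the core OFT idea can be sketched in a few lines of NumPy (an illustration of the technique only, not the PEFT or LLaMA-Factory API): instead of adding a low-rank delta to a weight, OFT multiplies the frozen weight by a learned orthogonal matrix, here parameterized with the Cayley transform so orthogonality holds by construction.

```python
import numpy as np

def cayley(a: np.ndarray) -> np.ndarray:
    """Cayley transform: map unconstrained params to an orthogonal matrix.

    S = A - A^T is skew-symmetric, and R = (I - S)(I + S)^{-1}
    is orthogonal for any such S, so orthogonality holds by construction.
    """
    s = a - a.T
    i = np.eye(a.shape[0])
    return (i - s) @ np.linalg.inv(i + s)

rng = np.random.default_rng(0)
w_frozen = rng.standard_normal((4, 8))   # pretrained weight (frozen); columns are neurons
a = 0.01 * rng.standard_normal((4, 4))   # trainable params; near zero => R close to I

r = cayley(a)
w_adapted = r @ w_frozen                 # OFT update: rotate neurons, don't translate them

# A rotation preserves inner products between neurons (columns of W),
# which is the intuition behind OFT retaining pretraining knowledge well.
```

QOFT applies the same idea on top of a quantized base model; in both cases only the small matrix `a` is trained, which is where the parameter efficiency comes from.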
Falcon-H1 technical report is now available! The latest open hybrid Transformer–Mamba model family.
The 80+ page report details the key design decisions behind H1, from architectural innovations and data strategies to training recipes that challenge conventional practices in the field.
New tech report out! 🚀
Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training
An expanded version of our ProRL paper — now with more training insights and experimental details.
Read it here 👉 arxiv.org/abs/2507.12507
📢📢📢 Releasing OpenThinker3-1.5B, the top-performing SFT-only model at the 1B scale! 🚀
OpenThinker3-1.5B is a smaller version of our previous 7B model, trained on the same OpenThoughts3-1.2M dataset.
Introducing Easy Dataset
No-code framework for synthesizing fine-tuning data from unstructured documents using LLMs (including local models via Ollama)
Supports OCR, chunking, QA augmentation, and export to LlamaFactory/Unsloth fine-tuning frameworks
huggingface.co/papers/2507.04…
PPO and GRPO — a workflow breakdown of the most popular reinforcement learning algorithms
➡️ Proximal Policy Optimization (PPO): The Stable Learner
It’s used everywhere from dialogue agents to instruction tuning because it balances fast learning with training stability.
▪️ How PPO…
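The two objectives named above can be sketched in a few lines of NumPy (a simplified illustration of the standard formulations, not the exact workflow from the thread):

```python
import numpy as np

def ppo_clip_loss(ratio, advantage, eps=0.2):
    """PPO clipped surrogate objective (negated, so lower is better).

    ratio = pi_new(a|s) / pi_old(a|s). Clipping the ratio to
    [1 - eps, 1 + eps] removes the incentive to move the policy far
    from the old one in a single update -- the "staying safe" part.
    """
    unclipped = ratio * advantage
    clipped = np.clip(ratio, 1.0 - eps, 1.0 + eps) * advantage
    return -np.mean(np.minimum(unclipped, clipped))

def grpo_advantage(rewards):
    """GRPO-style advantage: normalize rewards within a group of
    responses to the same prompt, so no learned value network is needed."""
    return (rewards - rewards.mean()) / (rewards.std() + 1e-8)

ratios = np.array([1.5, 0.5, 1.0])   # hypothetical policy ratios for 3 actions
advs = np.ones(3)
loss = ppo_clip_loss(ratios, advs)   # the 1.5 ratio is clipped to 1.2
```

The `min` of the clipped and unclipped terms is what makes PPO conservative: an overly aggressive ratio earns no extra objective credit beyond the clip boundary.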
Fine-tune Llama-3.1 8B with Llama-Factory on AMD GPUs with this step-by-step guide: bit.ly/4k14ORL
Discover more fine-tuning tutorials on the ROCm AI Developer Hub: bit.ly/4kLQiOQ
DeepSeek 671b and Qwen3 236b support with Megatron backend is now available as preview in verl v0.4.0 🔥🔥🔥
We will continue optimizing MoE model performance down the road.
DeepSeek 671b: verl.readthedocs.io/en/latest/perf…
verl v0.4: github.com/volcengine/ver…
469 Followers · 597 Following · PhD student at @Princeton University, focusing on LLMs. Language Modeling and Pretraining, LLM Reasoning and RL. Prev @UCLA, @Tsinghua_IIIS
106 Followers · 955 Following · Find statistics in Stocks, Blockchain, Tech.
Posting numbers I found while doing deep research for my own investments.
Find the data. Get the signal.
1K Followers · 579 Following · CS PhD Student @Berkeley_AI and @BerkeleySky. Prev. MS @Princeton_NLP, BS @HDSIUCSD and @CogSciUCSD; '25 @SiebelScholars; I work on multimodal models; He/Him.
535 Followers · 2K Following · Craftsman. Build & Sell
👩💻https://t.co/3BQxr054GI
🎵https://t.co/V64qINIM9D
🌍https://t.co/ldnu88Yksc
Run, don't walk; if you don't jump, your suffering will never end
Keep kindness in your heart; everyone is fighting a hard battle in life
115 Followers · 460 Following · Father. Founder and CEO. ASD Advocate. Distincto: AI-First Apps from an AI-First Company, Delivering Big for the Global Neurodiverse Community
11K Followers · 979 Following · Jan is the open-source ChatGPT replacement. We're building Open Superintelligence together. Community: https://t.co/NIyIbR60qQ
356 Followers · 37 Following · Efficient Systems for Foundation Models Workshop, ICML 2025.
Join us if you are interested in the challenges associated with large models training & inference!
19K Followers · 20 Following · A high-throughput and memory-efficient inference and serving engine for LLMs. Join https://t.co/lxJ0SfX5pJ to discuss together with the community!
47K Followers · 106 Following · Advancing AI innovation together. Built with devs, for devs. Supported through an open ecosystem. Powered by AMD.
#TogetherWeAdvance
14K Followers · 3K Following · research @MIT_CSAIL @thinkymachines. work on scalable and principled algorithms in #LLM and #MLSys. in open-sourcing I trust 🐳. she/her/hers
2K Followers · 56 Following · Axolotl is the premier open-source LLM fine-tuning framework. find us on discord https://t.co/wlcE2wlJa9 or email us at [email protected]
1K Followers · 7K Following · AI inference, speculative decoding, open source. Built novel decoding algorithms, now default in Hugging Face Transformers (150+ ⭐). Making AI faster + cheaper
1K Followers · 103 Following · AI/RL researcher, Assistant Prof. at @Tsinghua_Uni, leading the RL lab at @AntResearch_, PhD at @berkeley_ai, frequent flyer and milk tea lover.