docs: Readme Updated for optimized Usage with transformers library

#60

by sayed99 - opened Nov 2, 2025

base: refs/heads/main ← from: refs/pr/60


+99

-16

docs: Readme Updated for optimized Usage with transformers library (commit 1787ca52)

sayed99

Nov 2, 2025

Updated the Python code for transformers usage to use flash-attn as the attention implementation, which boosts performance and reduces memory usage.

xiaohei66

PaddlePaddle org Nov 6, 2025

@sayed99 Great work on this, and thank you for your contribution!

To ensure a smooth out-of-the-box experience for all users, we think it’s better to make flash-attn optional instead of default. To save you some time, I’ll go ahead and push a small commit to this PR to make that change.

Also, I suggest removing these two steps. They don’t seem necessary for a code example and removing them would simplify it.

from google.colab import files
...
# 2- Upload image (drag & drop any PNG/JPG)
...
# 3. Resize max-2048 preserving aspect ratio
...

Thanks again for the excellent work!
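
For readers following the thread, here is a minimal sketch of what "optional instead of default" could look like in a loading snippet. It is not the commit that was pushed to this PR. The helper name `pick_attn_implementation` is hypothetical, and the fallback to `"sdpa"` is an assumption; `"flash_attention_2"` and `"sdpa"` are values accepted by the `attn_implementation` argument of `from_pretrained` in transformers.

```python
import importlib.util


def pick_attn_implementation(prefer_flash: bool = True) -> str:
    """Choose a value for the `attn_implementation` argument of `from_pretrained`.

    Hypothetical helper: returns "flash_attention_2" only when the optional
    `flash_attn` package is importable, otherwise falls back to PyTorch SDPA.
    """
    # find_spec returns None when the top-level package is not installed.
    has_flash = importlib.util.find_spec("flash_attn") is not None
    if prefer_flash and has_flash:
        return "flash_attention_2"
    return "sdpa"


if __name__ == "__main__":
    print(pick_attn_implementation())
    # Assumed usage; the model id is a placeholder, not taken from this thread:
    # from transformers import AutoModelForCausalLM
    # model = AutoModelForCausalLM.from_pretrained(
    #     "<model-id>", attn_implementation=pick_attn_implementation()
    # )
```

With this shape the example still runs when flash-attn is not installed, and users who have it get the faster path without editing the snippet. Note that an importable package does not guarantee a supported GPU or dtype, so a real example might need further checks.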

sayed99

Nov 6, 2025

@xiaohei66
Thank you for the suggestions and for helping improve the PR! I agree that making flash-attn optional and removing the extra Colab steps will simplify the example and make it more user-friendly. I appreciate your help in pushing the small commit, and I look forward to reviewing the changes.

Nov 7, 2025

@xiaohei66
Hello, thanks for your efforts.
Will this PR be merged into the main model card soon?

ChengCui changed pull request status to merged Nov 11, 2025
