>>95312069
You should be using 0.3 if you actually want to use it. But in my opinion, all of the tunes are brain-damaged, because the community is too lazy to actually implement fine-tuning properly. DeepSeek explicitly spells it out:
>For distilled models, we apply only SFT and do not include an RL stage, even though incorporating RL could substantially boost model performance. Our primary goal here is to demonstrate the effectiveness of the distillation technique, leaving the exploration of the RL stage to the broader research community.
But no, slop merges and LoRAs are easier, so most people who have no idea what they are doing would rather pump out shit and hacks than do the right thing that is literally spelled out for them.
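For reference, a minimal sketch of what that missing RL stage could look like on top of a distilled checkpoint, using TRL's GRPOTrainer. The dataset, reward function, and output path here are placeholders, not anything DeepSeek published; a real run would swap in your own prompts and a proper verifier-based reward.

```python
from datasets import load_dataset
from trl import GRPOConfig, GRPOTrainer

# Placeholder dataset: any prompt dataset with a "prompt" column works.
dataset = load_dataset("trl-lib/tldr", split="train")

# Placeholder reward: prefers completions near a target length.
# A real RL stage would score with a verifier (math checker, unit tests, etc.).
def reward_len(completions, **kwargs):
    return [-abs(200 - len(c)) for c in completions]

training_args = GRPOConfig(
    output_dir="r1-distill-grpo",   # hypothetical output path
    per_device_train_batch_size=8,
    num_generations=8,              # completions sampled per prompt (GRPO group size)
)

trainer = GRPOTrainer(
    model="deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B",
    reward_funcs=reward_len,
    args=training_args,
    train_dataset=dataset,
)
trainer.train()
```

Point being: this is a weekend of work with an off-the-shelf trainer, which is exactly why "we left RL to the community" reads as an open invitation rather than an excuse to merge slop.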