/vt/ - Virtual YouTubers » Searching for posts with the image hash ‘iflNv5nCDiJ2cztelETaRQ==’.

Anonymous

View Same Google ImgOps iqdb SauceNAO Screenshot 2024-06-05 0.png, 82KiB, 1662x404

Anonymous Wed 05 Jun 2024 01:06:36 No.77394703 View View Report

Quoted By:

>>77388848
>Do we have estimateion or numbers for how long it will take to train?
We don't know, but my guess is training SD3 "DiT-only" will be faster that training SDXL "unet-only"
Training text encoders will be more difficult. SD3 has 3 text encoders, 2 clips like in SDXL and T5 with 4.7B parameters, which you won't be able to train
But if you ignore T5 and only use clip, it shouldn't be too bad

From their paper:
>we observe limited performance drops when using only the two CLIP-based text-encoders for the text prompts and replacing the T5 embeddings by zeros. Only for complex prompts involving either highly detailed descriptions of a scene or larger amounts of written text do we find significant performance gains when using all three text-encoders. Removing T5 has no effect on aesthetic quality ratings (50% win rate), and only a small impact on prompt adherence (46% win rate), whereas its contribution to the capabilities of generating written text are more significant (38% win rate).

>>77389731
Well, the list was auto-generated, so there's some mistakes.
Mascots shouldn't be here

Capcode	All Only User Posts Only Verified Posts Only Moderator Posts Only Manager Posts Only Admin Posts Only Developer Posts Only Founder Posts
Show Posts	All Only With Images Only Without Images Only Spoiler Images Only Non-Spoiler Images
Deleted Posts	All Only Deleted Posts Only Non-Deleted Posts
Ghost Posts	All Only Ghost Posts Only Non-Ghost Posts
Post Type	All Only Sticky Threads Only Opening Posts Only Reply Posts
Results	All Grouped By Threads
Order	Latest Posts First Oldest Posts First

Searching for posts with the image hash ‘iflNv5nCDiJ2cztelETaRQ==’. 1 results found.

On these archives

Your latest searches